Ships·3개월 전

Google DeepMind, Gemini 3.1 Flash TTS 공개 — 오디오 태그로 음성 스타일 제어, 70+ 언어 지원

Google DeepMind가 Gemini 3.1 Flash TTS를 출시했다. 이 모델은 자연어 명령으로 발화 스타일과 속도를 조절할 수 있는 오디오 태그를 도입했으며, 70개 이상의 언어를 지원한다. 모든 생성 오디오에는 SynthID 워터마크가 적용되어 출처 식별이 가능하다. Google AI Studio, Vertex AI, Google Vids에서 사용할 수 있다.

#google-deepmind
#gemini-3.1-flash-tts
#audio-generation
#synthid
#multilingual

Google DeepMind

원문 보기 →

Google DeepMind, Gemini 3.1 Flash TTS 공개 — 오디오 태그로 음성 스타일 제어, 70+ 언어 지원

Comments