Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add language input option / detected language in output / speaker identification #22

Merged
merged 11 commits into from
Dec 11, 2024

Conversation

Jeronymous
Copy link
Member

@Jeronymous Jeronymous commented Nov 30, 2024

Note: this requires STT workers based on the new API supporting language (pull request linto-ai/linto-stt#48 -- issue linto-ai/linto-stt#53)

Documentation: search for "language" / "speaker identification" in the new README.md.

@Jeronymous
Copy link
Member Author

In the API:

  • the env variable LANGUAGE set the language for text normalization ("*" means no text normalization)
  • (new) the configuration parameter language set the target language for Whisper STT ("*" means automatic language detection)
  • (new) the results include a key "language" in all the text segments with the detected language (which is the language passed as an option, if it's not "*")

Corner case : what happens when the parameter language is not specified in the transcription config and the variable env LANGUAGE is set to something (ex: "fr-FR") ?
Currently, it passes nothing (language=None) to the STT worker, so the behaviour depends on the LANGUAGE variable env of the Whisper STT worker.
I think it's correct, but some users might think that the varianve env LANGUAGE of the transcription service is passed in that case. I'll try to clarify in the doc.

@Jeronymous Jeronymous linked an issue Nov 30, 2024 that may be closed by this pull request
…rker (speakerIndentification field of the transcriptionConfig)
@Jeronymous Jeronymous changed the title Add language input option / detected language in output Add language input option / detected language in output / speaker identification Dec 6, 2024
@Jeronymous Jeronymous merged commit e017bcf into next Dec 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add language selection
1 participant