New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changelog #2
Comments
Initial models, examples, utils for VAD only uploaded (no number detector or language classifier yet) |
First readable public release |
Added VAD latency and throughput metrics |
Added number detector |
Language detector example, readme update + FAQ |
Audiotok benchmarks added |
Added a utility to tune the VAD params properly for a domain |
Some final benchmarks posted here - pyannote/pyannote-audio#604 (comment) |
Added micro (10k params, 100x smaller) VAD models |
Added micro (10k params, 100x smaller) VAD models for 8 kHz audio |
|
|
|
improved language classifier
|
updated further reading section |
New V3 Silero VAD is Already HereMain changes
MigrationPlease see the new examples. New
New
|
Even Better V3 Silero VAD
|
New V3 ONNX VAD ReleasedWe finally were able to port a model to ONNX:
|
Support For Sampling Rates Higher Than 16 kHz
|
|
New V4 VAD ReleasedChanges:
|
|
|
Just a handy issue to be notified of latest changes and micro-releases (we will mostly changing the models)
The text was updated successfully, but these errors were encountered: