neo.dzh
e7f86a751e
add audio codec and codec-based TTS model
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/15128959
2023-12-26 20:54:59 +08:00
bin.xue
1d2b7e8634
[to #42322933 ] separate audio requirements
2023-02-07 02:52:01 +00:00
wucong.lyb
e95a32deda
add args for asr_infer_pipeline, punc_pipeline, sv_pipeline & modify funasr version
...
add args for asr_infer_pipeline, punc_pipeline, sv_pipeline
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11547617
* modify pipeline args
* fix output_dir
* fix sv ValueError
* fix outputs
* code style
* add args for asr_infer_pipeline, punc_pipeline, sv_pipeline & modify funasr version
* fix kwargs and add param_dict for asr_inference_pipeline
* modify code comments
2023-02-06 14:53:42 +00:00
jiangyu.xzy
742ad4b355
sv inference & asr trainer: add new inputs
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11424993
* add sv_inference for speaker embedding extraction
* add raw inputs support for speaker_verification
* add train meta_params
2023-01-13 13:52:52 +00:00
bin.xue
78f812dbb6
[to #42322933 ] add speech separation finetune
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11379892
2023-01-12 07:02:46 +08:00
jiangyu.xzy
3989b8da32
asr inference: support new models, punctuation, vad, sv
...
asr推理支持新模型,以及支持标点后处理、端点检测、说话人确认
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11366442
* support asr new models & vad-punc models
* fix conflict
* fix format
* Merge branch 'master' into asr/asr_inference
* add punc pipeline
* add punc model
* add sv infer
* fix format
* Merge branch 'master' into asr/asr_inference
* fix sv pipeline
* remove useless comments
* fix asr_test
* fix format
* fix output format
* change test level of models which would be removed later.
2023-01-11 23:28:14 +08:00
jiaqi.sjq
453ff1dae3
[to #42322933 ] support byte input feature and refine fp implementations
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11338137
2023-01-09 20:56:52 +08:00
ni.chongjia
ac53ce3e36
modify format of itn_pipeline
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11257394
* dev for asr itn inference pipeline
* add task interface
* add pipeline input
* add modemodelscope/pipelines/audio/itn_inference_pipeline.py
* add modelscope/pipelines/audio/itn_inference_pipeline.py
* modelscope/pipelines/audio/itn_inference_pipeline.py
* update modelscope/pipelines/audio/itn_inference_pipeline.py
* modify itn_inference_pipeline.py
* modify itn_inference_pipeline.py
* modify itn_inference_pipeline.py
* remove itn.py
* modify some names
* add modify itn_inference_pipeline.py
* modify itn_inference_pipeline.py
* modify itn_inference_pipeline.py
* modify itn_inference_pipeline.py
* modify itn
* add tests/pipelines/test_inverse_text_processing.py
* modify asr_inference_pipeline.py for the original files
* modify format
* add commits files
* Merge remote-tracking branch 'origin' into remotes/origin/asr/itn_nichongjia
* Merge remote-tracking branch 'origin' into remotes/origin/asr/itn_nichongjia
* modify the pipelines
* Merge branch 'master' into remotes/origin/asr/itn_nichongjia
* [to #47031187 ]fix: hub test suites can not parallel
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11276872
* [to #47031187 ]fix: hub test suites can not parallel
* google style docs and selected file generator
ref: https://yuque.alibaba-inc.com/pai/rwqgvl/go8sc8tqzeqqfmsz
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11150212
* google style docs and selected file generator
* merge
* Merge remote-tracking branch 'origin' into remotes/origin/asr/itn_nichongjia
* Merge branch 'master' into remotes/origin/asr/itn_nichongjia
* add requirements for fun_text_processing
2023-01-05 16:36:17 +08:00
bin.xue
0fdf37312f
[to #42322933 ] feat:add speech separation pipeline
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11255740
2023-01-03 13:18:44 +08:00
jiaqi.sjq
8896087034
[to #42322933 ] support kantts infer and finetune
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11111331#tab=detail
2022-12-20 10:45:34 +08:00
wenmeng.zwm
e2bf864f63
update audio requirements to use funasr>=0.1.4
2022-12-07 11:37:38 +08:00
jiangyu.xzy
9bfc77c178
support asr new models
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10919277
* support new asr paraformer model
* support asr conformer model
2022-11-30 17:08:35 +08:00
jiangyu.xzy
2b62084146
add funasr based asr inference
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10868583
2022-11-25 17:49:24 +08:00
Yingda Chen
deb847614a
[to #42322933 ] limit espnet version
2022-10-14 21:59:52 +08:00
bin.xue
3863efc14d
[to #42322933 ] add far field KWS trainer
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10275823
2022-10-13 10:15:33 +08:00
jiaqi.sjq
e90ff9e479
[to #42322933 ] tts sambert am changs from tensorfow to PyTorch and add licenses
...
* [to #41669377 ] docs and tools refinement and release
1. add build_doc linter script
2. add sphinx-docs support
3. add development doc and api doc
4. change version to 0.1.0 for the first internal release version
Link: https://code.aone.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8775307
2022-09-27 22:09:30 +08:00
bin.xue
d0933a2374
[to #42322933 ] add far field kws model pipeline
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9767151
2022-08-16 20:23:55 +08:00
shichen.fsc
c663dd8cf6
[to #42322933 ] add pcm-bytes supported for KWS
...
kws增加pcm bytes数据类型的支持
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9635439
2022-08-04 11:49:26 +08:00
wenmeng.zwm
34acc596e1
[to #43115513 ] fix module path error for ast and add numpy<=1.18
...
1. fix module path error, if code path contains multiple `modelscope` str, use the last one as the start position of modelscope source direcotry
2. add numpy version constraint <=1.18
3. add __init__.py to models/cv/image_to_image_translation
4. split audio requirements from all
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9587929
2022-08-01 15:42:58 +08:00
bin.xue
e3bffedb87
[to #42322933 ] aec pipeline修改C++库依赖到MinDAEC
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9563105
* use MinDAEC instead of cdll
* feat: ANS pipeline can accept bytes as input and adjust processing order to reduce the amount of computation
2022-07-28 22:59:57 +08:00
wenmeng.zwm
590d531484
[to #43115513 ] requirements refine and preparation for v0.3 release
...
* remove tensorflow numpy from audio requirements
* add audio requirements to all
* auto set model to eval model for pipeline
* add audio requirement check hint for easyasr and kwsbp
* fix docs build error
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9561021
2022-07-28 22:04:18 +08:00
wenmeng.zwm
d55525bfb6
[to #43112771 ] requirements check and lazy import support
2022-07-27 17:29:16 +08:00
shichen.fsc
5ac448f5c7
[to #42322933 ] simplify asr inference code, and remove disk-write behavior
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9410174
2022-07-27 14:49:24 +08:00
mulin.lyh
fc90bf0d1a
[to #43554786 ]fix: test error is not detected in gate test, protobuf version to (3, 3.21.0) for tensorflow
...
限制protobuf版本,修复单元测试有error返回值为0问题
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9510263
* fix test error is not detected in gate test, protobuf version to (3, 3.21.0)
2022-07-26 10:57:16 +08:00
shichen.fsc
b3b950e616
[to #42322933 ] simplify kws code, and remove disk-write behavior
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9491681
2022-07-25 22:37:15 +08:00
wenmeng.zwm
e62cd756df
[to #42322933 ] relax requirements
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9407594
2022-07-18 17:52:14 +08:00
jiaqi.sjq
a17f29ce54
[to #42322933 ] Update tts task inputs
...
Refactor tts task inputs
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9412937
2022-07-18 17:50:59 +08:00
shichen.fsc
d7c780069f
[to #42322933 ] add asr inference with pytorch(espnet framework)
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9273537
2022-07-11 16:48:47 +08:00
jiaqi.sjq
d313c440c4
[to #9303837 ] Merge frontend am and vocoder into one model card
...
Merge frontend, am and vocoder model card into one model card.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9303837
2022-07-08 14:26:18 +08:00
wenmeng.zwm
274cf6ffa9
[to #42362425 ] fix audio_requirement and refine quickstart, changelog doc
...
* make audio requirements optional
* add changelog for version v0.2
* add numpy constraint for compatibility with tensorflow1.15
* update faq
* fix nlp requiring tensorflow
* add torchvision to multimodal dependency
* bump version from 0.2.1 to 0.2.2
* add warning msg when tensorflow is not installed
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9268278
2022-07-05 21:44:33 +08:00
wenmeng.zwm
8e51a073a6
[to #42966122 ] requirements enchanment and self-host repo support
...
* add self-hosted repo:
* add extra requirements for different field and reduce necessary requirements
* update docker file with so required by audio
* add requirements checker which will be used later when implement lazy import
* remove repeated requirements and replace opencv-python-headless with opencv-python
example usage:
```shell
pip install model_scope[all] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
pip install model_scope[cv] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
pip install model_scope[nlp] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
pip install model_scope[audio] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
pip install model_scope[multi-modal] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
```
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9211383
2022-07-01 16:38:06 +08:00
bin.xue
04b7eba285
[to #42322933 ] Merge ANS pipeline into master
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9178339
* refactor: move aec models to audio/aec
* refactor: move aec models to audio/aec
* refactor: move aec models to audio/aec
* refactor: move aec models to audio/aec
* feat: add unittest for ANS pipeline
* Merge branch 'master' into dev/ans
* add new SoundFile to audio dependency
* Merge branch 'master' into dev/ans
* use ANS pipeline name from metainfo
* Merge branch 'master' into dev/ans
* chore: update docstring of ANS module
* Merge branch 'master' into dev/ans
* refactor: use names from metainfo
* refactor: enable ans unittest
* refactor: add more log message in unittest
2022-06-28 14:41:08 +08:00
jiaqi.sjq
4a3f22259f
* Relax version requirements in audio.txt
...
* Fix bugs in ttsfrd which may cause text-to-speech break disappear and upgrade it to version 0.0.2
* other fix to make ut pass
2022-06-24 12:05:01 +08:00
wenmeng.zwm
e288cf076e
[to #42362853 ] refactor pipeline and standardize module_name
...
* using get_model to validate hub path
* support reading pipeline info from configuration file
* add metainfo const
* update model type and pipeline type and fix UT
* relax requimrent for protobuf
* skip two dataset tests due to temporal failure
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9118154
2022-06-22 14:15:32 +08:00
jiaqi.sjq
b1490bfd7f
[to #9061073 ] feat: merge tts to master
...
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9061073
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9061073
* [to #41669377 ] docs and tools refinement and release
1. add build_doc linter script
2. add sphinx-docs support
3. add development doc and api doc
4. change version to 0.1.0 for the first internal release version
Link: https://code.aone.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8775307
* [to #41669377 ] add pipeline tutorial and fix bugs
1. add pipleine tutorial
2. fix bugs when using pipeline with certain model and preprocessor
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8814301
* refine doc
* refine doc
* merge remote release/0.1 and fix conflict
* Merge branch 'release/0.1' into 'nls/tts'
Release/0.1
See merge request !1700968
* [Add] add tts preprocessor without requirements. finish requirements build later
* [Add] add requirements and frd submodule
* [Fix] remove models submodule
* [Add] add am module
* [Update] update am and vocoder
* [Update] remove submodule
* [Update] add models
* [Fix] fix init error
* [Fix] fix bugs with tts pipeline
* merge master
* [Update] merge from master
* remove frd subdmoule and using wheel from oss
* change scripts
* [Fix] fix bugs in am and vocoder
* [Merge] merge from master
* Merge branch 'master' into nls/tts
* [Fix] fix bugs
* [Fix] fix pep8
* Merge branch 'master' into nls/tts
* [Update] remove hparams and import configuration from kwargs
* Merge branch 'master' into nls/tts
* upgrade tf113 to tf115
* Merge branch 'nls/tts' of gitlab.alibaba-inc.com:Ali-MaaS/MaaS-lib into nls/tts
* add multiple versions of ttsfrd
* merge master
* [Fix] fix cr comments
* Merge branch 'master' into nls/tts
* [Fix] fix cr comments 0617
* Merge branch 'master' into nls/tts
* [Fix] remove comment out codes
* [Merge] merge from master
* [Fix] fix crash for incompatible tf and pytorch version, and frd using zip file resource
* Merge branch 'master' into nls/tts
* [Add] add cuda support
2022-06-20 17:23:11 +08:00