35 Commits

Author SHA1 Message Date
neo.dzh
e7f86a751e add audio codec and codec-based TTS model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/15128959
2023-12-26 20:54:59 +08:00
bin.xue
1d2b7e8634 [to #42322933] separate audio requirements
2023-02-07 02:52:01 +00:00
wucong.lyb
e95a32deda add args for asr_infer_pipeline, punc_pipeline, sv_pipeline & modify funasr version
add args for asr_infer_pipeline, punc_pipeline, sv_pipeline
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11547617

    * modify pipeline args

* fix output_dir

* fix sv ValueError

* fix outputs

* code style

* add args for asr_infer_pipeline, punc_pipeline, sv_pipeline & modify funasr version

* fix kwargs and add param_dict for asr_inference_pipeline

* modify code comments
2023-02-06 14:53:42 +00:00
jiangyu.xzy
742ad4b355 sv inference & asr trainer: add new inputs
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11424993

* add sv_inference for speaker embedding extraction

* add raw inputs support for speaker_verification

* add train meta_params
2023-01-13 13:52:52 +00:00
bin.xue
78f812dbb6 [to #42322933] add speech separation finetune
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11379892
2023-01-12 07:02:46 +08:00
jiangyu.xzy
3989b8da32 asr inference: support new models, punctuation, vad, sv
ASR inference supports new models, along with punctuation post-processing, endpoint detection (VAD), and speaker verification
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11366442

    * support asr new models & vad-punc models

* fix conflict

* fix format

* Merge branch 'master' into asr/asr_inference

* add punc pipeline

* add punc model

* add sv infer

* fix format

* Merge branch 'master' into asr/asr_inference

* fix sv pipeline

* remove useless comments

* fix asr_test

* fix format

* fix output format

* change test level of models which would be removed later.
2023-01-11 23:28:14 +08:00
jiaqi.sjq
453ff1dae3 [to #42322933] support byte input feature and refine fp implementations
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11338137
2023-01-09 20:56:52 +08:00
ni.chongjia
ac53ce3e36 modify format of itn_pipeline
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11257394

    * dev for asr itn inference pipeline

* add task interface

* add pipeline input

* add modelscope/pipelines/audio/itn_inference_pipeline.py

* add modelscope/pipelines/audio/itn_inference_pipeline.py

* modelscope/pipelines/audio/itn_inference_pipeline.py

* update modelscope/pipelines/audio/itn_inference_pipeline.py

* modify itn_inference_pipeline.py

* modify itn_inference_pipeline.py

* modify itn_inference_pipeline.py

* remove itn.py

* modify some names

* add modify itn_inference_pipeline.py

* modify itn_inference_pipeline.py

* modify itn_inference_pipeline.py

* modify itn_inference_pipeline.py

* modify itn

* add tests/pipelines/test_inverse_text_processing.py

* modify asr_inference_pipeline.py for the original files

* modify format

* add commits files

* Merge remote-tracking branch 'origin' into remotes/origin/asr/itn_nichongjia

* Merge remote-tracking branch 'origin' into remotes/origin/asr/itn_nichongjia

* modify the pipelines

* Merge branch 'master' into remotes/origin/asr/itn_nichongjia

* [to #47031187] fix: hub test suites cannot run in parallel
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11276872

    * [to #47031187] fix: hub test suites cannot run in parallel

* google style docs and selected file generator 

ref: https://yuque.alibaba-inc.com/pai/rwqgvl/go8sc8tqzeqqfmsz
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11150212

    * google style docs and selected file generator

* merge

* Merge remote-tracking branch 'origin' into remotes/origin/asr/itn_nichongjia

* Merge branch 'master' into remotes/origin/asr/itn_nichongjia

* add requirements for fun_text_processing
2023-01-05 16:36:17 +08:00
bin.xue
0fdf37312f [to #42322933] feat:add speech separation pipeline
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11255740
2023-01-03 13:18:44 +08:00
jiaqi.sjq
8896087034 [to #42322933] support kantts infer and finetune
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11111331#tab=detail
2022-12-20 10:45:34 +08:00
wenmeng.zwm
e2bf864f63 update audio requirements to use funasr>=0.1.4
2022-12-07 11:37:38 +08:00
jiangyu.xzy
9bfc77c178 support asr new models
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10919277

* support new asr paraformer model

* support asr conformer model
2022-11-30 17:08:35 +08:00
jiangyu.xzy
2b62084146 add funasr based asr inference
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10868583
2022-11-25 17:49:24 +08:00
Yingda Chen
deb847614a [to #42322933] limit espnet version
2022-10-14 21:59:52 +08:00
bin.xue
3863efc14d [to #42322933] add far field KWS trainer
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10275823
2022-10-13 10:15:33 +08:00
jiaqi.sjq
e90ff9e479 [to #42322933] tts sambert am changes from TensorFlow to PyTorch and add licenses
* [to #41669377] docs and tools refinement and release 

1. add build_doc linter script
2. add sphinx-docs support
3. add development doc and api doc
4. change version to 0.1.0 for the first internal release version

Link: https://code.aone.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8775307
2022-09-27 22:09:30 +08:00
bin.xue
d0933a2374 [to #42322933] add far field kws model pipeline
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9767151
2022-08-16 20:23:55 +08:00
shichen.fsc
c663dd8cf6 [to #42322933] add pcm-bytes supported for KWS
Add support for the PCM bytes data type to KWS
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9635439
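
Accepting PCM bytes usually means decoding raw samples in memory instead of reading a WAV file from disk. The following is a minimal sketch of that decoding step, not the KWS pipeline's actual code. The function name `pcm16le_to_floats` is hypothetical, and the sample format (16-bit signed, little-endian, mono) is an assumption, since the commit does not state it.

```python
import struct
from typing import List


def pcm16le_to_floats(pcm: bytes) -> List[float]:
    """Decode 16-bit signed little-endian PCM bytes into floats in [-1.0, 1.0).

    The sample format is an assumption made for this illustration.
    """
    if len(pcm) % 2 != 0:
        raise ValueError("16-bit PCM data must contain an even number of bytes")
    count = len(pcm) // 2
    # "<" forces little-endian with the standard 2-byte size for "h".
    samples = struct.unpack(f"<{count}h", pcm)
    return [value / 32768.0 for value in samples]


if __name__ == "__main__":
    raw = struct.pack("<3h", 0, 16384, -32768)
    print(pcm16le_to_floats(raw))
```

Working on bytes directly lets a caller pass audio received from a stream or a network request without creating a temporary file first.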
2022-08-04 11:49:26 +08:00
wenmeng.zwm
34acc596e1 [to #43115513] fix module path error for ast and add numpy<=1.18
1. fix module path error: if the code path contains multiple `modelscope` strings, use the last one as the start position of the modelscope source directory
2. add numpy version constraint <=1.18
3. add __init__.py to models/cv/image_to_image_translation
4. split audio requirements from all
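
The rule in item 1 can be sketched as follows. This is only an illustration of the "use the last occurrence" idea with `str.rfind`; the function name `module_name_from_path` and the example path are hypothetical and are not taken from the MaaS-lib source.

```python
def module_name_from_path(file_path: str, package: str = "modelscope") -> str:
    """Derive a dotted module name, anchored on the LAST occurrence of `package`.

    Hypothetical helper for illustration; the real implementation may differ.
    """
    normalized = file_path.replace("\\", "/")
    # rfind returns the index of the last match, so a checkout directory that
    # is itself named "modelscope" does not shift the package root.
    start = normalized.rfind(package)
    if start == -1:
        raise ValueError(f"{package!r} not found in {file_path!r}")
    relative = normalized[start:]
    if relative.endswith(".py"):
        relative = relative[:-3]
    return relative.replace("/", ".")


if __name__ == "__main__":
    path = "/home/user/modelscope/src/modelscope/models/audio/ans.py"
    print(module_name_from_path(path))  # modelscope.models.audio.ans
```

Using the first occurrence instead would yield `modelscope.src.modelscope.models.audio.ans` for the same path, which is not an importable module name.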

 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9587929
2022-08-01 15:42:58 +08:00
bin.xue
e3bffedb87 [to #42322933] change the AEC pipeline's C++ library dependency to MinDAEC
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9563105

    * use MinDAEC instead of cdll

* feat: ANS pipeline can accept bytes as input and adjust processing order to reduce the amount of computation
2022-07-28 22:59:57 +08:00
wenmeng.zwm
590d531484 [to #43115513] requirements refine and preparation for v0.3 release
*  remove tensorflow numpy from audio requirements
*  add audio requirements to all
* auto set model to eval mode for pipeline
*  add audio requirement check hint for easyasr and kwsbp
*  fix docs build error

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9561021
2022-07-28 22:04:18 +08:00
wenmeng.zwm
d55525bfb6 [to #43112771] requirements check and lazy import support
2022-07-27 17:29:16 +08:00
shichen.fsc
5ac448f5c7 [to #42322933] simplify asr inference code, and remove disk-write behavior
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9410174
2022-07-27 14:49:24 +08:00
mulin.lyh
fc90bf0d1a [to #43554786]fix: test error is not detected in gate test, protobuf version to (3, 3.21.0) for tensorflow
Restrict the protobuf version; fix the issue where unit tests with errors still returned exit code 0
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9510263

    * fix test error is not detected in gate test, protobuf version to (3, 3.21.0)
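
A test run that hits errors but still exits with status 0 is commonly avoided by deriving the exit code from the `unittest` result object. The sketch below uses only the standard library; `exit_code_for`, `passing_check`, and `erroring_check` are illustrative names, and this is not the project's actual gate-test runner.

```python
import io
import unittest


def passing_check() -> None:
    assert 1 + 1 == 2


def erroring_check() -> None:
    # An unexpected exception is recorded as an "error", not a "failure".
    raise RuntimeError("simulated error inside a test")


def exit_code_for(check) -> int:
    """Run one check and map the result to a process exit code."""
    suite = unittest.TestSuite([unittest.FunctionTestCase(check)])
    result = unittest.TextTestRunner(stream=io.StringIO()).run(suite)
    # wasSuccessful() is False when the run recorded failures OR errors.
    return 0 if result.wasSuccessful() else 1


if __name__ == "__main__":
    print(exit_code_for(passing_check))   # 0
    print(exit_code_for(erroring_check))  # 1
```

A real runner would pass the computed value to `sys.exit` so that the CI gate sees a non-zero status.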
2022-07-26 10:57:16 +08:00
shichen.fsc
b3b950e616 [to #42322933] simplify kws code, and remove disk-write behavior
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9491681
2022-07-25 22:37:15 +08:00
wenmeng.zwm
e62cd756df [to #42322933] relax requirements
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9407594
2022-07-18 17:52:14 +08:00
jiaqi.sjq
a17f29ce54 [to #42322933] Update tts task inputs
Refactor tts task inputs
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9412937
2022-07-18 17:50:59 +08:00
shichen.fsc
d7c780069f [to #42322933] add asr inference with pytorch(espnet framework)
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9273537
2022-07-11 16:48:47 +08:00
jiaqi.sjq
d313c440c4 [to #9303837] Merge frontend am and vocoder into one model card
Merge frontend, am and vocoder model card into one model card.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9303837
2022-07-08 14:26:18 +08:00
wenmeng.zwm
274cf6ffa9 [to #42362425] fix audio_requirement and refine quickstart, changelog doc
* make audio requirements optional
 * add changelog for version v0.2
 * add numpy constraint for compatibility with tensorflow1.15
 * update faq
 * fix nlp requiring tensorflow
 * add torchvision to multimodal dependency
 * bump version from 0.2.1 to 0.2.2
 * add warning msg when tensorflow is not installed
 
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9268278
2022-07-05 21:44:33 +08:00
wenmeng.zwm
8e51a073a6 [to #42966122] requirements enhancement and self-hosted repo support
* add self-hosted repo:
* add extra requirements for different field and reduce necessary requirements
* update docker file with so required by audio
* add requirements checker which will be used later when implementing lazy import
* remove repeated requirements and replace opencv-python-headless with opencv-python

example usage:
```shell
pip install model_scope[all] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
pip install model_scope[cv] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
pip install model_scope[nlp] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
pip install model_scope[audio] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html
pip install model_scope[multi-modal] -f https://pai-vision-data-hz.oss-cn-zhangjiakou.aliyuncs.com/release/maas/repo.html

```
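
A requirements checker of the kind mentioned in the list above could look roughly like the following. This is a minimal sketch under stated assumptions, not the project's implementation: `missing_modules` and `check_requirements` are hypothetical names, the module names in the demo are placeholders, and only top-level module names are handled.

```python
import importlib.util
from typing import Iterable, List


def missing_modules(module_names: Iterable[str]) -> List[str]:
    """Return the top-level module names that cannot be found for import."""
    # find_spec locates a module without importing it, so the check is cheap.
    return [name for name in module_names
            if importlib.util.find_spec(name) is None]


def check_requirements(field: str, module_names: Iterable[str]) -> None:
    """Raise ImportError with an install hint if any module is missing."""
    missing = missing_modules(module_names)
    if missing:
        raise ImportError(
            f"Missing modules for '{field}': {', '.join(missing)}. "
            f"Try installing the '{field}' extra, e.g. model_scope[{field}]."
        )


if __name__ == "__main__":
    check_requirements("demo", ["json", "wave"])  # stdlib modules, passes
    print(missing_modules(["json", "hypothetical_missing_pkg_xyz"]))
```

Running such a check just before a field-specific module is first used is one way to pair it with lazy imports: users who never touch audio features never need the audio dependencies.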
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9211383
2022-07-01 16:38:06 +08:00
bin.xue
04b7eba285 [to #42322933] Merge ANS pipeline into master
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9178339

    * refactor: move aec models to audio/aec

* refactor: move aec models to audio/aec

* refactor: move aec models to audio/aec

* refactor: move aec models to audio/aec

* feat: add unittest for ANS pipeline

* Merge branch 'master' into dev/ans

* add new SoundFile to audio dependency

* Merge branch 'master' into dev/ans

* use ANS pipeline name from metainfo

* Merge branch 'master' into dev/ans

* chore: update docstring of ANS module

* Merge branch 'master' into dev/ans

* refactor: use names from metainfo

* refactor: enable ans unittest

* refactor: add more log message in unittest
2022-06-28 14:41:08 +08:00
jiaqi.sjq
4a3f22259f * Relax version requirements in audio.txt
* Fix bugs in ttsfrd which may cause text-to-speech breaks to disappear, and upgrade it to version 0.0.2
* other fix to make ut pass
2022-06-24 12:05:01 +08:00
wenmeng.zwm
e288cf076e [to #42362853] refactor pipeline and standardize module_name
* using get_model to validate hub path 
* support reading pipeline info from configuration file
* add metainfo const
* update model type and pipeline type and fix UT
* relax requirement for protobuf
* skip two dataset tests due to temporary failure
 
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9118154
2022-06-22 14:15:32 +08:00
jiaqi.sjq
b1490bfd7f [to #9061073] feat: merge tts to master
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9061073

    * [to #41669377] docs and tools refinement and release 

1. add build_doc linter script
2. add sphinx-docs support
3. add development doc and api doc
4. change version to 0.1.0 for the first internal release version

Link: https://code.aone.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8775307

* [to #41669377] add pipeline tutorial and fix bugs 

1. add pipeline tutorial
2. fix bugs when using pipeline with certain model and preprocessor

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8814301

* refine doc

* refine doc

* merge remote release/0.1 and fix conflict

* Merge branch 'release/0.1' into 'nls/tts'

Release/0.1



See merge request !1700968

* [Add] add tts preprocessor without requirements. finish requirements build later

* [Add] add requirements and frd submodule

* [Fix] remove models submodule

* [Add] add am module

* [Update] update am and vocoder

* [Update] remove submodule

* [Update] add models

* [Fix] fix init error

* [Fix] fix bugs with tts pipeline

* merge master

* [Update] merge from master

* remove frd submodule and use wheel from oss

* change scripts

* [Fix] fix bugs in am and vocoder

* [Merge] merge from master

* Merge branch 'master' into nls/tts

* [Fix] fix bugs

* [Fix] fix pep8

* Merge branch 'master' into nls/tts

* [Update] remove hparams and import configuration from kwargs

* Merge branch 'master' into nls/tts

* upgrade tf113 to tf115

* Merge branch 'nls/tts' of gitlab.alibaba-inc.com:Ali-MaaS/MaaS-lib into nls/tts

* add multiple versions of ttsfrd

* merge master

* [Fix] fix cr comments

* Merge branch 'master' into nls/tts

* [Fix] fix cr comments 0617

* Merge branch 'master' into nls/tts

* [Fix] remove commented-out code

* [Merge] merge from master

* [Fix] fix crash for incompatible tf and pytorch version, and frd using zip file resource

* Merge branch 'master' into nls/tts

* [Add] add cuda support
2022-06-20 17:23:11 +08:00