24 Commits

Author SHA1 Message Date
Mashiro
e71057cacf feat(audio/ans): Add ZipEnhancer and related layers for acoustic nois… (#1019)
* feat(audio): Add acoustic noise suppression pipeline and tests for zipenhancer model

* Introduce `ZipEnhancer` module and associated layers (`ZipEnhancerLayer`, `Generator`, `ZipFormer`, ...).
* Add `speech_zipenhancer_ans_multiloss_16k_base` pipeline for `ZipEnhancer` module.
* Add new test cases and update metainfo.


Co-authored-by: Haoxu Wang <wanghaoxu.whx@alibaba-inc.com>
2024-10-24 15:38:09 +08:00
mulin.lyh
cba4e40bc1 fix numpy and pandas compatibility issue
Confirmed affected models (damo):  
ONE-PEACE-4B	ModuleNotFoundError: MyCustomPipeline: MyCustomModel: No module named 'one_peace'; missing dependency.
cv_resnet50_face-reconstruction	not compatible with tf2  
nlp_automatic_post_editing_for_translation_en2de	tf2.0 compatibility issue; requires tf1.x  
cv_resnet18_ocr-detection-word-level_damo	tf2.x compatibility issue  
cv_resnet18_ocr-detection-line-level_damo	tf compatibility issue  
cv_resnet101_detection_fewshot-defrcn	model requires detection0.3+torch1.11.0  
speech_dfsmn_ans_psm_48k_causal	librosa, numpy compatibility issue  
cv_mdm_motion-generation	numpy dependency version compatibility issue: File "/opt/conda/lib/python3.8/site-packages/smplx/body_models.py",  
cv_resnet50_ocr-detection-vlpt	numpy compatibility issue  
cv_clip-it_video-summarization_language-guided_en	tf compatibility issue

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13744636
* numpy and pandas without version pins

* fix compatibility issue

* fix numpy compatibility issue

* modify ci

* fix lint issue

* replace Image.ANTIALIAS with Image.Resampling.LANCZOS for Pillow compatibility

* skip incompatible cases

* fix numpy compatibility issue, skip cases that are not compatible with numpy or tensorflow2.x

* skip compatible cases

* fix clip model issue

* fix body 3d keypoints compatibility issue
2023-08-22 23:04:31 +08:00
lingcai.wl
0aaaaa13ad [to #49275037] remove demo check and fix service decoder
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12432683
2023-05-14 23:41:40 +08:00
bin.xue
b4c90d8160 [to #42322933] feat: add dingding denoise model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11731594

* feat: add dingding denoise

* refactor: delete duplicated unidfsmn class

* refactor: delete empty lines

* refactor: make some methods inline

* style: add license and optimize imports

* style: comments style
2023-03-07 21:55:10 +08:00
bin.xue
525fa3ea89 [to #42322933]test: use 'master' branch in training test
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10509580
2022-10-25 12:10:07 +08:00
wenmeng.zwm
6808e9a301 [to #44902099] add license for framework files
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10189613
2022-09-20 17:49:31 +08:00
Yingda Chen
e0ef60ca9b [to #42322933] skip demo test by default 2022-09-09 14:56:33 +08:00
bin.xue
4be7737122 [to #42322933] audio pipelines accept url as input 2022-09-09 13:51:09 +08:00
lingcai.wl
7a49fa1cc6 [to #44657982] add unittest for demo and demotest utils
unittest for demo service
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/10006180
2022-09-08 14:08:51 +08:00
bin.xue
d0933a2374 [to #42322933] add far field kws model pipeline
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9767151
2022-08-16 20:23:55 +08:00
bin.xue
aef3fa34bd [to #42322933] feat: omit pipeline name in test
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9662192
2022-08-06 12:35:38 +08:00
bin.xue
b2be1abcad [to #42322933] feat: aec pipeline also accept tuple and add test 2022-08-02 14:03:11 +08:00
bin.xue
558cf01d57 [to #42322933]feat: split speech-signal-process task to subtasks 2022-08-01 20:56:32 +08:00
bin.xue
d987ac634b [to #42322933] test: disable tests of speech-signal-process task for splitting sub tasks 2022-08-01 16:16:43 +08:00
bin.xue
e3bffedb87 [to #42322933] change the aec pipeline's C++ library dependency to MinDAEC
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9563105

* use MinDAEC instead of cdll

* feat: ANS pipeline can accept bytes as input and adjust processing order to reduce the amount of computation
2022-07-28 22:59:57 +08:00
bin.xue
087e684da5 [to #42322933] fix ans_pipeline bug and add test 2022-07-27 18:45:17 +08:00
wenmeng.zwm
9f1ad5da80 fix several small problems for v0.2
* rename name of whl to modelscope
* auto install all requirements when running citest
* auto download dynamic lib for aec pipeline
* fix audio extras not set in setup.py
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9297825
2022-07-07 16:40:11 +08:00
bin.xue
3cbc04c6f6 [to #43003827] update url of aec lib
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9276877

    * update url of aec lib
2022-07-05 19:50:55 +08:00
bin.xue
04b7eba285 [to #42322933] Merge ANS pipeline into master
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9178339

    * refactor: move aec models to audio/aec

* refactor: move aec models to audio/aec

* refactor: move aec models to audio/aec

* refactor: move aec models to audio/aec

* feat: add unittest for ANS pipeline

* Merge branch 'master' into dev/ans

* add new SoundFile to audio dependency

* Merge branch 'master' into dev/ans

* use ANS pipeline name from metainfo

* Merge branch 'master' into dev/ans

* chore: update docstring of ANS module

* Merge branch 'master' into dev/ans

* refactor: use names from metainfo

* refactor: enable ans unittest

* refactor: add more log message in unittest
2022-06-28 14:41:08 +08:00
Yingda Chen
cfeac7afd8 [to #42322933] skip aec test 2022-06-27 15:07:46 +08:00
yingda.chen
1a0d4af55a [to #42322933] test level check
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9143809
2022-06-23 16:55:48 +08:00
wenmeng.zwm
e288cf076e [to #42362853] refactor pipeline and standardize module_name
* using get_model to validate hub path 
* support reading pipeline info from configuration file
* add metainfo const
* update model type and pipeline type and fix UT
* relax requirement for protobuf
* skip two dataset tests due to temporary failure
 
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9118154
2022-06-22 14:15:32 +08:00
mulin.lyh
76c6ff6329 [to #42675838]merge model hub code
Merge model hub code
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/9096493
2022-06-21 20:04:25 +08:00
bin.xue
31498c1d6a [to #41669377] add speech AEC pipeline
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8973072

    * [to #41669377] docs and tools refinement and release 

1. add build_doc linter script
2. add sphinx-docs support
3. add development doc and api doc
4. change version to 0.1.0 for the first internal release version

Link: https://code.aone.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8775307

* [to #41669377] add pipeline tutorial and fix bugs 

1. add pipeline tutorial
2. fix bugs when using pipeline with certain model and preprocessor

Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8814301

* refine doc

* feat: add audio aec pipeline and preprocessor

* feat: add audio aec model classes

* feat: add audio aec loss functions

* refactor:delete no longer used loss function

* [to #42281043] support kwargs in pipeline 
        Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/8949062
* support kwargs in pipeline

* update develop doc with CR instruction

* Merge branch 'release/0.1' into dev/aec

* style: reformat code by pre-commit tools

* feat:support maas_lib pipeline auto downloading model

* test:add aec test case as sample code

* feat:aec pipeline use config from maashub

* feat:aec pipeline use feature parameters from maashub

* update setup.cfg to disable PEP8 rule W503 in flake8 and yapf

* format:fix double quoted strings, indent issues and optimize import

* refactor:extract some constant in aec pipeline

* refactor: delete no longer used  __main__ statement

* chore:change all Chinese comments to English

* fix: change file name style to lower case

* refactor: rename model name

* feat:load C++ .so from LD_LIBRARY_PATH

* feat:register PREPROCESSOR for LinearAECAndFbank

* refactor:move aec process from postprocess() to forward() and update comments

* refactor:add more readable error message when audio sample rate is not 16000

* fix: package maas_lib renamed to modelscope in import statement

* feat: optimize the error message of audio layer classes

* format: delete empty lines

* refactor: rename audio preprocessor and optimize error message

* refactor: change aec model id to damo/speech_dfsmn_aec_psm_16k

* refactor: change sample audio file url to public oss

* Merge branch 'master' into dev/aec

* feat: add output info for aec pipeline

* fix: normalize output audio data to [-1.0, 1.0]

* refactor:use constant from ModelFile

* feat: AEC pipeline can use c++ lib in current working directory and the test will download it

* fix: C++ lib downloading should work wherever the test is triggered
2022-06-17 19:56:11 +08:00