Commit Graph

21 Commits

Author SHA1 Message Date
liuyhwangyh
59760b123b Merge branch 'master' into master-merge-internal20240110 2024-01-14 10:43:11 +08:00
zhifu gao
49c04ea47e update funasr1.0 (#715)
* funasr1.0  modelscope

* fix lint issue

---------

Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>
2024-01-12 12:02:01 +08:00
neo.dzh
e7f86a751e add audio codec and codec-based TTS model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/15128959
2023-12-26 20:54:59 +08:00
mulin.lyh
51a1b76e91 fix python3.10 compatible issue
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/14678226
* modify librosa version

* fix python3.10 compatible issue

* remove healpy from requirements for Windows compatibility
2023-11-29 17:41:44 +08:00
jiaqi.sjq
1f05acef88 fix pip install error when pysptk>=0.1.19 2023-07-25 21:13:31 +08:00
tongmu.wh
ba1a333ba6 add language recognition pipelines and models
Add language recognition pipeline and model
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083
* add language recognition pipelines and models

* add a clustering method for speaker diarization

* define input and output type for language recognition
2023-07-25 21:07:56 +08:00
mulin.lyh
eb0f0216c6 fix torch 2.x compatible issue
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13045011

* fix torch 2.x compatible issue

* fix torch 2.x compatible issue

* fix complex-valued input tensor matching the output from stft with return_complex=True.

* skip plugin test temporarily for modify torch version

* fix test_speech_signal_process.py compatible issue

* fix lint issue

* upgrade funasr to 0.6.5
2023-06-27 14:40:51 +08:00
zhifu.gzf
8b4e9dcdfb ngpu bug and rm easyasr
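Fix the issue where specifying ngpu had no effect; remove easyasr, since all the models involved have been taken offline; restrict the funasr version to >=0.6.0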
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12933049
* ngpu bug and rm easyasr
2023-06-13 16:36:21 +08:00
mulin.lyh
25a7398aea docker file py38 and py37 compatible merge
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12722824
* debug

* add missing deps

* add stanza

* fix numpy issue

* numpy version

* tf case

* numpy <= 1.22.0

* fix tf 1.15.5 case

* add python38 and python37 compatible docker file

* cv add fvcore

* add easycv

* remove debug code

* fix style issue

* fix python3.7 python3.8 compatible issue

* comment

* modify run config

* scipy<=1.7.3

* revert scipy<=1.7.3

* fix compatible bug

* fix compatible bug

* add easycv

* update version: 1.6.1rc0
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12754942
fix python3.8 and python3.8 compatible issue

* numpy version change

* numpy version

* numpy version

* fix numpy version

* fix numpy version

* update version: 1.6.1rc0
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12754942
fix python3.8 and python3.8 compatible issue

* restore setup.py

* restore setup.py

* add build base image

* fix style

* numpy version fix

* optimize build image, split base and modelscope image

* fix style issue

* modify example path

* modify examples folder

* add SETUPTOOLS_USE_DISTUTILS=stdlib comments
2023-06-08 10:01:58 +08:00
mulin.lyh
6abdce929d fix py38 issue
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12660340
2023-05-17 22:11:23 +08:00
wucong.lyb
4c0b13d157 fix func update_local_model; change funasr version
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12595406
2023-05-11 20:13:20 +08:00
bin.xue
dd4299e0c8 [to #42322933] add DFSMN model to far field KWS
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12334627
* feat: set stream mode in constructor

* fix: commit model file

* doc: add more information about DFSMN

* fix: separation model evaluate returns None in speechbrain versions < 0.5.12
2023-04-20 22:45:07 +08:00
wucong.lyb
fd83ffc0fa modify timestamp config && add function for update_local_model
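Modify the parameter configuration for freely combining timestamp models; support updating the model via the "update_model" parameter when model is set to a local path.
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12261036

* modify timestamp args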

* add update_local_model function

* fix case where src_path is the same as dst_path

* change funasr version
2023-04-10 19:54:11 +08:00
yuze.zyz
35744958f8 specify lower version 0.9.2 for librosa to avoid compatibility problem
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12008222
2023-03-15 21:30:33 +08:00
wucong.lyb
8e88ff46fc change funasr version to 0.3.0
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11956032
2023-03-13 11:24:41 +08:00
jiaqi.sjq
8099cc9f60 [to #41669377] Refactoring by using kantts as third party wheel and add ptts finetune&infer support
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11913929
2023-03-09 11:32:44 +08:00
shichen.fsc
04f63ec71f [to #42322933] add new kws models in UT and update requirements/audio_kws.txt
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11830273

* [Add] add new models for kws UT

* [Update] update requirements

* pre-commit
2023-03-06 23:14:45 +08:00
jiangyu.xzy
9f655da220 add speaker diarization pipeline
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11808124

* add speaker diarization pipeline
2023-02-28 16:19:30 +08:00
jiangyu.xzy
0697f969a1 asr infer change vad/lm/punc input logic
Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/11747596

* change vad/punc/lm model input logic

* asr infer extra kwargs

* fix format

* new funasr version
2023-02-22 21:05:58 +08:00
wucong.lyb
1b300b2f3e [to #42322933] fix bugs: audio fs, asr & sv demo services 2023-02-13 11:46:32 +00:00
bin.xue
1d2b7e8634 [to #42322933] separate audio requirements 2023-02-07 02:52:01 +00:00