modelscope

mirror of https://github.com/modelscope/modelscope.git synced 2025-12-23 19:49:24 +01:00

Author	SHA1	Message	Date
suluyana	9ece90ee84	skip plugin test case	2023-07-29 21:35:21 +08:00
suluyan.sly	05e1357c32	Merge branch 'master-github' into master-merge-github-230728	2023-07-28 16:40:34 +08:00
wenmeng.zwm	3b485d5835	fix plugin python module missing files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749 * fix plugin python module missing files	2023-07-28 16:14:56 +08:00
frozoul	2566d028cd	cv/cv nerf 3d reconstruction 4k nerf damo (#389 ) * add 4k-nerf core files * update configure file * update dataloader and model path * update unittest * Delete test_4k.py * update unittest * update unittest * update pre-commit * update dataloader * update cuda code path * check with pre-commit --------- Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com> Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>	2023-07-28 10:37:13 +08:00
tongmu.wh	475924a421	correct language recognition taks name The ModelScope platform team finalized the language recognition task name as speech-language-recognition; the code is updated accordingly. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517 * correct language recognition taks name	2023-07-27 21:03:04 +08:00
suluyan.sly	1c6f5fe775	Merge branch 'master-github' into master-merge-github-230727 Conflicts: examples/pytorch/baichuan/finetune_baichuan.py examples/pytorch/chatglm6b/finetune.py	2023-07-27 17:29:27 +08:00
mengyang.fmy	18f998a85c	add text-to-360pano-image pipeline, mod cv requirements A 360-degree panorama image generation model scheduled for release in July; weights for the in-house model: https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary #### Dependencies ##### Because xformers is required, torch 1.13.1 is recommended ``` pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116 ``` ##### The matching diffusers and xformers versions are ``` pip install -U diffusers==0.18.0 pip install xformers==0.0.16 pip install triton, accelerate, transformers ``` ##### The ModelScope Library needs the cv extra ``` pip install modelscope pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html ``` ##### In addition, a third-party library, Real-ESRGAN, must be installed as follows ``` # Install basicsr - https://github.com/xinntao/BasicSR # We use BasicSR for both training and inference pip install basicsr # facexlib and gfpgan are for face enhancement pip install facexlib pip install gfpgan pip install Pillow pip install tqdm pip install realesrgan==0.3.0 ``` Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430 * add text-to-360pano-image pipeline * add text-to-360pano-image pipeline, mod cv requirements * rm redundant files and cv requirements; add standard input and output definations * fix diffusers==0.18.0 and run test * fix diffusers==0.18.0 in multi-modal and run test again * add model_revision='v1.0.0' * fix yapf * add trycatch for enabling xformers * fix key error * add install xformers in test/setup * skip highres.fix in ci * feat: Fix conflict, auto commit by WebIDE	2023-07-27 11:33:39 +08:00
Wang Qiang	66cf72a75c	Merge pull request #376 from XDUWQ/custom_diffusion Custom method for finetuning stable diffusion	2023-07-27 10:41:38 +08:00
Zackary Shen	ba4db97507	upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407 ) * add vq_compression model * add vq_compression model * check pre-commit for lint test * fix by flake8 * update * update * update * the last update * the laast update * update test_level>=0 --------- Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>	2023-07-26 17:20:13 +08:00
tongmu.wh	ba1a333ba6	add language recognition pipelines and models Add a language recognition pipeline and model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083 * add language recognition pipelines and models * add a clustering method for speaker diarization * define input and output type for language recognition	2023-07-25 21:07:56 +08:00
zeyinzi.jzyz	672c4899e9	add sd swift tuner SD-Tuner base on Swift (LoRA/Adapter/Prompt) Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798 * sd swift tuner * fix pre-checker	2023-07-25 19:00:49 +08:00
shuli.cly	526e1371f5	Merge the speaker-turn-detection codes, local test finished # Speaker Diarization Speaker-Turn Detection CR Like Dialogue-Detection, this model is a submodule of the Speaker Diarization task (`audio/speaker diarization`). This commit adds a text-based model; the initial local model was trained with huggingface, and the commit reuses part of the `nlp/token-classification` model code. To ease later maintenance and decouple it from the nlp code, separate modules were created under model, pipeline, and preprocessor and registered anew. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13364720 * std first commit * local test pass for speaker-turn-detection * update speaker-turn-detection pipeline task outputs format; update pipeline outputs; update test scripts	2023-07-25 18:57:47 +08:00
hemu.zp	80f76ca475	Support stream output for transformers model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13271136 * support stream for transformers model * set test_level >= 2 * support hf model and chatglm2 * remove streaming_output for chatglm2	2023-07-25 17:41:32 +08:00
wenmeng zhou	64203e89ee	Compatibility for huggingface transformers (#391 )	2023-07-24 20:53:27 +08:00
XDUWQ	8e00d85317	fix bugs	2023-07-24 19:46:22 +08:00
lylalala	f805d86aed	llama2 support chat (#404 ) * support chat * update llama2 chat testcase * add gen kwargs and devices * update unittest and support max_length in multi-turn dialogue	2023-07-24 15:38:01 +08:00
tingwei.gtw	d16522723a	[to #42322933 ] add files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13158565 * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * update test data * [to #42322933] add files * Merge remote-tracking branch 'origin' into feature/sal_try_on * [to #42322933] add files * Merge remote-tracking branch 'origin' into feature/sal_try_on	2023-07-24 10:16:29 +08:00
mushenL	f77237b049	add llama2 pipeline (#399 ) * Modify the parameter passing of the text_generation_pipeline class * add llama2 pipeline * add llama pipeline v1.1 * add llama pipeline v1.2 * add llama pipeline v1.3 * add llama pipeline v1.0.4	2023-07-22 21:53:04 +08:00
shuli.cly	13e345f6d9	add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection # Speaker Diarization Dialogue Detection CR This model is a submodule of the Speaker Diarization task (`audio/speaker diarization`). This commit adds a text-based model whose I/O and intermediate steps closely resemble `nlp/text-classification`, and the initial local model was also trained with huggingface, so the commit reuses part of the `nlp/text-classification` model code. To ease later maintenance and decouple it from the nlp code, separate modules were created under model, pipeline, and preprocessor and registered anew. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13269649 * start to add speaker_diarization_dialogue_detection files; Need to change constant and test * add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection * update test case * add comments for speaker diarization dialogue detection pipelines * add outputs type and inputs type for speaker_diarization_dialogue_detection	2023-07-20 19:29:59 +08:00
shenweichao.swc	05c65ba225	add s2net for panorama_depth_estimation Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13310819 * add s2net codes * fix sphdecoder * Merge branch 'master' into dev/cv_s2net_panorama_depth_estimation * revise comments in the pipeline * revise the code	2023-07-20 19:28:03 +08:00
xiangpeng.wxp	4085d821f3	[to #42322933 ] add polylm, a polyglot large language model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13339595	2023-07-20 18:21:07 +08:00
XDUWQ	66795aa3ff	change tests level	2023-07-19 09:41:21 +08:00
Wang Qiang	0b85979f2e	Update diffusers version to 0.18.0 (#377 ) * update diffusers to 0.18.0 * fix bugs	2023-07-14 19:02:52 +08:00
baiguan.yt	ceac129c6b	add parameters height and width for text-to-video Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13171907	2023-07-14 16:22:10 +08:00
XDUWQ	34ab717393	custom_diffusion	2023-07-12 19:47:32 +08:00
XDUWQ	1caa45422c	custom diffusion	2023-07-11 20:46:32 +08:00
yeqinghao.yqh	41cbb8e393	Fix mPLUG-Owl generation length bug Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13209284	2023-07-10 18:56:12 +08:00
tongmu.wh	a7f7a67855	fix details of speaker models Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13203011 * fix details of speaker models	2023-07-10 18:54:26 +08:00
chenyafeng.cyf	543d03e32b	3dspeaker Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13199180	2023-07-07 15:00:35 +08:00
wenmeng.zwm	0271b9c256	Merge branch 'master-github' into merge_master_github_0628	2023-06-28 20:27:34 +08:00
Wang Qiang	a018cd6107	Dreambooth method for finetuning stable diffusions (#339 ) * Copyright * dreambooth * dreambooth test trainer * fix bugs * pre-commit --------- Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>	2023-06-28 20:10:28 +08:00
mulin.lyh	1ea9b58447	fix torch2.0 compatible issue Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13086361 * fix face-aligment compatible * fix torch2.0 compatible issue	2023-06-28 14:15:48 +08:00
Wang Qiang	6942144ad7	Stable Diffusion model checkpoint export to onnx. (#340 ) * stable diffusion export onnx * fix pre commit bugs * fix bugs * safety checker support * test export stable diffusion	2023-06-28 13:26:19 +08:00
yuze.zyz	8f18274f75	Add teardown for tests Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12643554 * add teardown for tests * add teardown for dialog_modeling_trainer,document_grounded_dialog_generate_trainer,document_grounded_dialog_rerank_trainer,document_grounded_dialog_retrieval_trainer,training_args,translation_evaluation_trainer,translation_trainer	2023-06-28 09:44:44 +08:00
mulin.lyh	eb0f0216c6	fix torch 2.x compatible issue Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13045011 * fix torch 2.x compatible issue * fix torch 2.x compatible issue * fix complex-valued input tensor matching the output from stft with return_complex=True. * skip plugin test temporarily for modify torch version * fix test_speech_signal_process.py compatible issue * fix lint issue * upgrade funasr to 0.6.5	2023-06-27 14:40:51 +08:00
yuze.zyz	a58be34384	Add Lora/Adapter/Prompt and support for chatglm6B and chatglm2-6B Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12770413 * add prompt and lora * add adapter * add prefix * add tests * adapter smoke test passed * prompt test passed * support model id in petl * migrate chatglm6b * add train script for chatglm6b * move gen_kwargs to finetune.py * add chatglm2 * add model definination	2023-06-27 14:38:18 +08:00
xingjun.wxj	1dbff6cb48	Support jsonl format in meta data Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13071970 * support jsonl in meta * add UT and refine fetch_meta_files_from_url	2023-06-27 11:58:19 +08:00
wenmeng.zwm	9e51920fdb	Merge branch 'master-github' into merge_master_github_0626	2023-06-26 21:04:05 +08:00
wenmeng zhou	6dea1d5646	Fix/citest timeout (#308 ) * timeout for citest set to 240min * update docker image * fix ci template not packed in whl * update docker image version to 1.6.1 and add python3.8 support * randome choose a model for controlnet to avoid oom	2023-06-26 11:23:10 +08:00
mulin.lyh	698c794070	[to #50537864 ]fix: fix select case issue Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13055098	2023-06-25 22:44:29 +08:00
chenyafeng.cyf	29062d9f94	eres2net_aug v2 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13046524 * eres2net_aug v2	2023-06-25 18:07:04 +08:00
tongmu.wh	f03c93cda5	add speaker diarization pipeline and improve some speaker pipelines Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12891685 * add new speaker diarization pipeline. improve the capability of speaker pipelines	2023-06-21 17:56:05 +08:00
xingjun.wxj	0db0ec5586	Merge code from github 1. Merge(add) daily regression from github PR (daily_regression.yaml) 2. Add lora stable diffusion from github PR Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13010802 * fix: device arg not work, rename device to ngpu (#272) * Correcting the lora stable diffusion example script (#300) * add vad model and punc model in README.md add vad model and punc model * Merge pull request #302 from modelscope/langgz-patch-1 add vad model and punc model in README.md * add 1.6 * modify ignore * Merge pull request #307 from modelscope/dev_rs_16 Merge release 1.6 * undo datetime to 2099 * Merge pull request #311 from modelscope/fix_master_version undo datetime to 2099 * add daily regression workflow * modify workflow name * fix cron format issue * lora trainer * Merge pull request #315 from liuyhwangyh/add_regression_workflow add daily regression workflow	2023-06-21 10:22:06 +08:00
xingjun.wxj	cc3c384d5e	Fix issues for downloading mplug-youku dataset 1. Optimize downloading meta-csv files for large-scale dataset like mPLUG-youku (> 1GB for meta csv mapping) 2. Add head and overall progress bar for NativeIterableDataset 3. Modify the try-catch info for oss_utils Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12952842	2023-06-15 15:42:21 +08:00
hemu.zp	96c2d42f09	Add StreamingMixin Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12445731 * StreamingMixin poc * update design * Merge branch 'master' into feat/StreamingMixin * add dicstr * make postprocessor input consistent	2023-06-08 19:40:14 +08:00
mulin.lyh	25a7398aea	docker file py38 and py37 compatible merge Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12722824 * debug * add missing deps * add stanza * fix numpy issue * numpy version * tf case * numpy <= 1.22.0 * fix tf 1.15.5 case * add python38 and python37 compatible docker file * cv add fvcore * add easycv * remove debug code * fix style issue * fix python3.7 python3.8 compatible issue * comment * modify run config * scipy<=1.7.3 * revert scipy<=1.7.3 * fix compatible bug * fix compatible bug * add easycv * update version: 1.6.1rc0 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12754942 fix python3.8 and python3.8 compatible issue * numpy version change * numpy version * numpy version * fix numpy version * fix numpy version * update version: 1.6.1rc0 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12754942 fix python3.8 and python3.8 compatible issue * restore setup.py * restore setup.py * add build base image * fix style * numpy version fix * optimize build image, split base and modelscope image * fix style issue * modify example path * modify examples folder * add SETUPTOOLS_USE_DISTUTILS=stdlib comments	2023-06-08 10:01:58 +08:00
xixing.tj	1b7e0f50f4	add ocr detection new model db-nas Add a 5M DB-NAS OCR text detection model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12760623 * add ocr detection new model db-nas * add comment	2023-05-31 21:32:46 +08:00
yuanzhi.zyz	10c39b5ce1	add new ocr recognition model (LightweightEdge) and some functions 1. Add the new lightweight on-device recognition model LightweightEdge and tidy up the existing CRNN and ConvNextViT code 2. Add batch inference support Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12787905	2023-05-31 21:16:22 +08:00
yuze.zyz	003c33648a	Fix CI: test merge dataset failed Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12771691 * fix CI * add teardown	2023-05-29 10:32:33 +08:00
xingjun.wxj	5b99bb24fe	update version: 1.6.1rc0 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12754942	2023-05-25 20:28:36 +08:00

1195 Commits