modelscope

mirror of https://github.com/modelscope/modelscope.git synced 2026-07-10 20:39:53 +02:00

Author	SHA1	Message	Date
Wang Qiang	78f5e6a8bf	Switching model from huggingface to modelscope hub of efficient tuning (#479 )	2023-08-18 20:30:04 +08:00
Wang Qiang	f321804ab2	Merge pull request #472 from kangzhao2/baishao_test Add image2video	2023-08-18 20:29:36 +08:00
kangzhao2	2643d985dc	update test_image2video.py	2023-08-18 20:13:20 +08:00
kangzhao2	2605935797	fix pre-commit	2023-08-18 11:47:45 +08:00
kangzhao2	4ca76b0a85	fix comments again	2023-08-17 20:34:43 +08:00
kangzhao2	b8c76a426f	fix comments	2023-08-17 20:03:05 +08:00
Wang Qiang	4ed1111d70	Fix bugs of configs file path and duration (#476 ) * fix bugs of configs file path and duration * pre commit * delete configs * test videocomposer model version	2023-08-16 21:03:11 +08:00
kangzhao2	90f7a5c6c0	update files	2023-08-16 11:35:15 +08:00
kangzhao2	037e73fe6e	baishao	2023-08-15 21:32:30 +08:00
Wang Qiang	ee8afd2d62	VideoComposer: Compositional Video Synthesis with Motion Controllability (#431 ) * VideoComposer: Compositional Video Synthesis with Motion Controllability * videocomposer pipeline * pre commit * delete xformers	2023-08-15 12:01:03 +08:00
wenmeng zhou	74d8317bb0	fix pipeline check error (#455 ) * fix pipeline check error * update	2023-08-11 15:52:53 +08:00
Ran Zhou	026a9ef227	Add machine reading comprehension model, preprocessor and pipeline (#451 ) * Add machine reading comprehension model, preprocessor and pipeline * fix precommit errors * Optimize mrc preprocessor, add mrc input output definition, add mrc pipeline docstr --------- Co-authored-by: seadamo <ran.zhou@alibaba-inc.com>	2023-08-11 13:47:26 +08:00
wenmeng.zwm	725521a2af	skip test_text_to_360panorama_image test	2023-08-08 16:41:19 +08:00
lukeming.lkm	b3a61ef6f4	update LICENSE Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13516119	2023-08-03 11:07:34 +08:00
lukeming.lkm	bd2f70a6eb	add quantization in qwen pipelines and relevant unittests Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13499600 * add quant features * resolve import * resolve format * fix save vocab	2023-08-02 14:05:13 +08:00
lukeming.lkm	33bd74a7be	add qwen 7b base and chat 添加QWen 7b base模型和chat模型及相关pipelines Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13482235 * add qwen 7b base and chat * fix logger * update examples, lint test * add unittest for qwen base and chat * rename qwen to qwen-7b * resolve imports and add a registry to text-generation * reset load model from pretrained * fix precheck * skip qwen test case now * remove strange file	2023-08-02 09:25:21 +08:00
suluyan.sly	05e1357c32	Merge branch 'master-github' into master-merge-github-230728	2023-07-28 16:40:34 +08:00
wenmeng.zwm	3b485d5835	fix plugin python module missing files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749 * fix plugin python module missing files	2023-07-28 16:14:56 +08:00
frozoul	2566d028cd	cv/cv nerf 3d reconstruction 4k nerf damo (#389 ) * add 4k-nerf core files * update configure file * update dataloader and model path * update unittest * Delete test_4k.py * update unittest * update unittest * update pre-commit * update dataloader * update cuda code path * check with pre-commit --------- Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com> Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>	2023-07-28 10:37:13 +08:00
tongmu.wh	475924a421	correct language recognition taks name modelscope平台同学最终定下语种识别任务名为speech-language-recognition，对应进行代码中的相关改动 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517 * correct language recognition taks name	2023-07-27 21:03:04 +08:00
suluyan.sly	1c6f5fe775	Merge branch 'master-github' into master-merge-github-230727 Conflicts: examples/pytorch/baichuan/finetune_baichuan.py examples/pytorch/chatglm6b/finetune.py	2023-07-27 17:29:27 +08:00
mengyang.fmy	18f998a85c	add text-to-360pano-image pipeline, mod cv requirements 7月份计划上线的360全景图生成模型，自研模型权重文件地址https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary #### 依赖项说明 ##### 由于要使用xformers，torch版本最好使用1.13.1 ``` pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116 ``` ##### 对应的diffusers和xformers版本如下 ``` pip install -U diffusers==0.18.0 pip install xformers==0.0.16 pip install triton, accelerate, transformers ``` ##### ModelScope Library 需要使用cv ``` pip install modelscope pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html ``` ##### 此外，还需要安装第三方的一个库，Real-ESRGAN，安装方法如下 ``` # Install basicsr - https://github.com/xinntao/BasicSR # We use BasicSR for both training and inference pip install basicsr # facexlib and gfpgan are for face enhancement pip install facexlib pip install gfpgan pip install Pillow pip install tqdm pip install realesrgan==0.3.0 ``` Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430 * add text-to-360pano-image pipeline * add text-to-360pano-image pipeline, mod cv requirements * rm redundant files and cv requirements; add standard input and output definations * fix diffusers==0.18.0 and run test * fix diffusers==0.18.0 in multi-modal and run test again * add model_revision='v1.0.0' * fix yapf * add trycatch for enabling xformers * fix key error * add install xformers in test/setup * skip highres.fix in ci * feat: Fix conflict, auto commit by WebIDE	2023-07-27 11:33:39 +08:00
Zackary Shen	ba4db97507	upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407 ) * add vq_compression model * add vq_compression model * check pre-commit for lint test * fix by flake8 * update * update * update * the last update * the laast update * update test_level>=0 --------- Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>	2023-07-26 17:20:13 +08:00
tongmu.wh	ba1a333ba6	add language recognition pipelines and models 新增语种识别pipeline和model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083 * add language recognition pipelines and models * add a clustering method for speaker diarization * define input and output type for language recognition	2023-07-25 21:07:56 +08:00
zeyinzi.jzyz	672c4899e9	add sd swift tuner SD-Tuner base on Swift (LoRA/Adapter/Prompt) Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798 * sd swift tuner * fix pre-checker	2023-07-25 19:00:49 +08:00
shuli.cly	526e1371f5	Merge the speaker-turn-detection codes, local test finished # Speaker Diarization Speaker-Turn Detection CR 和Dialogue-Detection一样，本模型是Speaker Diarization（`audio/speaker diarization`，语音/说话人日志）任务下的一个子模块。本次提交的是基于文本进行判断的模型，本地模型的初始模型基于huggingface训练的，此提交中复用了部分 `nlp/token-classification` 模型的代码。为了方便后续维护以及与nlp方面代码解耦，在model、pipeline以及preprocessor中单独创建了相应模块并重新register。 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13364720 * std first commit * local test pass for speaker-turn-detection * update speaker-turn-detection pipeline task outputs format; update pipeline outputs; update test scripts	2023-07-25 18:57:47 +08:00
hemu.zp	80f76ca475	Support stream output for transformers model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13271136 * support stream for transformers model * set test_level >= 2 * support hf model and chatglm2 * remove streaming_output for chatglm2	2023-07-25 17:41:32 +08:00
tingwei.gtw	d16522723a	[to #42322933 ] add files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13158565 * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * update test data * [to #42322933] add files * Merge remote-tracking branch 'origin' into feature/sal_try_on * [to #42322933] add files * Merge remote-tracking branch 'origin' into feature/sal_try_on	2023-07-24 10:16:29 +08:00
mushenL	f77237b049	add llama2 pipeline (#399 ) * Modify the parameter passing of the text_generation_pipeline class * add llama2 pipeline * add llama pipeline v1.1 * add llama pipeline v1.2 * add llama pipeline v1.3 * add llama pipeline v1.0.4	2023-07-22 21:53:04 +08:00
shuli.cly	13e345f6d9	add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection # Speaker Diarization Dialogue Detection CR 本模型是Speaker Diarization（`audio/speaker diarization`，语音/说话人日志）任务下的一个子模块。本次提交的是基于文本进行判断的模型，其IO和中间过程和 `nlp/text-classification` 很像，且本地模型的初始模型也是基于huggingface训练的，因此此提交中复用了部分 `nlp/text-classification` 模型的代码。为了方便后续维护以及与nlp方面代码解耦，在model、pipeline以及preprocessor中单独创建了相应模块并重新register。 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13269649 * start to add speaker_diarization_dialogue_detection files; Need to change constant and test * add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection * update test case * add comments for speaker diarization dialogue detection pipelines * add outputs type and inputs type for speaker_diarization_dialogue_detection	2023-07-20 19:29:59 +08:00
shenweichao.swc	05c65ba225	add s2net for panorama_depth_estimation Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13310819 * add s2net codes * fix sphdecoder * Merge branch 'master' into dev/cv_s2net_panorama_depth_estimation * revise comments in the pipeline * revise the code	2023-07-20 19:28:03 +08:00
xiangpeng.wxp	4085d821f3	[to #42322933 ] add polylm, a polyglot large language model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13339595	2023-07-20 18:21:07 +08:00
baiguan.yt	ceac129c6b	add parameters height and width for text-to-video Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13171907	2023-07-14 16:22:10 +08:00
yeqinghao.yqh	41cbb8e393	mPLUG-Owl 生成长度Bug修复 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13209284	2023-07-10 18:56:12 +08:00
tongmu.wh	a7f7a67855	fix details of speaker models Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13203011 * fix details of speaker models	2023-07-10 18:54:26 +08:00
chenyafeng.cyf	543d03e32b	3dspeaker Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13199180	2023-07-07 15:00:35 +08:00
wenmeng.zwm	0271b9c256	Merge branch 'master-github' into merge_master_github_0628	2023-06-28 20:27:34 +08:00
Wang Qiang	a018cd6107	Dreambooth method for finetuning stable diffusions (#339 ) * Copyright * dreambooth * dreambooth test trainer * fix bugs * pre-commit --------- Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>	2023-06-28 20:10:28 +08:00
mulin.lyh	1ea9b58447	fix torch2.0 compatible issue Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13086361 * fix face-aligment compatible * fix torch2.0 compatible issue	2023-06-28 14:15:48 +08:00
wenmeng.zwm	9e51920fdb	Merge branch 'master-github' into merge_master_github_0626	2023-06-26 21:04:05 +08:00
wenmeng zhou	6dea1d5646	Fix/citest timeout (#308 ) * timeout for citest set to 240min * update docker image * fix ci template not packed in whl * update docker image version to 1.6.1 and add python3.8 support * randome choose a model for controlnet to avoid oom	2023-06-26 11:23:10 +08:00
chenyafeng.cyf	29062d9f94	eres2net_aug v2 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13046524 * eres2net_aug v2	2023-06-25 18:07:04 +08:00
tongmu.wh	f03c93cda5	add speaker diarization pipeline and improve some speaker pipelines Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12891685 * add new speaker diarization pipeline. improve the capability of speaker pipelines	2023-06-21 17:56:05 +08:00
xingjun.wxj	0db0ec5586	Merge code from github 1. Merge(add) daily regression from github PR (daily_regression.yaml) 2. Add lora stable diffusion from github PR Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13010802 * fix: device arg not work, rename device to ngpu (#272) * Correcting the lora stable diffusion example script (#300) * add vad model and punc model in README.md add vad model and punc model * Merge pull request #302 from modelscope/langgz-patch-1 add vad model and punc model in README.md * add 1.6 * modify ignore * Merge pull request #307 from modelscope/dev_rs_16 Merge release 1.6 * undo datetime to 2099 * Merge pull request #311 from modelscope/fix_master_version undo datetime to 2099 * add daily regression workflow * modify workflow name * fix cron format issue * lora trainer * Merge pull request #315 from liuyhwangyh/add_regression_workflow add daily regression workflow	2023-06-21 10:22:06 +08:00
xingjun.wxj	cc3c384d5e	Fix issues for downloading mplug-youku dataset 1. Optimize downloading meta-csv files for large-scale dataset like mPLUG-youku (> 1GB for meta csv mapping) 2. Add head and overall progress bar for NativeIterableDataset 3. Modify the try-catch info for oss_utils Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12952842	2023-06-15 15:42:21 +08:00
hemu.zp	96c2d42f09	Add StreamingMixin Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12445731 * StreamingMixin poc * update design * Merge branch 'master' into feat/StreamingMixin * add dicstr * make postprocessor input consistent	2023-06-08 19:40:14 +08:00
xixing.tj	1b7e0f50f4	add ocr detection new model db-nas 新增5M的DB-NAS ocr detection 文字检测模型 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12760623 * add ocr detection new model db-nas * add comment	2023-05-31 21:32:46 +08:00
yuanzhi.zyz	10c39b5ce1	add new ocr recognition model (LightweightEdge) and some functions 1. 增加了新轻量化端侧识别模型 LightweightEdge，并把原来CRNN和ConvNextViT的代码整理了 2. 增加batch inference支持 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12787905	2023-05-31 21:16:22 +08:00
chenyafeng.cyf	f6ea3eadea	eres2net Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12640199	2023-05-16 22:28:20 +08:00
yeqinghao.yqh	b9c8c99776	Support mPLUG-Owl model. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12610417	2023-05-15 16:32:46 +08:00

1 2 3 4 5 ...

708 Commits