modelscope

mirror of https://github.com/modelscope/modelscope.git synced 2026-02-24 20:19:51 +01:00

Author	SHA1	Message	Date
Wang Qiang	ee8afd2d62	VideoComposer: Compositional Video Synthesis with Motion Controllability (#431 ) * VideoComposer: Compositional Video Synthesis with Motion Controllability * videocomposer pipeline * pre commit * delete xformers	2023-08-15 12:01:03 +08:00
Jintao	18d33a4825	fix copytree python37 bug (#464 ) * fix copytree python37 bug * add copytree_py37 function	2023-08-14 11:45:33 +08:00
wenmeng zhou	74d8317bb0	fix pipeline check error (#455 ) * fix pipeline check error * update	2023-08-11 15:52:53 +08:00
Ran Zhou	026a9ef227	Add machine reading comprehension model, preprocessor and pipeline (#451 ) * Add machine reading comprehension model, preprocessor and pipeline * fix precommit errors * Optimize mrc preprocessor, add mrc input output definition, add mrc pipeline docstr --------- Co-authored-by: seadamo <ran.zhou@alibaba-inc.com>	2023-08-11 13:47:26 +08:00
chenyafeng.cyf	33605de759	eres2net_lre_v2 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13602081 * add eres2net_base_large_lre * eres2net_language_identification * eres2net_lre_v2	2023-08-10 17:44:43 +08:00
liuyhwangyh	75a14a36ba	get github diff files (#446 ) * get github diff files * add github environment --------- Co-authored-by: mulin.lyh <mulin.lyh@taobao.com>	2023-08-10 16:22:37 +08:00
wenmeng.zwm	725521a2af	skip test_text_to_360panorama_image test	2023-08-08 16:41:19 +08:00
zsl01670416	b0699fd8e2	support llama2 inputs to device in function generate fix error inputs and model were not on the same device. if they are not on the same device, inputs will be implemented function to model device. Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13546989 * support llama inputs to device in function generate * modify test qwen text generation according to github code	2023-08-07 15:41:28 +08:00
lukeming.lkm	9e033104af	change use_fast_att and fix bfloat loading Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13522792 * check flash att is installed even if use_fast_att is set True and fix bfloat loading * skip pipeline model placement for quantization * update unittest for qwen	2023-08-03 15:59:51 +08:00
lukeming.lkm	b3a61ef6f4	update LICENSE Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13516119	2023-08-03 11:07:34 +08:00
lukeming.lkm	bd2f70a6eb	add quantization in qwen pipelines and relevant unittests Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13499600 * add quant features * resolve import * resolve format * fix save vocab	2023-08-02 14:05:13 +08:00
lukeming.lkm	33bd74a7be	add qwen 7b base and chat 添加QWen 7b base模型和chat模型及相关pipelines Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13482235 * add qwen 7b base and chat * fix logger * update examples, lint test * add unittest for qwen base and chat * rename qwen to qwen-7b * resolve imports and add a registry to text-generation * reset load model from pretrained * fix precheck * skip qwen test case now * remove strange file	2023-08-02 09:25:21 +08:00
suluyana	b68b90ba15	skip plugin	2023-07-30 00:30:30 +08:00
suluyana	9ece90ee84	skip plugin test case	2023-07-29 21:35:21 +08:00
suluyan.sly	05e1357c32	Merge branch 'master-github' into master-merge-github-230728	2023-07-28 16:40:34 +08:00
wenmeng.zwm	3b485d5835	fix plugin python module missing files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13453749 * fix plugin python module missing files	2023-07-28 16:14:56 +08:00
frozoul	2566d028cd	cv/cv nerf 3d reconstruction 4k nerf damo (#389 ) * add 4k-nerf core files * update configure file * update dataloader and model path * update unittest * Delete test_4k.py * update unittest * update unittest * update pre-commit * update dataloader * update cuda code path * check with pre-commit --------- Co-authored-by: zhongshu.wzs <zhongshu.wzs@alibaba-inc.com> Co-authored-by: wenmeng zhou <wenmeng.zwm@alibaba-inc.com>	2023-07-28 10:37:13 +08:00
tongmu.wh	475924a421	correct language recognition taks name modelscope平台同学最终定下语种识别任务名为speech-language-recognition，对应进行代码中的相关改动 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13444517 * correct language recognition taks name	2023-07-27 21:03:04 +08:00
suluyan.sly	1c6f5fe775	Merge branch 'master-github' into master-merge-github-230727 Conflicts: examples/pytorch/baichuan/finetune_baichuan.py examples/pytorch/chatglm6b/finetune.py	2023-07-27 17:29:27 +08:00
mengyang.fmy	18f998a85c	add text-to-360pano-image pipeline, mod cv requirements 7月份计划上线的360全景图生成模型，自研模型权重文件地址https://www.modelscope.cn/models/damo/cv_diffusion_text-to-360panorama-image_generation/summary #### 依赖项说明 ##### 由于要使用xformers，torch版本最好使用1.13.1 ``` pip install torch==1.13.1+cu116 torchvision==0.14.1+cu116 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu116 ``` ##### 对应的diffusers和xformers版本如下 ``` pip install -U diffusers==0.18.0 pip install xformers==0.0.16 pip install triton, accelerate, transformers ``` ##### ModelScope Library 需要使用cv ``` pip install modelscope pip install "modelscope[cv]" -f https://modelscope.oss-cn-beijing.aliyuncs.com/releases/repo.html ``` ##### 此外，还需要安装第三方的一个库，Real-ESRGAN，安装方法如下 ``` # Install basicsr - https://github.com/xinntao/BasicSR # We use BasicSR for both training and inference pip install basicsr # facexlib and gfpgan are for face enhancement pip install facexlib pip install gfpgan pip install Pillow pip install tqdm pip install realesrgan==0.3.0 ``` Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13346430 * add text-to-360pano-image pipeline * add text-to-360pano-image pipeline, mod cv requirements * rm redundant files and cv requirements; add standard input and output definations * fix diffusers==0.18.0 and run test * fix diffusers==0.18.0 in multi-modal and run test again * add model_revision='v1.0.0' * fix yapf * add trycatch for enabling xformers * fix key error * add install xformers in test/setup * skip highres.fix in ci * feat: Fix conflict, auto commit by WebIDE	2023-07-27 11:33:39 +08:00
Wang Qiang	66cf72a75c	Merge pull request #376 from XDUWQ/custom_diffusion Custom method for finetuning stable diffusion	2023-07-27 10:41:38 +08:00
Zackary Shen	ba4db97507	upload cv_nerf_3d-reconstruction_vector-quantize-compression (#407 ) * add vq_compression model * add vq_compression model * check pre-commit for lint test * fix by flake8 * update * update * update * the last update * the laast update * update test_level>=0 --------- Co-authored-by: 剑匣 <zackary.sz@alibaba-inc.com>	2023-07-26 17:20:13 +08:00
tongmu.wh	ba1a333ba6	add language recognition pipelines and models 新增语种识别pipeline和model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13385083 * add language recognition pipelines and models * add a clustering method for speaker diarization * define input and output type for language recognition	2023-07-25 21:07:56 +08:00
zeyinzi.jzyz	672c4899e9	add sd swift tuner SD-Tuner base on Swift (LoRA/Adapter/Prompt) Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13380798 * sd swift tuner * fix pre-checker	2023-07-25 19:00:49 +08:00
shuli.cly	526e1371f5	Merge the speaker-turn-detection codes, local test finished # Speaker Diarization Speaker-Turn Detection CR 和Dialogue-Detection一样，本模型是Speaker Diarization（`audio/speaker diarization`，语音/说话人日志）任务下的一个子模块。本次提交的是基于文本进行判断的模型，本地模型的初始模型基于huggingface训练的，此提交中复用了部分 `nlp/token-classification` 模型的代码。为了方便后续维护以及与nlp方面代码解耦，在model、pipeline以及preprocessor中单独创建了相应模块并重新register。 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13364720 * std first commit * local test pass for speaker-turn-detection * update speaker-turn-detection pipeline task outputs format; update pipeline outputs; update test scripts	2023-07-25 18:57:47 +08:00
hemu.zp	80f76ca475	Support stream output for transformers model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13271136 * support stream for transformers model * set test_level >= 2 * support hf model and chatglm2 * remove streaming_output for chatglm2	2023-07-25 17:41:32 +08:00
wenmeng zhou	64203e89ee	Compatibility for huggingface transformers (#391 )	2023-07-24 20:53:27 +08:00
XDUWQ	8e00d85317	fix bugs	2023-07-24 19:46:22 +08:00
lylalala	f805d86aed	llama2 support chat (#404 ) * support chat * update llama2 chat testcase * add gen kwargs and devices * update unittest and support max_length in multi-turn dialogue	2023-07-24 15:38:01 +08:00
tingwei.gtw	d16522723a	[to #42322933 ] add files Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13158565 * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * [to #42322933] add files * update test data * [to #42322933] add files * Merge remote-tracking branch 'origin' into feature/sal_try_on * [to #42322933] add files * Merge remote-tracking branch 'origin' into feature/sal_try_on	2023-07-24 10:16:29 +08:00
mushenL	f77237b049	add llama2 pipeline (#399 ) * Modify the parameter passing of the text_generation_pipeline class * add llama2 pipeline * add llama pipeline v1.1 * add llama pipeline v1.2 * add llama pipeline v1.3 * add llama pipeline v1.0.4	2023-07-22 21:53:04 +08:00
shuli.cly	13e345f6d9	add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection # Speaker Diarization Dialogue Detection CR 本模型是Speaker Diarization（`audio/speaker diarization`，语音/说话人日志）任务下的一个子模块。本次提交的是基于文本进行判断的模型，其IO和中间过程和 `nlp/text-classification` 很像，且本地模型的初始模型也是基于huggingface训练的，因此此提交中复用了部分 `nlp/text-classification` 模型的代码。为了方便后续维护以及与nlp方面代码解耦，在model、pipeline以及preprocessor中单独创建了相应模块并重新register。 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13269649 * start to add speaker_diarization_dialogue_detection files; Need to change constant and test * add sv/speaker_diarization_dialogue_detection to branch sv/semantic-dialogue-detection * update test case * add comments for speaker diarization dialogue detection pipelines * add outputs type and inputs type for speaker_diarization_dialogue_detection	2023-07-20 19:29:59 +08:00
shenweichao.swc	05c65ba225	add s2net for panorama_depth_estimation Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13310819 * add s2net codes * fix sphdecoder * Merge branch 'master' into dev/cv_s2net_panorama_depth_estimation * revise comments in the pipeline * revise the code	2023-07-20 19:28:03 +08:00
xiangpeng.wxp	4085d821f3	[to #42322933 ] add polylm, a polyglot large language model Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13339595	2023-07-20 18:21:07 +08:00
XDUWQ	66795aa3ff	change tests level	2023-07-19 09:41:21 +08:00
Wang Qiang	0b85979f2e	Update diffusers version to 0.18.0 (#377 ) * update diffusers to 0.18.0 * fix bugs	2023-07-14 19:02:52 +08:00
baiguan.yt	ceac129c6b	add parameters height and width for text-to-video Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13171907	2023-07-14 16:22:10 +08:00
XDUWQ	34ab717393	custom_diffusion	2023-07-12 19:47:32 +08:00
XDUWQ	1caa45422c	custom diffusion	2023-07-11 20:46:32 +08:00
yeqinghao.yqh	41cbb8e393	mPLUG-Owl 生成长度Bug修复 Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13209284	2023-07-10 18:56:12 +08:00
tongmu.wh	a7f7a67855	fix details of speaker models Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13203011 * fix details of speaker models	2023-07-10 18:54:26 +08:00
chenyafeng.cyf	543d03e32b	3dspeaker Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13199180	2023-07-07 15:00:35 +08:00
wenmeng.zwm	0271b9c256	Merge branch 'master-github' into merge_master_github_0628	2023-06-28 20:27:34 +08:00
Wang Qiang	a018cd6107	Dreambooth method for finetuning stable diffusions (#339 ) * Copyright * dreambooth * dreambooth test trainer * fix bugs * pre-commit --------- Co-authored-by: 翊靖 <yijing.wq@alibaba-inc.com>	2023-06-28 20:10:28 +08:00
mulin.lyh	1ea9b58447	fix torch2.0 compatible issue Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13086361 * fix face-aligment compatible * fix torch2.0 compatible issue	2023-06-28 14:15:48 +08:00
Wang Qiang	6942144ad7	Stable Diffusion model checkpoint export to onnx. (#340 ) * stable diffusion export onnx * fix pre commit bugs * fix bugs * safety checker support * test export stable diffusion	2023-06-28 13:26:19 +08:00
yuze.zyz	8f18274f75	Add teardown for tests Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12643554 * add teardown for tests * add teardown for dialog_modeling_trainer,document_grounded_dialog_generate_trainer,document_grounded_dialog_rerank_trainer,document_grounded_dialog_retrieval_trainer,training_args,translation_evaluation_trainer,translation_trainer	2023-06-28 09:44:44 +08:00
mulin.lyh	eb0f0216c6	fix torch 2.x compatible issue Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13045011 * fix torch 2.x compatible issue * fix torch 2.x compatible issue * fix complex-valued input tensor matching the output from stft with return_complex=True. * skip plugin test temporarily for modify torch version * fix test_speech_signal_process.py compatible issue * fix lint issue * upgrade funasr to 0.6.5	2023-06-27 14:40:51 +08:00
yuze.zyz	a58be34384	Add Lora/Adapter/Prompt and support for chatglm6B and chatglm2-6B Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/12770413 * add prompt and lora * add adapter * add prefix * add tests * adapter smoke test passed * prompt test passed * support model id in petl * migrate chatglm6b * add train script for chatglm6b * move gen_kwargs to finetune.py * add chatglm2 * add model definination	2023-06-27 14:38:18 +08:00
xingjun.wxj	1dbff6cb48	Support jsonl format in meta data Link: https://code.alibaba-inc.com/Ali-MaaS/MaaS-lib/codereview/13071970 * support jsonl in meta * add UT and refine fetch_meta_files_from_url	2023-06-27 11:58:19 +08:00

... 2 3 4 5 6 ...

1158 Commits