# Copyright (c) Alibaba, Inc. and its affiliates.
import unittest

from modelscope.hub.snapshot_download import snapshot_download
from modelscope.models import Model
from modelscope.pipelines import pipeline
from modelscope.pipelines.nlp import TextClassificationPipeline
from modelscope.preprocessors import SequenceClassificationPreprocessor
from modelscope.utils.constant import Tasks
from modelscope.utils.demo_utils import DemoCompatibilityCheck
from modelscope.utils.regress_test_utils import IgnoreKeyFn, MsRegressTool
from modelscope.utils.test_utils import test_level


class NLITest(unittest.TestCase, DemoCompatibilityCheck):

    def setUp(self) -> None:
        self.task = Tasks.nli
        self.model_id = 'damo/nlp_structbert_nli_chinese-base'

    sentence1 = '四川商务职业学院和四川财经职业学院哪个好?'
    sentence2 = '四川商务职业学院商务管理在哪个校区?'
    regress_tool = MsRegressTool(baseline=False)
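
    # Downloads the model files to a local cache, then runs the same sentence
    # pair through a directly constructed TextClassificationPipeline and
    # through the pipeline() factory.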
    @unittest.skipUnless(test_level() >= 2, 'skip test in current test level')
    def test_run_with_direct_file_download(self):
        cache_path = snapshot_download(self.model_id)
        tokenizer = SequenceClassificationPreprocessor(cache_path)
        model = Model.from_pretrained(cache_path)
        pipeline1 = TextClassificationPipeline(model, preprocessor=tokenizer)
        pipeline2 = pipeline(Tasks.nli, model=model, preprocessor=tokenizer)
        print(f'sentence1: {self.sentence1}\nsentence2: {self.sentence2}\n'
              f'pipeline1: {pipeline1(input=(self.sentence1, self.sentence2))}')
        print(f'sentence1: {self.sentence1}\nsentence2: {self.sentence2}\n'
              f'pipeline2: {pipeline2(input=(self.sentence1, self.sentence2))}')
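
    # Loads the model from the hub via Model.from_pretrained and passes an
    # explicit preprocessor to the pipeline.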
    @unittest.skipUnless(test_level() >= 2, 'skip test in current test level')
    def test_run_with_model_from_modelhub(self):
        model = Model.from_pretrained(self.model_id)
        tokenizer = SequenceClassificationPreprocessor(model.model_dir)
        pipeline_ins = pipeline(
            task=Tasks.nli, model=model, preprocessor=tokenizer)
        print(pipeline_ins(input=(self.sentence1, self.sentence2)))
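
    # Builds the pipeline from the model name alone and monitors a single
    # forward pass with the regression tool, ignoring intermediate activation
    # outputs in the comparison.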
    @unittest.skipUnless(test_level() >= 0, 'skip test in current test level')
    def test_run_with_model_name(self):
        pipeline_ins = pipeline(task=Tasks.nli, model=self.model_id)
        with self.regress_tool.monitor_module_single_forward(
                pipeline_ins.model,
                'sbert_nli',
                compare_fn=IgnoreKeyFn('.*intermediate_act_fn')):
            print(pipeline_ins(input=(self.sentence1, self.sentence2)))
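
    # Relies on the default model registered for Tasks.nli.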
    @unittest.skipUnless(test_level() >= 2, 'skip test in current test level')
    def test_run_with_default_model(self):
        pipeline_ins = pipeline(task=Tasks.nli)
        print(pipeline_ins(input=(self.sentence1, self.sentence2)))

    @unittest.skip('demo compatibility test is only enabled on a needed-basis')
    def test_demo_compatibility(self):
        self.compatibility_check()


if __name__ == '__main__':
    unittest.main()