tests/pipelines/test_conversational_text_to_sql.py

# Copyright (c) Alibaba, Inc. and its affiliates.
import unittest

from modelscope.hub.snapshot_download import snapshot_download
from modelscope.models import Model
from modelscope.models.nlp import StarForTextToSql
from modelscope.pipelines import pipeline
from modelscope.pipelines.nlp import ConversationalTextToSqlPipeline
from modelscope.preprocessors import ConversationalTextToSqlPreprocessor
from modelscope.utils.constant import Tasks
from modelscope.utils.nlp.space_T_en.utils import \
    text2sql_tracking_and_print_results
from modelscope.utils.test_utils import test_level


@unittest.skip(
    "Compatibility issue: TypeError: edge_subgraph() got an unexpected keyword argument 'preserve_nodes'"
)
class ConversationalTextToSql(unittest.TestCase):

    def setUp(self) -> None:
        self.task = Tasks.table_question_answering
        self.model_id = 'damo/nlp_star_conversational-text-to-sql'

    model_id = 'damo/nlp_star_conversational-text-to-sql'
    test_case = {
        'database_id':
        'employee_hire_evaluation',
        'local_db_path':
        None,
        'utterance': [
            "I'd like to see Shop names.", 'Which of these are hiring?',
            'Which shop is hiring the highest number of employees? | do you want the name of the shop ? | Yes'
        ]
    }

    @unittest.skipUnless(test_level() >= 2, 'skip test in current test level')
    def test_run_by_direct_model_download(self):
        """Build the preprocessor and model from a downloaded snapshot."""
        cache_path = snapshot_download(self.model_id)
        preprocessor = ConversationalTextToSqlPreprocessor(
            model_dir=cache_path,
            database_id=self.test_case['database_id'],
            db_content=True)
        model = StarForTextToSql(
            model_dir=cache_path, config=preprocessor.config)

        pipelines = [
            ConversationalTextToSqlPipeline(
                model=model, preprocessor=preprocessor),
            pipeline(task=self.task, model=model, preprocessor=preprocessor)
        ]
        text2sql_tracking_and_print_results(self.test_case, pipelines)

    @unittest.skipUnless(test_level() >= 0, 'skip test in current test level')
    def test_run_with_model_from_modelhub(self):
        """Load the model with Model.from_pretrained, then build pipelines."""
        model = Model.from_pretrained(self.model_id)
        preprocessor = ConversationalTextToSqlPreprocessor(
            model_dir=model.model_dir)

        pipelines = [
            ConversationalTextToSqlPipeline(
                model=model, preprocessor=preprocessor),
            pipeline(task=self.task, model=model, preprocessor=preprocessor)
        ]
        text2sql_tracking_and_print_results(self.test_case, pipelines)

    @unittest.skipUnless(test_level() >= 0, 'skip test in current test level')
    def test_run_with_model_name(self):
        """Build the pipeline from the task name and model id only."""
        pipelines = [pipeline(task=self.task, model=self.model_id)]
        text2sql_tracking_and_print_results(self.test_case, pipelines)


if __name__ == '__main__':
    unittest.main()