* support chat
* update llama2 chat testcase
* add gen kwargs and devices
* update unittest and support max_length in multi-turn dialogue