langtest
langtest copied to clipboard
Feat/implement mts dialog based clinical summary evaluation
trafficstars
This pull request introduces several changes to the langtest module, primarily focusing on enhancing functionality and improving code structure. The most important changes include the addition of dialogue-related columns, the introduction of a new SummaryEval class for evaluating clinical summaries, and the correction of a recurring typo in class names.
Enhancements to functionality:
- Added
dialoguecolumn to various DataFrame manipulations inlangtest/langtest.pyto support new data types. [1] [2] - Introduced
SummaryEvalclass inlangtest/metrics/llm_eval.pyfor evaluating clinical summaries generated from doctor-patient dialogues.
Code structure improvements:
- Corrected the typo "BaseClincial" to "BaseClinical" throughout
langtest/transform/clinical.pyto ensure consistency and clarity. [1] [2] - Added new imports and type hints to improve code readability and type safety in
langtest/modelhandler/llm_modelhandler.pyandlangtest/modelhandler/modelhandler.py. [1] [2] - Updated the
tqdmimport inlangtest/transform/base.pyto usetqdm.autofor better compatibility.