unsupervised_topic_segmentation
example of input dataframe
Thank you for the great work. Could you please provide an example of the inputs, i.e., what should the dataframe look like? More specifically, what is caption_col_name?
df: pd.DataFrame,
meeting_id_col_name: str,
start_col_name: str,
end_col_name: str,
caption_col_name: str,
topic_segmentation_config: TopicSegmentationConfig,
Also, what hyperparameters did you use for textiling_hyperparameters?
This data structure is tailored to the datasets stored in our internal database, so it is better to adapt the dataframe to your own data format.
I pushed the hyperparameter values we had.
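For anyone looking for a concrete starting point, here is a minimal sketch of what such a dataframe might look like. The column names (`meeting_id`, `start_time`, `end_time`, `caption`), the sample rows, and the use of seconds for the timestamps are all assumptions for illustration; the internal format is not described in this thread. Going by the parameter name, `caption_col_name` presumably names the column that holds the transcript text of each utterance.

```python
import pandas as pd

# Hypothetical example data: one row per caption (utterance) in a meeting.
# Column names and the seconds-based timestamps are assumptions, not the
# repository's documented format.
df = pd.DataFrame(
    {
        "meeting_id": ["m1", "m1", "m1", "m2"],
        "start_time": [0.0, 4.2, 9.8, 0.0],
        "end_time": [4.2, 9.8, 15.1, 3.5],
        "caption": [
            "Let's start with the budget.",
            "We overspent on hardware last quarter.",
            "Next topic: hiring plans.",
            "Welcome everyone to the design review.",
        ],
    }
)

# Keep captions in chronological order within each meeting.
df = df.sort_values(["meeting_id", "start_time"]).reset_index(drop=True)

# The *_col_name arguments from the signature would then point at these columns.
column_names = {
    "meeting_id_col_name": "meeting_id",
    "start_col_name": "start_time",
    "end_col_name": "end_time",
    "caption_col_name": "caption",
}
print(df)
```

The dictionary only shows how the column-name arguments would map onto this frame; the `topic_segmentation_config` argument still has to be built from the repository's own classes.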
Hello! Could you please share the data format of the start_col_name and end_col_name columns so that the algorithm works correctly? I'm a DS intern working on implementing your algorithm for the Russian language.
You can use this dataset: https://github.com/Yale-LILY/QMSum
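As a rough sketch of how a transcript from such a dataset could be flattened into rows for the dataframe: the code below assumes each meeting is a JSON object with a `meeting_transcripts` list of `speaker`/`content` entries and no per-utterance timestamps, so please check this against the files you actually download. Using the turn index as a stand-in for the start and end values is my own assumption, and the thread does not confirm that the algorithm accepts it. The helper `transcript_to_rows` is hypothetical and not part of the repository.

```python
from typing import Dict, List


def transcript_to_rows(meeting_id: str, meeting: Dict) -> List[Dict]:
    """Flatten one meeting into one row per speaker turn.

    Assumes a "meeting_transcripts" list of {"speaker", "content"} dicts.
    Turn indices stand in for timestamps (an assumption for illustration).
    """
    rows = []
    for index, turn in enumerate(meeting["meeting_transcripts"]):
        rows.append(
            {
                "meeting_id": meeting_id,
                "start": index,      # placeholder for a real start time
                "end": index + 1,    # placeholder for a real end time
                "caption": turn["content"],
            }
        )
    return rows


# Hypothetical sample in the assumed layout, not real dataset content.
sample_meeting = {
    "meeting_transcripts": [
        {"speaker": "A", "content": "Let's start with the budget."},
        {"speaker": "B", "content": "We overspent on hardware."},
        {"speaker": "A", "content": "Next topic: hiring plans."},
    ]
}

rows = transcript_to_rows("sample", sample_meeting)
for row in rows:
    print(row)
```

The resulting list of dicts can be passed to `pd.DataFrame(rows)`, with `start`, `end`, and `caption` supplied as the corresponding column-name arguments.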