ConvoKit issues

Storage abstraction

1

### Description Introduces a new layer of abstraction between Corpus components (`Utterance`, `Speaker`, `Conversation`, `ConvoKitMeta`) and concrete data storage. Data storage is now handled by a `StorageManager` instance variable in...

jpwchang

New corpus utilities

Allow users to now create an empty corpus should they choose to do so by simply using Corpus(). Also implements further utilities such as adding individual utterances and adding individual...

oscarso2000

Supreme Court Oral Arguments Corpus: Update Years

5

For a recent project I'm working on, we're using [ConvoKit's implementation of the Supreme Court Oral Argument Corpus](https://convokit.cornell.edu/documentation/supreme.html). However, we'd really like to include data from after 2019. How difficult...

kakeith

Add .summarize() to Pairer

Implement .summarize() in Pairer.

khonzoda

Add workflow for running all Jupyter notebooks

We want to automate the running of all Jupyter notebooks on every change to ensure that notebooks run cleanly (and to serve as additional validation for the new changes). Some...

calebchiam

Add support for Python 3.9/3.10

calebchiam

Remove deprecated arguments from various constructors and functions

The current practice of leaving in deprecated constructor arguments is actually a bad practice because it can result in confusion when an informative IDE provides argument hints, for example: We...

calebchiam

enhancement

good first issue

WikiConv Chinese Dataset

1

I see the original WikiConv paper says there were conversations in Chinese collected, are these available through ConvoKit?

thomaspzollo

good first issue

Unable to add dependency parses

5

Hi Caleb @calebchiam, I'm trying to perform politeness prediction using the example notebook given [here](https://github.com/CornellNLP/Cornell-Conversational-Analysis-Toolkit/blob/master/examples/politeness-strategies/politeness_demo.ipynb). I run into some errors while adding dependency parses. Currently, I'm doing ``` from convokit...

BonJovi1

Use tqdm whenever iterating through speakers / conversations / utterances

Would be a nice QoL update to have tqdm used by default, especially when it comes to processing larger corpora. This could be a default argument in the `iter_()` methods.

calebchiam

enhancement

good first issue

ConvoKit
ConvoKit copied to clipboard

Metadata

Storage abstraction

New corpus utilities

Supreme Court Oral Arguments Corpus: Update Years

Add .summarize() to Pairer

Add workflow for running all Jupyter notebooks

Add support for Python 3.9/3.10

Remove deprecated arguments from various constructors and functions

WikiConv Chinese Dataset

Unable to add dependency parses

Use tqdm whenever iterating through speakers / conversations / utterances

← Metadata

Owner

Metadata

ConvoKit ConvoKit copied to clipboard

Metadata

← Metadata

Owner

Metadata

ConvoKit
ConvoKit copied to clipboard