Character Identification on Dialogue with Neural Coreference Resolution

Introduction

This repository is to accomplish a shared task in SemEval 2018 - Task 4: Character Identification on Multiparty Dialogues. Main references are listed in the following:

SemEval 2018 - Task 4: Character Identification on Multiparty Dialogues
End-to-end Neural Coreference Resolution
- A demo of the code can be found here: http://www.kentonl.com/e2e-coref.
Robust Coreference Resolution and Entity Linking on Dialogues: Character Identification on TV Show Transcripts

Requirements

Python 2.7
- TensorFlow 1.4.0
- pyhocon (for parsing the configurations)
- NLTK (for sentence splitting and tokenization in the demo)

pip install -r requirements.txt

Setting Up

Download pretrained word embeddings and build custom kernels by running setup_all.sh.
- There are 3 platform-dependent ways to build custom TensorFlow kernels. Please comment/uncomment the appropriate lines in the script.
Run one of the following:
- To use the pretrained model only, run setup_pretrained.sh
- To train your own models, run setup_training.sh
  - This assumes access to OntoNotes 5.0. Please edit the ontonotes_path variable.

Training Instructions

Coreference Resolution

Experiment configurations are found in experiments.conf
Choose an experiment that you would like to run, e.g. best
For a single-machine experiment, run the following two commands:
- python singleton.py <experiment>
- python evaluator.py <experiment>
For a distributed multi-gpu experiment, edit the cluster property of the configuration and run the following commands:
- python parameter_server.py <experiment>
- python worker.py <experiment> (for every worker in your cluster)
- python evaluator.py <experiment> (on the same machine as your first worker)
Results are stored in the logs directory and can be viewed via TensorBoard.
For final evaluation of the checkpoint with the maximum dev F1:
- Run python test_single.py <experiment> for the single-model evaluation.
- Run python test_ensemble.py <experiment1> <experiment2> <experiment3>... for the ensemble-model evaluation.

Entity Linking

Prepare mention embedding data:
- python entity_linking_helper.py <experiment>
Train entity linking model:
- python entity_linking_train.py <experiment>
Test entity linkning model:
- python entity_linking_test.py <experiment>

Demo Instructions

For the command-line demo with the pretrained model:
- Run python demo.py final
For the web demo with the pretrained model:
- Run python demo.py final 8080
- Edit the URL at the end of docs/main.js to point to the demo location, e.g. localhost:8080
- Open docs/index.html in a web browser.
To run the demo with other experiments, replace final with your configuration name.

Name		Name	Last commit message	Last commit date
Latest commit History 143 Commits
docs		docs
friends.train.trial		friends.train.trial
html		html
logs/best		logs/best
report		report
viz		viz
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
char_vocab.english.txt		char_vocab.english.txt
conll-2012.zip		conll-2012.zip
conll.py		conll.py
coref_kernels.cc		coref_kernels.cc
coref_kernels.so		coref_kernels.so
coref_model.py		coref_model.py
coref_ops.py		coref_ops.py
decoder.py		decoder.py
demo.py		demo.py
dev.english.jsonlines		dev.english.jsonlines
dev.english.v4_auto_conll		dev.english.v4_auto_conll
entity_linking_helper.py		entity_linking_helper.py
entity_linking_test.py		entity_linking_test.py
entity_linking_train.py		entity_linking_train.py
evaluate.py		evaluate.py
evaluator.py		evaluator.py
experiments.conf		experiments.conf
filter_embeddings.py		filter_embeddings.py
get_char_vocab.py		get_char_vocab.py
glove.840B.300d.txt.filtered		glove.840B.300d.txt.filtered
launch.py		launch.py
metrics.py		metrics.py
minimize.py		minimize.py
parameter_server.py		parameter_server.py
setup_all.sh		setup_all.sh
setup_pretrained.sh		setup_pretrained.sh
setup_training.sh		setup_training.sh
singleton.py		singleton.py
test.english.jsonlines		test.english.jsonlines
test.english.v4_auto_conll		test.english.v4_auto_conll
test_ensemble.py		test_ensemble.py
test_single.py		test_single.py
train.english.jsonlines		train.english.jsonlines
train.english.v4_auto_conll		train.english.v4_auto_conll
util.py		util.py
worker.py		worker.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Character Identification on Dialogue with Neural Coreference Resolution

Introduction

Requirements

Setting Up

Training Instructions

Coreference Resolution

Entity Linking

Demo Instructions

Contact

About

Uh oh!

Releases

Packages

Languages

License

Douglasli/e2e-coref

Folders and files

Latest commit

History

Repository files navigation

Character Identification on Dialogue with Neural Coreference Resolution

Introduction

Requirements

Setting Up

Training Instructions

Coreference Resolution

Entity Linking

Demo Instructions

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages