Official repository to release the code and datasets in the paper, "Article Reranking by Memory-enhanced Key Sentence Matching for Detecting Previously Fact-checked Claims", ACL-IJCNLP 2021.

ICTMCG/MTM

Pipeline

Weibo Dataset

Preprocess

BM25

Run MTM/preprocess/BM25/Weibo.ipynb to compute the BM25 ranking matrix, MTM/preprocess/BM25/data/bm25_scores_(#Claims, #Articles).npy.
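
As a rough illustration of what a (#Claims, #Articles) BM25 matrix contains, here is a minimal pure-Python sketch of Okapi BM25 scoring. It is not the code in Weibo.ipynb: the function name `bm25_matrix`, the parameters `k1` and `b`, and the toy pre-tokenized inputs are all assumptions made for this example.

```python
import math
from collections import Counter

def bm25_matrix(claims, articles, k1=1.5, b=0.75):
    """Return scores[i][j]: BM25 score of article j for claim i.

    Inputs are lists of token lists. k1 and b are common defaults,
    assumed here; the notebook may use different values.
    """
    n_docs = len(articles)
    avg_len = sum(len(a) for a in articles) / n_docs
    # Document frequency: number of articles that contain each term.
    df = Counter(term for a in articles for term in set(a))
    idf = {t: math.log((n_docs - f + 0.5) / (f + 0.5) + 1.0) for t, f in df.items()}
    tfs = [Counter(a) for a in articles]
    scores = []
    for claim in claims:
        row = []
        for tf, art in zip(tfs, articles):
            # Length normalization term for this article.
            norm = k1 * (1 - b + b * len(art) / avg_len)
            row.append(sum(idf[t] * tf[t] * (k1 + 1) / (tf[t] + norm)
                           for t in claim if t in tf))
        scores.append(row)
    return scores

# Toy data (hypothetical); the real pipeline works on the Weibo corpus.
claims = [["vaccine", "causes", "autism"], ["moon", "landing", "fake"]]
articles = [
    ["study", "finds", "no", "link", "between", "vaccine", "and", "autism"],
    ["moon", "landing", "footage", "is", "authentic"],
    ["weather", "report", "for", "today"],
]
scores = bm25_matrix(claims, articles)
```

Each row of `scores` can then be sorted to get a candidate article ranking for the corresponding claim.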

Dataset Splits

MTM/dataset/Weibo/splits/data_splits.ipynb

  1. Split by event
  2. Split by claim
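
The two split strategies can be sketched as follows. This is an illustration only, not the notebook's code: the `event` field, the 60/20/20 ratios, and the function names are hypothetical. The assumed difference is that splitting by event keeps all claims of one event in the same partition, while splitting by claim shuffles claims individually.

```python
import random

def split_by_claim(items, ratios=(0.6, 0.2, 0.2), seed=0):
    """Shuffle items individually and cut them into train/val/test."""
    items = list(items)
    random.Random(seed).shuffle(items)
    n_train = int(len(items) * ratios[0])
    n_val = int(len(items) * ratios[1])
    return items[:n_train], items[n_train:n_train + n_val], items[n_train + n_val:]

def split_by_event(claims, ratios=(0.6, 0.2, 0.2), seed=0):
    """Split the set of events, then assign every claim to its event's partition."""
    events = sorted({c["event"] for c in claims})
    parts = split_by_claim(events, ratios, seed)
    return tuple([c for c in claims if c["event"] in set(evs)] for evs in parts)

# Toy claims (hypothetical): 20 claims spread over 5 events.
claims = [{"id": i, "event": f"e{i % 5}"} for i in range(20)]
train, val, test = split_by_event(claims)
```

With the event-level split, no event appears in more than one partition, so evaluation measures generalization to unseen events.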

Tokenize

MTM/preprocess/tokenize/run.sh

ROT

MTM/preprocess/ROT/run.sh

  1. Prepare RougeBert's training data
  2. Training
  3. Vectorize the claims and articles (get embeddings)
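
The name RougeBert suggests that the training data pairs claims with article sentences and uses a ROUGE score as the supervision signal. Under that assumption (the exact data format is defined by the scripts called from run.sh, not by this sketch), a token-level ROUGE-2 F1 could be computed like this. The helper names and toy tokens are hypothetical.

```python
from collections import Counter

def ngrams(tokens, n):
    """Multiset of n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def rouge_n_f1(candidate, reference, n=2):
    """ROUGE-N F1 between two token lists, using clipped n-gram overlap."""
    cand, ref = ngrams(candidate, n), ngrams(reference, n)
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)

# Toy example (hypothetical): score each article sentence against a claim.
claim = ["the", "moon", "landing", "was", "staged"]
sentences = [
    ["claims", "that", "the", "moon", "landing", "was", "staged", "are", "false"],
    ["the", "weather", "is", "fine"],
]
pairs = [(claim, s, rouge_n_f1(s, claim)) for s in sentences]
```

Each `(claim, sentence, score)` triple could then serve as one regression example for fine-tuning.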

PMB

  1. Run MTM/preprocess/PMB/calculate_init_thresholds.ipynb to prepare the clustering data, MTM/preprocess/PMB/data/Weibo/clustering_training_data_[TS_SMALL]<dist<[TS_LARGE].pkl.
  2. Run K-means clustering: MTM/preprocess/PMB/run.sh
  3. Inspect sample cases of key sentence selection: MTM/preprocess/PMB/key_sentences_selection_cases.ipynb
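
For readers unfamiliar with the clustering step, here is a minimal pure-Python sketch of K-means (Lloyd's algorithm). It is not the implementation invoked by run.sh, which may rely on a library and operates on the vectors stored in the .pkl file. The function name, the toy 2-D points, and the choice of `k` are assumptions for illustration.

```python
import random

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's algorithm; returns (centroids, assignments)."""
    rng = random.Random(seed)
    # Initialize centroids from k distinct input points.
    centroids = [list(p) for p in rng.sample(points, k)]
    assign = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        assign = [
            min(range(k),
                key=lambda c: sum((x - y) ** 2 for x, y in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return centroids, assign

# Toy vectors (hypothetical) forming two well-separated groups.
points = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2), (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
centroids, assign = kmeans(points, k=2)
```

The resulting centroids play the role of cluster prototypes; in the real pipeline they would be computed from the prepared clustering data rather than from toy points.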

Training

MTM/model/run.sh

Twitter Dataset
