TILFA: A Unified Framework for Text, Image, and Layout Fusion in Argument Mining

This repository is the official implementation of TILFA: A Unified Framework for Text, Image, and Layout Fusion in Argument Mining.

The paper is accepted to the Proceedings of the 10th Workshop on Argument Mining 2023.

Abstract

A main goal of Argument Mining (AM) is to analyze an author's stance. Unlike previous AM datasets focusing only on text, the shared task at the 10th Workshop on Argument Mining introduces a dataset including both text and images. Importantly, these images contain both visual elements and optical characters. Our new framework, TILFA (A Unified Framework for Text, Image, and Layout Fusion in Argument Mining), is designed to handle this mixed data. It excels at not only understanding text but also detecting optical characters and recognizing layout details in images. Our model significantly outperforms existing baselines, earning our team, KnowComp, the 1st place in the leaderboard of Argumentative Stance Classification subtask in this shared task.

An Overview of Our Method

Requirements

Python version is 3.7

requirements:

apex==0.9.10dev
boto3==1.28.10
botocore==1.31.10
datasets==2.3.2
detectron2==0.6+cu111
imbalanced_learn==0.10.1
imblearn==0.0
inflect==7.0.0
lxml==4.9.2
matplotlib==3.5.3
nltk==3.8.1
numpy==1.21.6
opencv_python==4.8.0.74
pandas==1.1.5
Pillow==9.5.0
Pillow==10.0.1
preprocessor==1.1.3
ptvsd==4.3.2
pytesseract==0.3.10
Requests==2.31.0
scikit_learn==1.0.2
spacy==2.2.1
stweet==2.1.1
tensorflow==2.14.0
textblob==0.17.1
timm==0.4.12
torch==1.10.0+cu111
torchvision==0.11.1+cu111
tqdm==4.65.0
transformers==4.12.5
tweet_preprocessor==0.6.0
websocket_client==1.6.3

You can install all requirements with the command

pip install -r requirements.txt

Run TILFA Framework

Step 1: Training

training examples can be found in ./run.sh

pure text

python3 main_text_alltrain.py 
--exp-dir=YOUR_EXPERIMENT_PATH
--num-epochs=25 
--batch-size=16 
--exp-mode=0 
--data-mode=0 
--lr=5e-6 
--img-model=0 
--text-model-name=microsoft/deberta-v3-large

pure image

python3 main_image_alltrain.py 
--exp-dir=YOUR_EXPERIMENT_PATH
--num-epochs=25 
--batch-size=16 
--exp-mode=0 
--data-mode=1 
--lr=1e-6 
--img-model=0 
--text-model-name=microsoft/deberta-v3-large

original multimodality

python3 main_multimodality_alltrain.py 
--exp-dir=YOUR_EXPERIMENT_PATH
--num-epochs=25 
--batch-size=16 
--exp-mode=0 
--data-mode=2 
--lr=1e-5 
--img-model=1 
--text-model-name=microsoft/deberta-v3-large 
--use-pooler=0 
--use-wordnet=1

pure layout

python3 main_layoutlmv3_alltrain.py 
--data_dir=./data 
--output_dir=YOUR_EXPERIMENT_PATH 
--do_train 
--do_eval 
--do_predict 
--model_name_or_path=microsoft/layoutlmv3-base 
--visual_embed 
--num_train_epochs=25 
--input_size=224 
--learning_rate=1e-5 
--per_gpu_train_batch_size=8 
--per_gpu_eval_batch_size=8 
--seed=22 
--gradient_accumulation_steps=1 
--text_model_name_or_path=microsoft/deberta-v3-large

layout multimodality

python3 main_multimodality_layoutlmv3_alltrain.py 
--data_dir=./data 
--output_dir=/home/data/zwanggy/2023/image_arg_experiments 
--do_train 
--do_eval 
--model_name_or_path=microsoft/layoutlmv3-base 
--visual_embed 
--num_train_epochs=25 
--input_size=224 
--learning_rate=1e-5 
--per_gpu_train_batch_size=4 
--per_gpu_eval_batch_size=4 
--seed=22  
--gradient_accumulation_steps=1 
--text_model_name_or_path=microsoft/deberta-v3-large 
--exp_mode=0  
--use_wordnet=1 
--use_pooler=0 
--cross_attn_type=-1

Step 2: Predict

predict_test_origin_text.py is for pure text predict_test_origin_image.py is for pure image predict_test_origin_multi.py is for original multimodality predict_test_layout.py is for pure layout predict_test_layout_multi.py is for layout multimodality

You should change the model name in the code to the one you want to predict with. Other parameters are consistent with the training part.

Step 3: Post Process

You should change the file name in the code to the one you want to process.

python3 final_submission.py

Step 4: Evaluate And Get The Score

If you want to get the score across topic:

python3 get_evaluation.py
-f=YOUR_FILE_PATH

If you want to get the score within topic:

python3 get_evaluation_within_topic.py
-f=YOUR_FILE_PATH
--topic=choose one in [gun_control, abortion]

Others

Address Data Imbalance

code used to address data imbalance is in path ./data/TranslateDemo
a stands for abortion, g stands for gun_control
s stands for stance, p stands for persuasiveness

cd data/TranslateDemo
python3 TranslateDemo_a_s.py

Data Augmentation

code used to do data augmentation is in path ./data/wordnet_augmentation

cd data/wordnet_augmentation
python3 preprocess_glossbert_input.py
python3 build_gloss_bert_input.py
cd GlossBERT
./run_WSD.sh
cd ..
python3 incorporate_score.py

How to Cite

@inproceedings{zong-etal-2023-tilfa,
    title = "{TILFA}: A Unified Framework for Text, Image, and Layout Fusion in Argument Mining",
    author = "Zong, Qing  and
      Wang, Zhaowei  and
      Xu, Baixuan  and
      Zheng, Tianshi  and
      Shi, Haochen  and
      Wang, Weiqi  and
      Song, Yangqiu  and
      Wong, Ginny  and
      See, Simon",
    booktitle = "Proceedings of the 10th Workshop on Argument Mining",
    month = dec,
    year = "2023",
    address = "Singapore",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.argmining-1.14",
    doi = "10.18653/v1/2023.argmining-1.14",
    pages = "139--147",
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data		data
esim		esim
evaluation_code		evaluation_code
final		final
layoutlm		layoutlm
layoutlmv3		layoutlmv3
output		output
plugins		plugins
.gitignore		.gitignore
README.md		README.md
check_data.py		check_data.py
check_model.py		check_model.py
check_sample.py		check_sample.py
dataloader.py		dataloader.py
final_submission.py		final_submission.py
get_test_data_v2.py		get_test_data_v2.py
get_train_dev_data.py		get_train_dev_data.py
main_image.py		main_image.py
main_image_alltrain.py		main_image_alltrain.py
main_image_kfold.py		main_image_kfold.py
main_layoutlm.py		main_layoutlm.py
main_layoutlmv3.py		main_layoutlmv3.py
main_layoutlmv3_alltrain.py		main_layoutlmv3_alltrain.py
main_multimodality.py		main_multimodality.py
main_multimodality_alltrain.py		main_multimodality_alltrain.py
main_multimodality_kfold.py		main_multimodality_kfold.py
main_multimodality_layoutlmv3.py		main_multimodality_layoutlmv3.py
main_multimodality_layoutlmv3_alltrain.py		main_multimodality_layoutlmv3_alltrain.py
main_text.py		main_text.py
main_text_alltrain.py		main_text_alltrain.py
main_text_kfold.py		main_text_kfold.py
method_figure.png		method_figure.png
models.py		models.py
predict_test_layout.py		predict_test_layout.py
predict_test_layout_multi.py		predict_test_layout_multi.py
predict_test_origin_image.py		predict_test_origin_image.py
predict_test_origin_multi.py		predict_test_origin_multi.py
predict_test_origin_text.py		predict_test_origin_text.py
requirements.txt		requirements.txt
run.sh		run.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TILFA: A Unified Framework for Text, Image, and Layout Fusion in Argument Mining

Abstract

An Overview of Our Method

Requirements

Run TILFA Framework

Step 1: Training

pure text

pure image

original multimodality

pure layout

layout multimodality

Step 2: Predict

Step 3: Post Process

Step 4: Evaluate And Get The Score

Others

Address Data Imbalance

Data Augmentation

How to Cite

About

Releases

Packages

Languages

HKUST-KnowComp/TILFA

Folders and files

Latest commit

History

Repository files navigation

TILFA: A Unified Framework for Text, Image, and Layout Fusion in Argument Mining

Abstract

An Overview of Our Method

Requirements

Run TILFA Framework

Step 1: Training

pure text

pure image

original multimodality

pure layout

layout multimodality

Step 2: Predict

Step 3: Post Process

Step 4: Evaluate And Get The Score

Others

Address Data Imbalance

Data Augmentation

How to Cite

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages