TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

TAT-DQA is a large-scale Document VQA dataset, which is constructed by extending the TAT-QA. It aims to stimulate progress of QA research over more complex and realistic visually-rich documents with rich tabular and textual content, especially those requiring numerical reasoning.

You can download our TAT-DQA dataset via TAT-DQA Dataset.

For more information, please refer to our TAT-DQA Website or read our ACM MM 2022 paper PDF.

Updates

${\color{red}Jan 2024}$: We release the ground truth for the TAT-DQA test set TAT-DQA Dataset, to facilitate future research on this task!

${\color{red}May 2023}$: Source Code released! You are welcome to use the Doc2SoarGraph repo to explore the TAT-DQA dataset and start your research!

Citation

Please kindly cite our work if you use our dataset or codes, thank you.

@inproceedings{zhu2022towards,
  title={Towards complex document understanding by discrete reasoning},
  author={Zhu, Fengbin and Lei, Wenqiang and Feng, Fuli and Wang, Chao and Zhang, Haozhou and Chua, Tat-Seng},
  booktitle={Proceedings of the 30th ACM International Conference on Multimedia},
  pages={4857--4866},
  year={2022}
}

@inproceedings{zhu2024doc2soargraph,
    title = "{D}oc2{S}oar{G}raph: Discrete Reasoning over Visually-Rich Table-Text Documents via Semantic-Oriented Hierarchical Graphs",
    author = "Zhu, Fengbin  and
      Wang, Chao  and
      Feng, Fuli  and
      Ren, Zifeng  and
      Li, Moxin  and
      Chua, Tat-Seng",
    editor = "Calzolari, Nicoletta  and
      Kan, Min-Yen  and
      Hoste, Veronique  and
      Lenci, Alessandro  and
      Sakti, Sakriani  and
      Xue, Nianwen",
    booktitle = "Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)",
    year = "2024",
    address = "Torino, Italia",
    publisher = "ELRA and ICCL",
    url = "https://aclanthology.org/2024.lrec-main.456",
    pages = "5119--5131"
}

License

The TAT-DQA dataset is under the license of Creative Commons (CC BY) Attribution 4.0 International

Any Questions?

For any issues please create an issue here or kindly email us at: Fengbin Zhu [email protected], thank you.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
docs		docs
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

Updates

Citation

License

Any Questions?

About

Releases

Packages

NExTplusplus/TAT-DQA

Folders and files

Latest commit

History

Repository files navigation

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

Updates

Citation

License

Any Questions?

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages