Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding
PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023) LONG PAPERS, VOL 1(2023)
Key words
Visual Question Answering,Language Understanding
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined