Document-based question AP History DBQ

Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering

Please cite this work with the following BibTeX: @inproceedings{cocchi2024augmenting, title={{Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering}}, ...

GitHub

DocuThinker - AI-Powered Document Analysis and Summarization App

The DocuThinker app is designed to provide users with a simple, AI-powered document management tool. Users can upload PDFs or Word documents and receive summaries, key insights, and discussion points.

Associated Press

Federal officers are leaving Louisiana immigration crackdown for Minneapolis, documents show

Associated Press

US-based multinational companies will be exempt from global tax deal

IEEE

GVDIE: A Zero-Shot Generative Information Extraction Method for Visual Documents Based on Large Language Models

Abstract: Document Information Extraction aims to extract entities and relationships from visually rich documents. Traditional methods require significant annotation and lack generality. In this paper ...

IEEE

Enhancing Remote Sensing Visual Question Answering: A Mask-Based Dual-Stream Feature Mutual Attention Network

Abstract: The visual question answering (VQA) method applied to remote sensing images (RSIs) can complete the interaction of image information and text information, which avoids professional barriers ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results