A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents
TLDR
MODEE combines LLMs and graph learning for open-domain event extraction, outperforming SOTA and generalizing to closed domains.
Key contributions
- Introduces MODEE, a novel multimodal approach for open-domain event extraction.
- Combines LLM text representations with graph-based learning for document-level reasoning.
- Outperforms state-of-the-art open-domain event extraction methods on large datasets.
- Generalizes effectively to closed-domain event extraction, surpassing existing algorithms.
Why it matters
Event extraction is crucial for understanding and analysis, supporting tasks like summarization and emergency decision-making. This paper introduces MODEE, which overcomes limitations of current methods by leveraging LLMs and graph-based reasoning. Its superior performance and generalization ability significantly advance the field.
Original Abstract
Event extraction is essential for event understanding and analysis. It supports tasks such as document summarization and decision-making in emergency scenarios. However, existing event extraction approaches have limitations: (1) closed-domain algorithms are restricted to predefined event types and thus rarely generalize to unseen types and (2) open-domain event extraction algorithms, capable of handling unconstrained event types, have largely overlooked the potential of large language models (LLMs) despite their advanced abilities. Additionally, they do not explicitly model document-level contextual, structural, and semantic reasoning, which are crucial for effective event extraction but remain challenging for LLMs due to lost-in-the-middle phenomenon and attention dilution. To address these limitations, we propose multimodal open-domain event extraction, MODEE , a novel approach for open-domain event extraction that combines graph-based learning with text-based representation from LLMs to model document-level reasoning. Empirical evaluations on large datasets demonstrate that MODEE outperforms state-of-the-art open-domain event extraction approaches and can be generalized to closed-domain event extraction, where it outperforms existing algorithms.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.