JFinTEB: Japanese Financial Text Embedding Benchmark

April 17, 20262604.15882

cs.IRcs.CL

TLDR

JFinTEB is the first comprehensive benchmark for evaluating Japanese financial text embeddings, covering diverse tasks and models.

Key contributions

Introduces JFinTEB, the first comprehensive benchmark for Japanese financial text embeddings.
Covers diverse tasks: retrieval (instruction-following, text generation) and classification (sentiment, categorization).
Evaluates a wide range of models, including Japanese-specific, multilingual, and commercial services.
Publicly releases datasets and an evaluation framework to foster future research.

Why it matters

This paper addresses a critical gap in resources for Japanese financial text processing. JFinTEB provides a standardized evaluation protocol, advancing domain-specific embedding research and facilitating development in the community.

Original Abstract

We introduce JFinTEB, the first comprehensive benchmark specifically designed for evaluating Japanese financial text embeddings. Existing embedding benchmarks provide limited coverage of language-specific and domain-specific aspects found in Japanese financial texts. Our benchmark encompasses diverse task categories including retrieval and classification tasks that reflect realistic and well-defined financial text processing scenarios. The retrieval tasks leverage instruction-following datasets and financial text generation queries, while classification tasks cover sentiment analysis, document categorization, and domain-specific classification challenges derived from economic survey data. We conduct extensive evaluations across a wide range of embedding models, including Japanese-specific models of various sizes, multilingual models, and commercial embedding services. We publicly release JFinTEB datasets and evaluation framework at https://github.com/retarfi/JFinTEB to facilitate future research and provide a standardized evaluation protocol for the Japanese financial text mining community. This work addresses a critical gap in Japanese financial text processing resources and establishes a foundation for advancing domain-specific embedding research.

View on arXiv Download PDF

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.

TLDR

Key contributions

Why it matters

Original Abstract

📬 Weekly AI Paper Digest

Related papers