TY - GEN
T1 - XBRL Agent
T2 - 5th ACM International Conference on AI in Finance, ICAIF 2024
AU - Han, Shijie
AU - Kang, Haoqiang
AU - Jin, Bo
AU - Liu, Xiao Yang
AU - Yang, Steve Y.
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/11/14
Y1 - 2024/11/14
N2 - eXtensible Business Reporting Language (XBRL) has attained the status of the global de facto standard for business reporting. However, its complexity poses significant barriers to interpretation and accessibility. In this paper, we present the first evaluation of large language models' (LLMs) performance in analyzing XBRL reports. Our study identifies LLMs' limitations in the comprehension of financial domain knowledge and mathematical calculation in the context of XBRL reports. To address these issues, we propose enhancement methods using external tools under the agent framework, referred to as XBRL-Agent, which invokes retrievers and calculators. Extensive experiments on two tasks - the Domain Query Task (which involved testing 500 XBRL term explanations and 50 domain questions) and the Numeric Type Query Task (tested 1,000 financial math tests and 50 numeric queries) - demonstrate substantial performance improvements, with accuracy increasing by up to 17% for the domain task and 42% for the numeric type task. This work not only explores the potential of LLMs for analyzing XBRL reports but also augments the reliability and robustness of such analysis, although there is still much room for improvement in mathematical calculations.
AB - eXtensible Business Reporting Language (XBRL) has attained the status of the global de facto standard for business reporting. However, its complexity poses significant barriers to interpretation and accessibility. In this paper, we present the first evaluation of large language models' (LLMs) performance in analyzing XBRL reports. Our study identifies LLMs' limitations in the comprehension of financial domain knowledge and mathematical calculation in the context of XBRL reports. To address these issues, we propose enhancement methods using external tools under the agent framework, referred to as XBRL-Agent, which invokes retrievers and calculators. Extensive experiments on two tasks - the Domain Query Task (which involved testing 500 XBRL term explanations and 50 domain questions) and the Numeric Type Query Task (tested 1,000 financial math tests and 50 numeric queries) - demonstrate substantial performance improvements, with accuracy increasing by up to 17% for the domain task and 42% for the numeric type task. This work not only explores the potential of LLMs for analyzing XBRL reports but also augments the reliability and robustness of such analysis, although there is still much room for improvement in mathematical calculations.
KW - Large language models (LLM)
KW - Semantic-augmented generation
KW - XBRL reports
UR - http://www.scopus.com/inward/record.url?scp=85213736356&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85213736356&partnerID=8YFLogxK
U2 - 10.1145/3677052.3698614
DO - 10.1145/3677052.3698614
M3 - Conference contribution
AN - SCOPUS:85213736356
T3 - ICAIF 2024 - 5th ACM International Conference on AI in Finance
SP - 856
EP - 864
BT - ICAIF 2024 - 5th ACM International Conference on AI in Finance
Y2 - 14 November 2024 through 17 November 2024
ER -