TY - GEN
T1 - ECC Analyzer
T2 - 5th ACM International Conference on AI in Finance, ICAIF 2024
AU - Cao, Yupeng
AU - Chen, Zhi
AU - Pei, Qingyun
AU - Lee, Nathan
AU - Subbalakshmi, K. P.
AU - Ndiaye, Papa Momar
N1 - Publisher Copyright:
© 2024 ACM.
PY - 2024/11/14
Y1 - 2024/11/14
N2 - In the realm of financial analytics, leveraging unstructured data, such as earnings conference calls (ECCs), to forecast stock volatility is a critical challenge that has attracted both academics and investors. While previous studies have used multimodal deep learning-based models to obtain a general view of ECCs for volatility predicting, they often fail to capture detailed, complex information. Our research introduces a novel framework: ECC Analyzer, which utilizes large language models (LLMs) to extract richer, more predictive content from ECCs to aid the model's prediction performance. We use the pre-trained large models to extract textual and audio features from ECCs and implement a hierarchical information extraction strategy to extract more fine-grained information. This strategy first extracts paragraph-level general information by summarizing the text and then extracts fine-grained focus sentences using Retrieval-Augmented Generation (RAG). These features are then fused through multimodal feature fusion to perform volatility prediction. Experimental results demonstrate that our model outperforms traditional analytical benchmarks, confirming the effectiveness of advanced LLM techniques in financial analysis.
AB - In the realm of financial analytics, leveraging unstructured data, such as earnings conference calls (ECCs), to forecast stock volatility is a critical challenge that has attracted both academics and investors. While previous studies have used multimodal deep learning-based models to obtain a general view of ECCs for volatility predicting, they often fail to capture detailed, complex information. Our research introduces a novel framework: ECC Analyzer, which utilizes large language models (LLMs) to extract richer, more predictive content from ECCs to aid the model's prediction performance. We use the pre-trained large models to extract textual and audio features from ECCs and implement a hierarchical information extraction strategy to extract more fine-grained information. This strategy first extracts paragraph-level general information by summarizing the text and then extracts fine-grained focus sentences using Retrieval-Augmented Generation (RAG). These features are then fused through multimodal feature fusion to perform volatility prediction. Experimental results demonstrate that our model outperforms traditional analytical benchmarks, confirming the effectiveness of advanced LLM techniques in financial analysis.
KW - Earnings Conference Call Analysis
KW - Large Language Model
KW - Retrieval-Augmented Generation
KW - Volatility forecasting
UR - http://www.scopus.com/inward/record.url?scp=85214883095&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85214883095&partnerID=8YFLogxK
U2 - 10.1145/3677052.3698689
DO - 10.1145/3677052.3698689
M3 - Conference contribution
AN - SCOPUS:85214883095
T3 - ICAIF 2024 - 5th ACM International Conference on AI in Finance
SP - 257
EP - 265
BT - ICAIF 2024 - 5th ACM International Conference on AI in Finance
Y2 - 14 November 2024 through 17 November 2024
ER -