Innovation: An Almost Characterization of Hallucination

摘要

Hallucination is a central limitation of large language models (LLMs), and substantial effort has been devoted to understanding and mitigating it. Towards this, Kalai and Vempala (STOC 2024) introduced a probabilistic framework formalizing calibration and hallucination, and showed that, with high probability, calibrated LLMs hallucinate roughly at the rate of the "missing mass", a measure of how incomplete the training data is relative to its source. This raises two fundamental questions: (i) what property of a calibrated LLM makes hallucinations unavoidable? and (ii) can hallucinations be avoided by giving up calibration? We answer these questions by introducing a simpler property we call innovation that measures the tendency of a model to produce outputs outside the training data. We show that innovation is implied by the condition for hallucination identified by Kalai and Vempala, and, further, that it is an almost characterization of hallucination: hallucination implies innovation, and conversely, innovation implies hallucination with high probability. We also provide lower bounds on the hallucination rate based on the "innovation rate", and by relating innovation rate back to missing mass, we obtain new hallucination rate lower bounds based on missing mass that extend the results of Kalai and Vempala.

核心问题与主要方法

核心问题

Characterize when hallucination is unavoidable for LLMs and derive rate lower bounds

场景：Kalai-Vempala probabilistic framework for unprompted LLM generation over a finite statement space with a corpus sampled from a latent document distribution

主要方法

Defines innovation as g(U) > 0, where U is the complement of the observed training support, separating extrapolation beyond the corpus from calibration. Uses K-sparsity and Regular Facts to bound the posterior probability that any chosen unseen statement is factual by roughly K/|U|, so unseen model mass is likely hallucination mass. Shows calibration plus positive missing mass forces positive probability on an unseen factual statement through partition coarsening, hence implies innovation. Derives hallucination-rate lower bounds from innovation rate using a Markov-style expectation argument and a high-confidence case analysis based on the maximum probability assigned to any unseen statement. Relates innovation rate to missing mass through total variation distance between the model distribution and a coarsened document distribution, yielding missing-mass lower bounds under weaker assumptions than some prior bounds.

关键贡献与后续阅读

关键贡献

Introduces innovation rate as a simpler model-and-corpus statistic that nearly characterizes hallucination existence in the Kalai-Vempala framework. Strictly generalizes the existence-level calibration-implies-hallucination route by showing calibration with any positive missing mass implies innovation, then innovation implies hallucination with high probability. Provides lower bounds on hallucination rate directly in terms of innovation rate, including bounds that avoid the corpus-size dependence present in baseline Kalai-Vempala bounds under only K-sparsity and Regular Facts. Connects innovation-rate bounds back to missing mass via coarsening and total variation, producing new missing-mass hallucination lower bounds that remain nonvacuous in regimes where prior bounds may collapse. Adds exploratory evidence that innovation and hallucination rates align in controlled n-gram settings, supporting but not proving robustness beyond the formal assumptions.

研究启发

What are the exact constants and probability levels in Theorems 3.3, 4.1, and 4.2 after parsing the rendered formulas from the PDF/source? How sensitive are the results to violations of Regular Facts in realistic semantic or prompted settings? Can the semantic innovation estimator proposed with sentence embeddings be related to any formal relaxed version of the finite-support theory? Do the missing-mass translation bounds remain quantitatively useful when K is large relative to realistic statement spaces?

限制与不确定性

Model is abstract and depends on finite statement-space, K-sparsity, Regular Facts, and i.i.d. corpus assumptions. Empirical evidence appears toy n-gram based, not a direct validation on modern LLM behavior. Value for an information theory workflow is medium rather than urgent because the main framing is LLM reliability.

参考文献

20 条

Charikar & Pabbaraju (2025) Charikar, M. and Pabbaraju, C. Exploring facets of language generation in the limit. In Proceedings of 38th Conference on Learning Theory (COLT) , volume 291 of PMLR , pp. 854–887, 2025. URL https://proceedings.mlr.press/v291/charikar25a.html .
Farquhar et al. (2024) Farquhar, S., Kossen, J., Kuhn, L., and Gal, Y. Detecting hallucinations in large language models using semantic entropy. Nature , 630:625–630, June 2024. doi: 10.1038/s41586-024-07421-0 .
Gold (1967) Gold, E. M. Language identification in the limit. Information and Control , 10(5):447–474, 1967. doi: 10.1016/S0019-9958(67)91165-5 .
Good (1953) Good, I. J. The population frequencies of species and the estimation of population parameters. Biometrika , 40(3-4):237–264, December 1953. doi: 10.1093/biomet/40.3-4.237 .
Huang et al. (2025) Huang, L., Yu, W., Ma, W., Zhong, W., Feng, Z., Wang, H., Chen, Q., Peng, W., Feng, X., Qin, B., and Liu, T. A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions. ACM Trans. Inf. Syst. , 43(2):42:1–42:55, 2025. doi: 10.1145/3703155 .
Kalai & Vempala (2024) Kalai, A. T. and Vempala, S. S. Calibrated language models must hallucinate. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing (STOC) , pp. 160–171. ACM, 2024. doi: 10.1145/3618260.3649777 .
Kalai et al. (2025) Kalai, A. T., Nachum, O., Vempala, S. S., and Zhang, E. Why language models hallucinate, 2025. URL https://arxiv.org/abs/2509.04664 .
Kalavasis et al. (2025) Kalavasis, A., Mehrotra, A., and Velegkas, G. On the limits of language generation: Trade-offs between hallucination and mode-collapse. In Proceedings of the 57th Annual ACM Symposium on Theory of Computing (STOC) , pp. 1732–1743. ACM, 2025. doi: 10.1145/3717823.3718108 .
Kleinberg & Mullainathan (2024) Kleinberg, J. and Mullainathan, S. Language generation in the limit. In Advances in Neural Information Processing Systems (NeurIPS) , volume 37, pp. 66058–66079. Curran Associates, Inc., 2024. doi: 10.52202/079017-2111 .
Kleinberg & Wei (2025) Kleinberg, J. and Wei, F. Density measures for language generation. In Proceedings of the 66th Annual IEEE Symposium on Foundations of Computer Science (FOCS) , pp. 620–658. IEEE, 2025. doi: 10.1109/FOCS63196.2025.00034 .
Kotzias et al. (2015) Kotzias, D., Denil, M., de Freitas, N., and Smyth, P. From group to individual labels using deep features. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD) , pp. 597–606, 2015. doi: 10.1145/2783258.2783380 .
Lewis et al. (2020) Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., Küttler, H., Lewis, M., Yih, W.-t., Rocktäschel, T., Riedel, S., and Kiela, D. Retrieval-augmented generation for knowledge-intensive NLP tasks. In Advances in Neural Information Processing Systems (NeurIPS) , volume 33, pp. 9459–9474. Curran Associates, Inc., 2020.
Lin et al. (2022) Lin, S., Hilton, J., and Evans, O. TruthfulQA: Measuring how models mimic human falsehoods. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (ACL) (Volume 1: Long Papers) , pp. 3214–3252. Association for Computational Linguistics, May 2022. URL https://aclanthology.org/2022.acl-long.229/ .
Miao & Kearns (2026) Miao, M. M. and Kearns, M. Hallucination, monofacts, and miscalibration: An empirical investigation. Proceedings of the National Academy of Sciences (PNAS) , 123(8):e2533582123, 2026. doi: 10.1073/pnas.2533582123 . URL https://www.pnas.org/doi/abs/10.1073/pnas.2533582123 .
Nakano et al. (2022) Nakano, R., Hilton, J., Balaji, S., Wu, J., Ouyang, L., Kim, C., Hesse, C., Jain, S., Kosaraju, V., Saunders, W., Jiang, X., Cobbe, K., Eloundou, T., Krueger, G., Button, K., Knight, M., Chess, B., and Schulman, J. WebGPT: Browser-assisted question-answering with human feedback, June 2022. URL https://arxiv.org/abs/2112.09332v3 .
Raman & Raman (2025) Raman, A. and Raman, V. Generation from noisy examples. In Proceedings of the 42nd International Conference on Machine Learning (ICML) , volume 267 of PMLR , pp. 51079–51093, 2025. URL https://proceedings.mlr.press/v267/raman25a.html .
Raman et al. (2025) Raman, V., Li, J., and Tewari, A. Generation through the lens of learning theory. In Proceedings of 38th Conference on Learning Theory (COLT) , volume 291 of PMLR , pp. 4740–4776, 2025. URL https://proceedings.mlr.press/v291/raman25a.html .
Suzuki et al. (2025) Suzuki, A., He, Y., Tian, F., and Wang, Z. Hallucinations are inevitable but can be made statistically negligible. The “innate” inevitability of hallucinations cannot explain practical LLM issues, May 2025. URL https://arxiv.org/abs/2502.12187v2 .
Wu et al. (2025) Wu, C., Grama, A., and Szpankowski, W. No free lunch: Fundamental limits of learning non-hallucinating generative models. In Proceedings of the International Conference on Learning Representations (ICLR) , 2025. URL https://openreview.net/forum?id=OwNoTs2r8e .
Xu et al. (2025) Xu, Z., Jain, S., and Kankanhalli, M. Hallucination is inevitable: An innate limitation of large language models, 2025. URL https://arxiv.org/abs/2401.11817v2 .