Adversarial Water-Filling: Theory, Algorithms and Foundation Model

摘要

Competitive resource allocation problems over frequency and space can be formulated as minimax interaction between transmit power and worst-case interference. This formulation naturally arises in multi-operator low Earth orbit (LEO) satellite spectrum sharing, where transmissions from competing constellations interfere in real-time. Under Gaussian channels, AWF is strongly convex--concave on nondegenerate active channels, whereas discrete constellations yield generally nonconvex mercury/water-filling formulations. In this paper we propose the Adversarial Water-Filling (AWF) problem with corresponding theory and algorithms for these real situations. In addition, we develop a wireless foundation model for AWF to learn the AWF search dynamics. The architecture incorporates permutation-invariant channel representations, a constraint-aware graph neural network (GNN) with sparse message passing, and global latent variables capturing the low-dimensional water level implied by the AWF optimality. Through learned projected extragradient iterations, the model approximates stationary solutions of the constrained minimax problem arising under mercury/water-filling. We further show that, under local regularity and contractivity conditions, the learned AWF dynamics converge locally linearly around regular stationary points. Experiments demonstrate empirical generalization across unseen problem sizes, different constraints, and multiple discrete constellations, while achieving more than one-order-of-magnitude runtime improvements over iterative baselines. The related code can be found at https://github.com/convexsoft/AWF.

核心问题与主要方法

核心问题

Competitive wireless resource allocation under worst-case interference in multi-operator spectrum sharing

场景：LEO/NTN spectrum sharing with coupled frequency-space power allocation, Gaussian or discrete constellations, and optional linear spatial constraints

主要方法

Formulates transmit allocation and worst-case interference as a constrained max-min AWF game over frequency and space, with simplex budgets for p and n and optional spatial constraints Ap <= p_hat. Uses water-filling/KKT analysis to expose low-dimensional dual water levels: nu for transmit power, mu for interference, and theta-induced channel-wise shifts under spatial constraints. Extends Gaussian water-filling to mercury/water-filling via mutual-information objectives and I-MMSE-based gradients, acknowledging loss of closed-form proximal updates and possible nonconvexity. Interprets AWF through primal-dual and PDHG/extragradient dynamics, motivating a learned projected extragradient solver. Builds the foundation model from permutation-invariant set encoding for unordered channels, sparse constraint-graph message passing for A^T theta effects, distribution tokens for modulation curves, and simplex projections to enforce budgets.

关键贡献与后续阅读

关键贡献

Introduces AWF as a unified minimax framework for competitive wireless resource allocation over frequency and space, explicitly modeling worst-case interference in multi-operator LEO/NTN spectrum sharing. Separates the tractable Gaussian regime from the harder discrete-constellation mercury/water-filling regime, identifying convex-concave structure for Gaussian AWF and nonconvex behavior for mercury/water-filling. Derives KKT/water-level characterizations showing how transmit, interference, and spatial constraint dual variables coordinate high-dimensional channel allocations. Connects AWF to PDHG/proximal/extragradient optimization, using this structure to design learned projected extragradient iterations instead of a fixed-size direct regressor. Proposes a domain-structured wireless foundation model with permutation-invariant channel representations, sparse constraint-aware GNN message passing, modulation distribution tokens, and global latent variables for water-level dynamics. Provides conditional theoretical support: fixed points satisfying projected first-order conditions correspond to KKT points, and learned dynamics converge locally linearly under regularity and contractivity assumptions. Reports empirical generalization across unseen channel sizes, modulation formats including held-out 256QAM, and unseen constraint families, with more than one-order-of-magnitude runtime improvement over Mirror-Prox baselines.

研究启发

Do the reported tables show statistically robust comparisons across enough random AWF instances, especially for held-out 256QAM and dense correlated constraints? How sensitive is the learned extragradient model to the choice of unroll length T=64, stepsize clamping, and the residual-based training objective? Does the code reproduce the claimed 16x-18x speedups on hardware beyond the reported RTX 3050 setup? How realistic are the generated channel, noise, budget, and sparse constraint distributions for actual multi-operator LEO coexistence scenarios?

限制与不确定性

Learned dynamics have only local convergence guarantees under regularity and contractivity assumptions. Nonconvex mercury/water-filling setting lacks global convergence guarantees. Value depends on empirical validity of reported generalization and runtime gains, which is not verified from the full paper here.

参考文献

43 条

[1] H. Al-Hraishawi, H. Chougrani, S. Kisseleff, E. Lagunas, and S. Chatzinotas (2023) A survey on nongeostationary satellite systems: the communication perspective . IEEE Communications Surveys & Tutorials 25 ( 1 ), pp. 101–132 . External Links: Document Cited by: §I .
[2] L. Bonati, M. Polese, S. D’Oro, S. Basagni, and T. Melodia (2023) OpenRAN gym: AI/ML development, data collection, and testing for O-RAN on PAWR platforms . Computer Networks 220 , pp. 109502 . External Links: ISSN 1389-1286 , Document Cited by: § II-B .
[3] S. Boyd and L. Vandenberghe (2004) Convex optimization . Cambridge University Press . Cited by: §III .
[4] A. Chambolle and T. Pock (2011-05-01) A first-order primal-dual algorithm for convex problems with applications to imaging . Journal of Mathematical Imaging and Vision 40 ( 1 ), pp. 120–145 . External Links: ISSN 1573-7683 , Document Cited by: §I , §I , § II-A , § IV-A , § V-D , §VI .
[5] S. Chen, C. W. Tan, X. Zhai, and H. V. Poor (2025) OpenRANet: neuralized spectrum access by joint subcarrier and power allocation with optimization-based deep learning . External Links: 2409.12964 , Link Cited by: §I .
[6] T. M. Cover (1999) Elements of information theory . John Wiley & Sons . Cited by: § II-A .
[7] S. D’Oro, L. Bonati, M. Polese, and T. Melodia (2022) OrchestRAN: network automation through orchestrated intelligence in the open RAN . In Proceedings of the 2022 IEEE Conference on Computer Communications (INFOCOM) , London, United Kingdom , pp. 270––279 . External Links: Document Cited by: §I .
[8] Federal Communications Commission (2024) Space exploration holdings, LLC request for deployment and operating authority for the spaceX Gen2 NGSO satellite system . Note: Order and Authorization, DA 24-1193Accessed: 2026-05-13 External Links: Link Cited by: §I , §I .
[9] R. G. Gallager (1968) Information theory and reliable communication . Vol. 588 , Springer . Cited by: §I .
[10] A. Garnaev, A. Petropulu, W. Trappe, and H. V. Poor (2022) An anti-jamming multiple access channel game using latency as metric . IEEE Wireless Communications Letters 11 ( 9 ), pp. 1800–1804 . External Links: Document Cited by: § II-A .
[11] A. Garnaev, W. Trappe, and H. Vincent Poor (2026) Max-min resource allocation with application to anti-jamming . IEEE Wireless Communications Letters 15 ( ), pp. 1405–1409 . External Links: Document Cited by: § II-A .
[12] A. Ghosh and S. Boyd (2003) Minimax and convex-concave games . Note: EE392o Lecture course notes, Stanford University, Stanford, CA Cited by: § II-A , § III-A , §III .
[13] G. Giambene, S. Kota, and P. Pillai (2018) Satellite-5G integration: a network perspective . IEEE Network 32 ( 5 ), pp. 25–31 . Cited by: §I .
[14] T. Goldstein, M. Li, X. Yuan, E. Esser, and R. Baraniuk (2015) Adaptive primal-dual hybrid gradient methods for saddle-point problems . External Links: 1305.0546 , Link Cited by: §I , §I , § II-A , § IV-A , § V-D .
[15] D. Guo, S. Shamai, and S. Verdú (2005) Mutual information and minimum mean-square error in Gaussian channels . IEEE Transactions on Information Theory 51 ( 4 ), pp. 1261–1282 . Cited by: § III-B , § V-B .
[16] E. Habler, R. Bitton, D. Avraham, D. Mimran, E. Klevansky, O. Brodt, H. Lehmann, Y. Elovici, and A. Shabtai (2023) Adversarial machine learning threat analysis and remediation in open radio access network (O-RAN) . External Links: 2201.06093 , Link Cited by: §I .
[17] A. Jaegle, F. Gimeno, A. Brock, O. Vinyals, A. Zisserman, and J. Carreira (2021-18–24 Jul) Perceiver: general perception with iterative attention . In Proceedings of the 38th International Conference on Machine Learning , M. Meila and T. Zhang (Eds.) , Proceedings of Machine Learning Research , Vol. 139 , pp. 4651–4664 . Cited by: § II-B , § V-A , §VI , footnote 1 .
[18] S. Kassing, D. Bhattacherjee, A. B. Águas, J. E. Saethre, and A. Singla (2020) Exploring the ”Internet from space” with Hypatia . In Proceedings of the ACM Internet Measurement Conference , New York, NY, USA , pp. 214–229 . External Links: ISBN 9781450381383 , Document Cited by: §I .
[19] J. Lee, Y. Lee, J. Kim, A. Kosiorek, S. Choi, and Y. W. Teh (2019-09–15 Jun) Set transformer: a framework for attention-based permutation-invariant neural networks . In Proceedings of the 36th International Conference on Machine Learning , K. Chaudhuri and R. Salakhutdinov (Eds.) , Proceedings of Machine Learning Research , Vol. 97 , pp. 3744–3753 . Cited by: § II-B .
[20] Y. Li, Z. Ye, H. Zhang, J. Wang, J. Ma, and W. Tong (2024) Coded water-filling for multi-user interference cancellation . External Links: 2410.14136 , Link Cited by: §I .
[21] C. Liu, M. Bennis, M. Debbah, and H. V. Poor (2019) Dynamic task offloading and resource allocation for ultra-reliable low-latency edge computing . IEEE Transactions on Communications 67 ( 6 ), pp. 4132–4150 . External Links: Document Cited by: §I .
[22] A. Lozano, A. M. Tulino, and S. Verdú (2006) Optimum power allocation for parallel Gaussian channels with arbitrary input distributions . IEEE Transactions on Information Theory 52 ( 7 ), pp. 3033–3051 . Cited by: §I , § II-A .
[23] Y. S. Nasir and D. Guo (2019-10) Multi-agent deep reinforcement learning for dynamic power allocation in wireless networks . IEEE Journal on Selected Areas in Communications 37 ( 10 ), pp. 2239–2250 . External Links: ISSN 1558-0008 , Document Cited by: § II-B .
[24] A. Nemirovski (2004) Prox-method with rate of convergence O(1/t) for variational inequalities with Lipschitz continuous monotone operators and smooth convex-concave saddle point problems . SIAM Journal on Optimization 15 ( 1 ), pp. 229–251 . External Links: Document , https://doi.org/10.1137/S1052623403425629 Cited by: §VI , § VII-B .
[25] D. P. Palomar and J. R. Fonollosa (2005) Practical algorithms for a family of waterfilling solutions . IEEE Transactions on Signal Processing 53 ( 2 ), pp. 686–695 . Cited by: §I , §I .
[26] N. Parikh and S. Boyd (2014) Proximal algorithms . Foundations and Trends® in Optimization 1 ( 3 ), pp. 127–239 . External Links: Document , ISSN 2167-3888 Cited by: § IV-A .
[27] M. Polese, L. Bonati, S. D’Oro, S. Basagni, and T. Melodia (2023-10) ColO-RAN: developing machine learning-based xApps for open RAN closed-loop control on programmable experimental platforms . IEEE Transactions on Mobile Computing 22 ( 10 ), pp. 5787–5800 . External Links: Document Cited by: §I .
[28] J. Shao, J. Tong, Q. Wu, W. Guo, Z. Li, Z. Lin, and J. Zhang (2024) WirelessLLM: empowering large language models towards wireless intelligence . External Links: 2405.17053 , Link Cited by: §I , § II-B .
[29] Y. Shen, Y. Shi, J. Zhang, and K. B. Letaief (2021) Graph neural networks for scalable radio resource management: architecture design and theoretical analysis . IEEE Journal on Selected Areas in Communications 39 ( 1 ), pp. 101–115 . External Links: Document Cited by: § II-B .
[30] R. Smith, C. Freeberg, T. Machacek, and V. Ramaswamy (2021-Nov.) An O-RAN approach to spectrum sharing between commercial 5G and government satellite systems . In Proceedings of the 2021 IEEE Military Communications Conference (MILCOM) , San Diego, CA, USA , pp. 739–744 . External Links: Document Cited by: §I .