Electrical Engineering and Systems Science

2026-05-27 | | Total: 67

#1 Point Spread Function Optimization for Communication-assisted UAV-borne MIMO TomoSAR [PDF1] [Copy] [Kimi] [REL]

Authors: Pouya Fakharizadeh, Mohamed-Amine Lahmeri, Gerhard Krieger, Robert Schober

This paper tackles the optimization of the point spread function (PSF) of unmanned aerial vehicle (UAV)-borne multiple-input multiple-output (MIMO) synthetic aperture radar (SAR) tomography systems. A swarm of UAV-borne SAR systems is deployed to image an area to obtain its height profile. To achieve a high-quality three-dimensional (3D) image of the scene, the PSF has to exhibit low sidelobes. The heavy computations, required for image generation, are performed on the ground. To this end, the sensor data collected by the UAV-SARs is offloaded in real time via a frequency division multiple access (FDMA) air-to-ground backhaul link. In this work, the UAV formation and the power allocated for offloading are jointly optimized for the minimization of the PSF sidelobe levels. To this end, we propose a novel solution based on the particle swarm optimization (PSO) algorithm, which meets practical sensing and communication constraints. Our simulation results demonstrate that the proposed solution can significantly improve sidelobe suppression compared to several benchmark schemes.

Subject: Signal Processing

Publish: 2026-05-26 17:12:24 UTC


#2 TWIST: Closed-Loop token Synchronization for Application-Aware Wireless Digital Twins [PDF] [Copy] [Kimi1] [REL]

Authors: Sige Liu, Kezhi Wang

Wireless digital twins require repeated synchronization between a time-evolving physical scene and its digital counterpart under limited and time-varying communication resources. For perception-centric twins, pixel-domain transmission or uniformly protected bitstreams can be mismatched to the semantic state consumed by twin-side applications. This paper proposes TWIST, a closed-loop token synchronization framework for application-aware wireless digital twins. TWIST represents each physical observation as a token and synchronizes this state over a wireless link, rather than optimizing visual reconstruction. Token positions are grouped by task relevance and protected through mode-conditioned unequal error protection under low-, medium-, and high-synchronization modes. At the twin side, decoding confidence converts unreliable hard token decisions into erasures, which are restored by a completion model before updating the semantic twin state. The recovered state supports traffic-state inference and generates compact feedback statistics, including channel quality, receiver uncertainty, semantic drift, and application priority, for subsequent mode adaptation. Experiments on a dynamic road-scene digital-twin scenario show that TWIST improves traffic-state inference and semantic twin-state synchronization compared with fixed-mode and channel-only adaptation strategies, while reducing the average synchronization cost relative to always-high transmission.

Subjects: Image and Video Processing , Artificial Intelligence

Publish: 2026-05-26 15:59:47 UTC


#3 Container Unloading via Reinforcement Learning: Picking Order, Deadlock Avoidance, and Proof-of-Concept Simulation [PDF] [Copy] [Kimi] [REL]

Authors: Jan Rüdiger, Max Schenke, Daniel Weber

Unloading containers in the courier, express and parcel industry is a physically demanding and labor-intensive work. Automatizing this process is an important step towards increasing the efficiency of parcel-handling systems. This work investigates the potential of reinforcement learning to learn a policy for item selection in container unloading scenarios. For that, a simulation environment is created and a masked deep Q-learning with a specially designed neural network architecture is implemented. The results indicate that the agent can learn to select items with an average success rate of 60 %, which is significantly better than a random policy at a random chance of 20 %. The findings suggest that RL could be a promising approach for automatizing item unloading tasks in the future.

Subject: Systems and Control

Publish: 2026-05-26 15:08:22 UTC


#4 Unsupervised Deep Image Prior for Sparse-View and Limited-Angle Electron Tomography [PDF] [Copy] [Kimi] [REL]

Authors: Serge Brosset, Daniel del Pozo Bueno, Thomas David, Laure Guetaz, Philippe Ciuciu, Zineb Saghi

Electron tomography (ET) plays an important role in the three-dimensional (3D) characterization of nanomaterials. However, under limited-angle and sparse-view conditions, conventional algorithms produce degraded reconstructions, which compromise the quality and interpretability of resulting 3D data. In this paper, we present deep image prior (DIP), an unsupervised deep learning (DL) approach, for highly degraded tomography acquisitions and demonstrate, using simulated data, that its performance is comparable to that of supervised approaches requiring training datasets, even for tilt ranges as limited as 60° and tilt increments of 10°. We then apply it to experimental data and show that it enables reliable 3D quantification under both sparse-view and limited-angle conditions, highlighting its potential for a wide range of materials and acquisition modalities.

Subjects: Image and Video Processing , Computer Vision and Pattern Recognition , Instrumentation and Detectors

Publish: 2026-05-26 15:07:01 UTC


#5 En-route Charging Coordination for Electric Trucks [PDF] [Copy] [Kimi] [REL]

Authors: Joas Kahlert, Ruiting Wang, Jonas Mårtensson

The electrification of long-haul freight transport introduces several new challenges, such as the limited capacity and congestion at en-route charging infrastructure. To reduce waiting times during peak periods, this paper proposes a framework for coordinated charging scheduling. The approach employs a mixed-integer formulation to optimize charging-related costs across charging, operation, battery degradation, and congestion delay, considering a range of scenarios. The results demonstrate that coordinated scheduling yields substantial cost savings up to 36% compared to uncoordinated scheduling, particularly by reducing battery degradation and delay costs.

Subject: Systems and Control

Publish: 2026-05-26 14:37:50 UTC


#6 Why Can't They Remember? Uncovering Representation and Retrieval Bottlenecks in Multi-Turn Acoustic Memory [PDF] [Copy] [Kimi] [REL]

Authors: Yang Xiao, Siyi Wang, Han Yin, Hong Jia, Vidhyasaharan Sethu, Eun-Jung Holden, Ting Dang

Large audio language models (LALMs) process both speech and environmental acoustic cues, yet struggle to retain non-speech information across multi-turn interactions. The performance gap between semantic (speech) and acoustic (non-speech) understanding remains poorly understood, and the underlying mechanisms of representation and retrieval are still unclear. This work introduces EnvMem, a controlled multi-turn benchmark designed to study this gap and identify the root causes of failures at the representation (i.e., latent embeddings) and retrieval levels (i.e., attention allocation). We further conduct post-hoc interventions to probe representational structure and attention dynamics. Our results reveal representational trajectory drift as the key failure mode, while showing that attention allocation plays a limited role in explaining the observed degradation. Overall, we provide a systematic framework for analyzing and improving non-linguistic memory in long-context LALMs, shedding light on future data and training design for robust acoustic memory modeling.

Subjects: Audio and Speech Processing , Sound

Publish: 2026-05-26 13:56:33 UTC


#7 In-Orbit Intelligence or Ground Offloading? Inference Freshness under Intermittent Satellite Connectivity [PDF] [Copy] [Kimi] [REL]

Authors: Ayse Nur Pehlivanoglu, Aimin Li, Elif Uysal

This paper studies how to balance onboard and ground computation under intermittent LEO connectivity for optimized inference freshness. As connectivity varies in time, the system switches among the actions of onboard computation, cached semantic transmission, raw-data offloading, and waiting. We define Age of Inference (AoInf) as the performance metric, where the age resets only upon successful task-valid updates. We formulate long-run average AoInf minimization as a finite-state average-cost semi-Markov decision process whose state captures the ground AoInf, orbital contact phase, cache occupancy, and cache age. We then transform the SMDP into an equivalent average-cost MDP and compute the solution via normalized relative value iteration (RVI). Numerical results indicate that the resulting hybrid policy reduces average AoInf relative to onboard-only and offload-only baselines, while requiring less computational resources on the satellite than the former, and fewer communication resources than the latter.

Subject: Systems and Control

Publish: 2026-05-26 13:39:23 UTC


#8 Graph-Based Modeling, Control, and Optimization for Multi-Domain and Multi-Timescale Energy Systems [PDF] [Copy] [Kimi] [REL]

Authors: Joseph M. Pisani, Christopher T. Aksland, Philip M. Renkert, Joseph Broniszewski, Vismay Vyas, Andrew G. Alleyne, Donald J. Docimo, Justin P. Koeln, Neera Jain, Herschel C. Pangborn

Modern energy systems in vehicles and built infrastructure are governed by high-dimensional dynamics spanning multiple physical domains (e.g., electrical, thermal, mechanical) and timescales. This tutorial paper presents a graph-based modeling approach created to facilitate the modeling, analysis, control, estimation, optimization, and design of these systems. Matured and validated through more than a decade of research spanning multiple academic institutions and companies, the graph-based approach combines transient energy conservation with an explicit mathematical representation of the network by which energy is stored and transferred within a system. Following a mathematical overview of graph-based models, examples of multi-domain component and system models from the recent literature are presented, including single-phase thermal systems, two-phase thermal systems, and electro-mechanical systems. This is followed by a survey of recent applications for decentralized and hierarchical model predictive control, design optimization, and control co-design. Lastly, the paper describes an open-source toolbox created to facilitate the generation and analysis of graph-based models.

Subject: Systems and Control

Publish: 2026-05-26 13:35:16 UTC


#9 Over-the-Air Successive Interference Cancellation for Efficient 5G NR and Wi-Fi Spectrum Reuse [PDF] [Copy] [Kimi] [REL]

Authors: Mir Lodro, Francesco Raimondo, Geoffrey S. Hilton, Mark A. Beach, Andrew C. M. Austin

An over-the-air (OTA) experimental evaluation of concurrent 5G New Radio (5G NR) and Wi-Fi transmission using successive interference cancellation (SIC) in a shielded-box environment is presented. A USRP is used as the receiver, which captures the composite waveform containing both air-interface signals and applies sample-domain SIC to suppress the dominant 5G-NR signal and recover Wi-Fi signal from the residual waveform. The framework reports error vector magnitude (EVM), bit error rate (BER), sample-domain cancellation depth, and channel-estimate suppression, and, at the representative \(18\) dB attenuation point, measures \(11.88\) dB cancellation depth and \(26.96\) dB 5G channel suppression. The proposed methodology provides a practical basis for assessing cross-technology coexistence and receiver-side interference suppression under controlled OTA conditions.

Subject: Signal Processing

Publish: 2026-05-26 13:24:29 UTC


#10 OTA Characterization of Dual-User IEEE 802.11be EHT-MU Under Transmit-Chain Imbalance [PDF] [Copy] [Kimi] [REL]

Authors: Mir Lodro, Francesco Raimondo, Geoffrey S. Hilton, Mark A. Beach, Andrew C. M. Austin

This paper presents a controlled over-the-air (OTA) characterization of dual-user IEEE 802.11be Extremely High Throughput Multi-User (EHT-MU) transmission under transmit-chain imbalance. The objective is to determine whether attenuation applied to one access-point transmit chain produces packet-global degradation or appears primarily as stream-dependent payload degradation after receiver processing. Measurements are performed in a shielded RF enclosure using two NI USRP-2953R and NI USRP-2942R software-defined radios, with one USRP generating a dual-user non-OFDMA EHT-MU waveform and the other implementing synchronized dual-branch packet recovery. A calibrated attenuation sweep is applied to the second AP transmit chain (TX2), and performance is evaluated using bit error rate (BER), EHT-Data error vector magnitude (EVM), control-field success probability, payload-success probability, and subcarrier-level EVM distributions. The results show that the stream decoded as User~1 remains at the BER floor over the tested range, while the stream decoded as User~2 exhibits progressive EVM degradation followed by threshold-like BER and payload-success collapse. Common signaling fields remain recoverable, indicating that the dominant observed failure mode is stream-local at the receiver output than the packet-global. Replacing User~2 binary convolutional coding (BCC) with low density parity check (LDPC) coding delays the BER and payload-success collapse by approximately \(5\)~dB of TX2 attenuation, demonstrating a measurable coding-dependent robustness margin for the more sensitive stream.

Subject: Signal Processing

Publish: 2026-05-26 13:15:40 UTC


#11 Congestion Forecasting for Electric Vehicle Charging Scheduling with Fluid Queues [PDF] [Copy] [Kimi] [REL]

Authors: Joas Kahlert, Ruiting Wang, Jonas Mårtensson

To support the adoption of electric transport systems, public charging opportunities are becoming increasingly important. In this dynamic environment, a central challenge for route planning and charging scheduling is forecasting charging-station availability under fluctuating demand. In this work, we propose a fluid-based forecasting method that accounts for uncertainty in both known and unforeseen electric vehicle arrival patterns while respecting station capacity constraints. We further evaluate the congestion forecasting method by applying it to an electric vehicle scheduling problem. Compared to scheduling frameworks that rely on standard baselines, charging schedules based on the fluid congestion forecasting model reduce waiting-related downtime by up to 14%. Finally, we quantify how increased knowledge of vehicle arrivals and different levels of station congestion affect overall system performance.

Subject: Systems and Control

Publish: 2026-05-26 12:55:33 UTC


#12 Half-Quadratic Criterion based Adaptive Graph Signal Processing Algorithm [PDF] [Copy] [Kimi] [REL]

Authors: Chong Zhang, Haiquan Zhao, Chengjin Li

In recent years, progress in adaptive graph signal processing algorithms has provided effective solutions for processing signals defined on graph structures. As a classical strategy in information theory, the Generalized Maximum Correntropy Criterion (GMCC) exhibits good resistance to non-Gaussian noises. When non-Gaussian noise interferes with the graph signal, the graph signal processing algorithm based on GMCC (GSP GMCC) algorithm shows better performance. However, the GSP GMCC algorithm itself has three parameters that need to be manually tuned, and the process of manually tuning the parameters is complex and tedious. Meanwhile, the non-concave and non-convex nature of the GMCC function itself limits its own convergence rate and adaptive estimation accuracy. To solve the above problems, based on the strongly convex function half-quadratic criterion (HQC), the GSP HQC algorithm is proposed in this paper. The performance analysis of the GSP HQC algorithm is implemented in this paper. Simulation experiments demonstrate that the GSP HQC algorithm achieves superior performance in terms of convergence rate and adaptive estimation accuracy while maintaining computational complexity comparable to existing algorithms

Subject: Signal Processing

Publish: 2026-05-26 12:41:21 UTC


#13 On the LEO Satellite Constellation Design for North Atlantic Coverage [PDF] [Copy] [Kimi] [REL]

Authors: Alejandro Ramírez-Arroyo, Miguel Villanueva-Fernández, Preben Mogensen

Low Earth Orbit (LEO) satellite constellations are emerging as a key component of non-terrestrial networks due to their low-latency and high-capacity communication capabilities. However, satellites in these orbits are characterized by a small coverage footprint and high orbital velocity compared to those in higher orbits. This results in constantly changing and dynamic constellations that require smart design of orbital parameters to ensure continuous coverage. Existing constellation deployments are typically optimized either for low- and mid-latitude regions or for full polar coverage, leaving high-latitude regional scenarios such as the North Atlantic insufficiently explored. This work provides insights into the key characteristics associated with the deployment of satellites in LEO for North Atlantic coverage. Therefore, we investigate how constellation inclination, minimum elevation angle, altitude, and satellite footprint jointly affect visibility probability, revisit time, path loss, and coverage continuity. Results show that the minimum elevation angle is a critical design parameter since a Walker Delta constellation with 64 satellites at 1000 km altitude can provide continuous coverage above 55°N for elevations below 20°, whereas coverage probability degrades drastically for larger elevation angles. Similarly, inclinations above approximately 70° are required to achieve robust North Atlantic coverage with medium-size constellations. Thus, these results provide practical guidelines on how a satellite constellation should be designed to achieve an efficient deployment with a focus on coverage over the North Atlantic, targeting maritime, aviation, and Arctic connectivity scenarios.

Subject: Signal Processing

Publish: 2026-05-26 12:32:46 UTC


#14 NF-TrackLLM: Joint Prediction of UAV Trajectory and Near-Field Beam for LAE XL-MIMO Systems [PDF] [Copy] [Kimi] [REL]

Authors: Qianfan Lu, Mengyuan Li, Jiachen Tian, Yu Han, Xiao Li, Shi Jin

User localization and beam management are tightly linked in extremely large-scale multiple-input multiple-output (XL-MIMO) systems, especially in dense low-altitude economy (LAE) scenarios. However, the near-field propagation in XL-MIMO introduces strong distance sensitivity and complex spatial coupling, which makes joint trajectory and beam prediction challenging. Meanwhile, large language models (LLMs) have attracted attention in physical-layer transmission for modeling long-range dependencies. In this paper, we propose NF-TrackLLM, a multi-modal semantic-aware framework for near-field unmanned aerial vehicles (UAVs) positioning and beam prediction in XL-MIMO systems. By incorporating visual and LiDAR sensing into a Sionna-based channel generation pipeline, environmental semantics and GPS are utilized to guide trajectory and beam prediction. Built upon the aligned multi-modal representation, a GPT-2-based spatiotemporal reasoning backbone, and a cascaded prediction strategy are employed, where future trajectories are first inferred and then used to guide beam prediction as geometric priors. Simulation results demonstrate that NF-TrackLLM achieves accurate beam prediction and reliable UAV trajectory tracking in dense urban low-altitude scenarios.

Subject: Signal Processing

Publish: 2026-05-26 12:23:52 UTC


#15 Gaussian Process-Based Extended Object Estimation for 6G ISAC at Millimeter-Wave Frequencies [PDF] [Copy] [Kimi] [REL]

Authors: M. Ertug Pihtili, Ossi Kaltiokallio, Julia Equi, Jukka Talvitie, Elena Simona Lohan, Ertugrul Basar, Mikko Valkama

This paper introduces a Gaussian process (GP)-based method for extended object estimation (EOE) in integrated sensing and communication (ISAC) scenarios, representing a promising approach to enhance environmental awareness beyond the conventional point-scatterer assumption. The suitability of the proposed GP-based method for EOE is investigated through a practical measurement setup compliant with the fifth-generation (5G) New Radio (NR) standard and employing bistatic sensing, with results evaluated for both mapping and simultaneous localization and mapping (SLAM ) cases at millimeter-wave (mmWave) frequencies. The findings reveal that the enhanced capabilities of communication networks, when combined with bistatic sensing and GP-based EOE, enable improved environmental awareness in future wireless systems. Importantly, the results demonstrate that, under practical conditions, GP effectively performs EOE in both mmWave mapping and SLAM scenarios.

Subject: Signal Processing

Publish: 2026-05-26 12:12:59 UTC


#16 Load Management of Distribution Systems via Online Dynamic Pricing [PDF] [Copy] [Kimi] [REL]

Authors: Jiarui Yu, Zhiyu He, Wenbin Wang, Colin N. Jones, Florian Dörfler, Hanmin Cai

The growing adoption of electric vehicles (EVs) is increasing peak demand in distribution systems, which can threaten grid stability and reduce operational efficiency. Dynamic electricity pricing is a promising means of mitigating these peaks by shifting flexible demand. However, most existing approaches rely on detailed user-level consumption data and behavioral models, which are often difficult to obtain in practice and may raise privacy concerns. This paper proposes an Online Feedback Optimization (OFO) algorithm for day-ahead price design with limited data, where only aggregate loads are observed. OFO updates prices iteratively using aggregate load measurements, enabling effective peak reduction without access to individual user data. The formulation also includes a term that penalizes deviations in total electricity cost relative to a reference tariff. Although relying only on aggregate load measurements, the OFO price updates efficiently converge to the optimal price. In finite-horizon simulations, OFO achieves peak reduction close to that of the Stackelberg benchmark with full model information. Meanwhile, its computational effort is substantially lower. Additional tests under multiple initial conditions and delayed charging-window mismatch further confirm the robustness of the proposed method. Overall, these results show that OFO is a scalable and computationally efficient approach for peak-demand management in distribution systems with limited observability.

Subject: Systems and Control

Publish: 2026-05-26 12:01:42 UTC


#17 GScomp-QA: A Subjective Dataset for Quality Assessment of Compressed Gaussian Splatting [PDF] [Copy] [Kimi] [REL]

Authors: Pedro Martin, António Rodrigues, João Ascenso, Maria Paula Queluz

Gaussian Splatting (GS) has emerged as an efficient representation for high-quality 3D reconstruction and novel view synthesis. However, its large model size poses challenges for storage and transmission. While several GS compression solutions have been proposed, their perceptual impact remains poorly understood due to the lack of dedicated evaluation datasets. To address this gap, this paper introduces GScomp-QA, a subjective quality assessment dataset for evaluating synthesis quality from compressed GS models. The dataset comprises 331 video stimuli from 13 real-world scenes, covering 9 state-of-the-art GS compression solutions. By using videos synthesized from uncompressed models as reference, GScomp-QA isolates compression-induced distortions from synthesis artifacts. A subjective study with 20 participants was conducted, providing reliable perceptual scores. Based on these data, GS compression solutions are evaluated through perceptual rate-distortion analysis. In addition, 18 objective quality metrics are evaluated, showing that they do not fully capture GS-specific distortions. GScomp-QA will be publicly available and provide a benchmark for evaluating GS compression solutions and supporting the development of quality metrics tailored to GS compression.

Subjects: Image and Video Processing , Multimedia

Publish: 2026-05-26 11:41:10 UTC


#18 G-iMUSIC: Greedy Iterative MUSIC Algorithms for Multi-Target DoA Estimation [PDF] [Copy] [Kimi] [REL]

Authors: Martin Willame, Gilles Monnoyer, François Horlin, Jérôme Louveaux

This paper presents novel algorithms for multi-target direction-of-arrival (DoA) estimation in array signal processing. Although the maximum likelihood estimator (MLE) asymptotically attains the Cramér-Rao bound, its exponential complexity motivates practical alternatives, such as greedy or subspace-based methods. In this context, greedy methods such as orthogonal matching pursuit (OMP) and orthogonal least squares (OLS) are sensitive to early selection errors, especially for angularly proximate targets, whereas subspace-based methods such as multiple signal classification (MUSIC) present angular super-resolution capabilities but degrade under strong inter-target signal correlation. To overcome these limitations, we propose two greedy iterative MUSIC (G-iMUSIC) algorithms, namely OMP-iMUSIC and OLS-iMUSIC, derived from a unified framework that links subspace and greedy estimations. Unlike prior iMUSIC approaches, the proposed methods require only one initial eigen value decomposition (EVD) and avoid computing eigendecomposition at each iteration. They also admit Fast Fourier Transform (FFT)-accelerated implementations for uniform linear arrays (ULAs), enabling low-complexity operation. Monte Carlo simulations demonstrate improved detection and precision over conventional OMP, OLS, and MUSIC, as well as reduced processing time compared to greedy baselines. Finally, we introduce diagnostic metrics that interpret performance across signal correlation and angular proximity regimes, supporting generalization beyond the specific orthogonal frequency-division multiplexing (OFDM) radar scenario considered.

Subject: Signal Processing

Publish: 2026-05-26 11:32:49 UTC


#19 Critical Infrastructure Defense Against Aerial Swarms Under Sensing Uncertainty: Online Allocation With Finite-Time Guarantees [PDF] [Copy] [Kimi] [REL]

Authors: Shriya Pandey, Devaprakash Muniraj

This article presents a closed-loop, uncertainty-aware framework for defending a protected zone against coordinated incursions by swarms of small uncrewed aircraft systems (UAS). The interaction structure of the attackers is modeled as time-varying, while defenders operate under imperfect sensing. The proposed criticality-driven defender-to-attacker assignment strategy integrates three components: a probabilistic graph-based representation of the attacking swarm inferred from uncertain observations; a risk-aware attacker criticality model combining time-to-breach urgency with uncertainty; an online defender allocation mechanism that assigns and selectively reassigns defenders while limiting switching-induced instability through robust execution constraints. Analytical guarantees are established within a filtration-based first-hitting-time framework. In particular, finite-time triggering of the first capture event following detection is proven, and explicit mixed linear-geometric upper bounds are derived for the expected neutralization time. Monte Carlo simulations demonstrate the effectiveness of the proposed framework, achieving 85.6% neutralization efficiency under probabilistic sensing and 99.9% under deterministic sensing. Systematic ablation and sensitivity studies further quantify how detection thresholds and coordination parameters influence reliability and time-to-first-capture.

Subject: Systems and Control

Publish: 2026-05-26 10:54:20 UTC


#20 Same Signal, Different Story: Demystifying Receiver Effects in Wi-Fi Channel State Information [PDF] [Copy] [Kimi] [REL]

Authors: Fabian Portner, Francesco Gringoli, Matthias Hollick, Arash Asadi

Wi-Fi sensing has emerged as a versatile tool for tasks such as localization, gesture recognition, and vital-sign monitoring, enabling applications from smart environments to personalized healthcare. However, sensing accuracy often significantly degrades when pretrained models are deployed across different commodity receivers. We present the first systematic comparison of Channel State Information (CSI) across diverse Commercial Off-The-Shelf Wi-Fi sensing platforms. Using a unified experimental setup delivering precisely precoded signals simultaneously to multiple receivers, we isolate receiver-specific variability. We find that dominant cross-device differences arise from Automatic Gain Control and consistent subcarrier nonlinearities. We propose a simple gain-alignment preprocessing step, recovering most of the lost accuracy (up to 75%) in cross-device Human Activity Recognition model deployments. Without preprocessing, model accuracy sharply drops-effectively breaking practical deployments. Additional analyses reveal measurable inherent differences in receiver faithfulness, sensitivity and noise. While these receiver-induced differences do not significantly affect robust sensing tasks such as Human Activity Recognition, they become relevant in scenarios demanding high precision (e.g., single-shot time of flight). Our findings demonstrate that cross-device variability in CSI is real but manageable, and we provide tools and guidelines for robust, hardware-agnostic Wi-Fi sensing.

Subject: Signal Processing

Publish: 2026-05-26 10:53:09 UTC


#21 CFMDCTCodec: A Low-Bitrate Neural Speech Codec with Noise-Prior-aware Conditional Flow Matching for MDCT-Spectral Enhancement [PDF] [Copy] [Kimi] [REL]

Authors: Xiao-Hang Jiang, Yang Ai, Hui-Peng Du, Zhen-Hua Ling, Ji Wu

High-quality speech coding at low bitrates is crucial for bandwidth-constrained applications, yet remains challenging due to the severe loss of quality-critical information in highly compressed representations. To overcome this challenge, we propose CFMDCTCodec, a low-bitrate neural speech codec that operates entirely in the modified discrete cosine transform (MDCT) domain. CFMDCTCodec integrates a lightweight encoder-quantizer-decoder-style MDCT-spectral codec with a noise-prior-aware, conditional-flow-matching (CFM)-based MDCT-spectral enhancer. Within this framework, the codec serves as a base module that compactly discretizes the MDCT spectrum extracted from speech and produces an initial coarse reconstruction, while the enhancer further restores fine-grained spectral details. The enhancer improves the decoded MDCT spectrum by integrating a conditional MDCT velocity-field filter with an ordinary differential equation (ODE) solver, under the guidance of an MDCT-derived magnitude-adaptive noise prior, aiming to emphasize perceptually significant high-energy regions while stabilizing low-energy and silent regions. Finally, the enhanced MDCT spectrum is reconstructed into the decoded speech using the inverse MDCT. When optimizing CFMDCTCodec, we adopt a unified non-adversarial training strategy that jointly combines reconstruction, quantization and CFM objectives. Both objective and subjective evaluations show that CFMDCTCodec outperforms competitive baselines in low-bitrate regimes, e.g., 0.65 kbps, while approaching the perceptual quality of large-scale codecs with significantly fewer parameters and computations.

Subject: Audio and Speech Processing

Publish: 2026-05-26 10:30:03 UTC


#22 Incentive-Based Load Curtailment with Limited Information: A Bilevel Zeroth-Order Learning Approach [PDF] [Copy] [Kimi] [REL]

Authors: Zhisen Jiang, Florian Dörfler, Saverio Bolognani

Incentive-based load curtailment unlocks critical demand-side flexibility but is hindered by the limited knowledge of private user parameters and the inherent nonsmoothness of responses due to physical device constraints. We address this via a constrained bilevel optimization framework and propose the Bi-ZOL (Bilevel Zeroth-Order Learning) algorithm. Unlike conventional black-box methods, Bi-ZOL exploits the bilevel structure to decompose the hypergradient, integrating the exact analytical information of the SO's objective with a zeroth-order estimate of the unknown response sensitivity. This structural decomposition-based learning method mathematically smoothes the nonsmooth response landscape and reduces hypergradient estimation error. We provide theoretical convergence guarantees to an approximate stationary point and demonstrate through simulations that Bi-ZOL achieves near-optimal performance.

Subject: Systems and Control

Publish: 2026-05-26 10:10:18 UTC


#23 Enforcing Soft Monotonicity Constraints for Recursive Gaussian Process Regression in Real Time [PDF] [Copy] [Kimi] [REL]

Authors: Ricus Husmann, Sven Weishaupt, Harald Aschemann

In this work, we introduce a real-time capable algorithm for considering monotonicity assumptions for recursive Gaussian Process regression (RGP). Therefore, we present how to efficiently calculate the RGP gradients online. Then, we utilize an extended Kalman filter and pseudo-measurements in combination with a ReLU pseudo-measurement function to enforce soft inequality constraints. This work builds upon a previously published conference paper with the same goal and a similar fundamental approach. Opposite to our previous work, however, we now use an exact covariance calculation for the RGP gradients. Furthermore, we also present a real-time optimized version of this algorithm with less simplifications compared to the previously published version. These and several other algorithmic innovations lead to an algorithm with greatly improved numerical robustness. The algorithm is validated and compared to its previously published version for a 2D numerical example. The paper is concluded with a successful experimental validation of the developed algorithm for the monotonicity-preserving learning of pneumatic valve characteristics for the control of a pneumatic system, leveraging a partial input - output linearization.

Subject: Systems and Control

Publish: 2026-05-26 09:59:04 UTC


#24 Multimodal Signal Restoration with Signed Twofold Graph Learning [PDF] [Copy] [Kimi] [REL]

Authors: Haruki Yokota, Hiroshi Higashi, Yuichi Tanaka

Multimodal signals on sensor networks are commonly modeled under the twofold graph assumption (TGA), which represents spatial structure and inter-modality relations as two separate graphs. Existing TGA-based signal restoration methods, however, either assume the graphs are known or restrict edge weights to be non-negative, preventing them from capturing negative inter-modal correlations. We address both limitations by formulating joint signal restoration and twofold graph learning as MAP estimation under a matrix normal prior, where the spatial and modality graph Laplacians appear directly as precision matrices. The resulting non-convex objective is solved by alternating minimization: The signal is updated via conjugate gradient applied to the arising Sylvester-type linear system; the graphs are updated via primal-dual hybrid gradient (PDHG). We further propose a method to estimate the signed structure of the modality graph from the dominant eigenspace of a complementary kernel matrix, which is then used in PDHG to update edge magnitudes. These iterative solvers are then unrolled into a feedforward network, with regularization weights and step sizes as layer-wise trainable parameters. Experiments on synthetic multimodal graph signals and a real Japan meteorological dataset confirm that the proposed method outperforms existing baselines across a range of noise levels and missing-data patterns.

Subject: Signal Processing

Publish: 2026-05-26 09:31:05 UTC


#25 Reconstructing 3D Neural Hemodynamics using Sparse Ultrasound Localization Microscopy Data [PDF] [Copy] [Kimi] [REL]

Authors: Jipeng Yan, Oscar Bates, Jingwen Zhu, Qingyuan Tan, Biao Huang, John Goodwin, Andriy S. Kozlov, Chris Dunsby, Meng-Xing Tang

Ultrasound Localization Microscopy (ULM) has presented great potential in functional imaging, benefiting from its ability to reconstruct deep microvasculature. However, the hemodynamic reconstruction is compromised by sparsity in the ULM data, as a limited number of MB tracks cannot sample the complete speed profile in one vessel. Here, we propose to reconstruct hemodynamics using sparse ULM velocity maps by solving a laminar flow model through stochastic variational inference. In addition to vascular geometry and flow velocity maps, the proposed method generates two new ULM maps - a pressure gradient map and a map describing uncertainty of the estimation. By investigating the effect of sparsity in ULM maps on the quantification and visualization of hemodynamics, we demonstrate the effectiveness of the proposed method in dealing with sparse ULM maps via simulations and 3D rat brain imaging. Accurately reconstructing a broad range of hemodynamic parameters and associate uncertanties using sparse ULM data may help detect subtle and dynamic brain activity.

Subject: Image and Video Processing

Publish: 2026-05-26 09:26:01 UTC