Electrical Engineering and Systems Science

2025-12-05 | Total: 72

#1 The Evolving Landscape of Interactive Surface Sensing Technologies

Authors: David Wang, Wilson Chen, Tianju Wang, Jiale Zhang

Interactive surfaces have evolved from capacitive touch and IR-based systems into a diverse ecosystem of sensing technologies that support rich and expressive human-computer interaction. This survey traces that progression, beginning with infrared vision-based approaches, such as FTIR and diffuse illumination, and the rise of capacitive touch as the dominant technology in modern devices, before turning to contemporary modalities including vision and acoustic sensing. Technologies still under development are also discussed, including mmWave radar and vibration-based techniques. Each sensing technique is examined in terms of its operating principles, resolution, scalability, and applications, along with discussions of multimodal integration. By comparing tradeoffs between sensing modalities, the survey highlights the technical and design factors that shape interactive surface performance and user experience. The review concludes by identifying persistent challenges, including sensing accuracy, power constraints, and privacy concerns, and outlines how emerging sensing modalities can enable future interactive environments to be ubiquitous and intelligent.

Subject: Systems and Control

Publish: 2025-12-04 18:32:10 UTC


#2 A Randomized Scheduling Framework for Privacy-Preserving Multi-robot Rendezvous given Prior Information

Authors: Le Liu, Yu Kawano, Ming Cao

Privacy has become a critical concern in modern multi-robot systems, driven by both ethical considerations and operational constraints. As a result, growing attention has been directed toward privacy-preserving coordination in dynamical multi-robot systems. This work introduces a randomized scheduling mechanism for privacy-preserving robot rendezvous. The proposed approach achieves improved privacy even at lower communication rates, where privacy is quantified via pointwise maximal leakage. We show that lower transmission rates provide stronger privacy guarantees and prove that rendezvous is still achieved under the randomized scheduling mechanism. Numerical simulations are provided to demonstrate the effectiveness of the method.
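To make the idea of randomized scheduling concrete, here is a minimal toy sketch, not the authors' mechanism: each robot broadcasts its position with a fixed probability per step, and every robot moves a fraction of the way toward the mean of the broadcast positions. The names `randomized_rendezvous`, `p_transmit`, and `gain` are hypothetical, and the privacy measure (pointwise maximal leakage) is not computed here.

```python
import math
import random


def randomized_rendezvous(positions, p_transmit, steps, gain=0.5, seed=0):
    """Toy rendezvous with Bernoulli-scheduled broadcasts (illustrative only)."""
    rng = random.Random(seed)  # seeded for reproducibility
    pos = [tuple(q) for q in positions]
    transmissions = 0
    for _ in range(steps):
        # Each robot independently decides whether to broadcast this step.
        senders = [q for q in pos if rng.random() < p_transmit]
        transmissions += len(senders)
        if not senders:
            continue  # nobody transmitted, so nobody moves
        cx = sum(q[0] for q in senders) / len(senders)
        cy = sum(q[1] for q in senders) / len(senders)
        # Every robot moves a fraction `gain` toward the broadcast mean,
        # which shrinks all pairwise distances by the factor (1 - gain).
        pos = [(x + gain * (cx - x), y + gain * (cy - y)) for x, y in pos]
    return pos, transmissions


def spread(pos):
    """Largest pairwise distance between robots."""
    return max(math.dist(a, b) for a in pos for b in pos)


if __name__ == "__main__":
    start = [(0.0, 0.0), (4.0, 0.0), (0.0, 3.0), (5.0, 5.0)]
    final, sent = randomized_rendezvous(start, p_transmit=0.3, steps=300)
    print(spread(final), sent)
```

In this toy model the robots still meet at a low transmission probability, only more slowly, which mirrors the abstract's claim that rendezvous survives reduced communication.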

Subject: Systems and Control

Publish: 2025-12-04 18:07:17 UTC


#3 Efficient Decoders for Sensing Subspace Code

Authors: Siva Aditya Gooty, Hessam Mahdavifar

Sparse antenna array sensing of a source/target via direction of arrival (DoA) estimation motivates the design of the sensing framework in joint communication and sensing (JCAS) systems for sixth-generation (6G) communication systems. Mahdavifar, Rajamäki, and Pal recently established that the array geometry of sparse arrays has fundamental connections with the design of subspace codes in coding theory. This connection was then used to design efficient sensing subspace codes that estimate the DoA with good resolution. Specifically, the Bose-Chowla sensing subspace code provides a near-optimal code design for unique DoA estimation with a tight theoretical upper bound on the error performance. However, the currently known decoder for these codes, used to estimate the DoA, is a traditional maximum a posteriori (MAP) decoder with complexity that is cubic in the number of antennas. In this work, we propose novel efficient decoding algorithms for sensing subspace codes that reduce the complexity to quadratic while providing new knobs for trading off complexity against error performance. The decoders are further evaluated via Monte Carlo simulations over a range of SNRs, demonstrating promising performance that smoothly approaches the MAP performance as the complexity grows from quadratic to cubic in the number of antennas.

Subject: Signal Processing

Publish: 2025-12-04 17:46:17 UTC


#4 Generalized Pinching-Antenna Systems: A Leaky-Coaxial-Cable Perspective

Authors: Kaidi Wang, Zhiguo Ding, Lajos Hanzo

The evolution toward sixth-generation (6G) wireless networks calls for flexible reconfigurable antenna architectures capable of adapting their radiation characteristics to the surrounding environment. While waveguide-based pinching antennas have been shown to improve wireless propagation environments, their applications have remained confined to high-frequency scenarios. As a remedy, we propose a downlink generalized pinching-antenna system that adapts this compelling concept to low-frequency operation through a leaky-coaxial-cable (LCX) implementation. By endowing LCX structures with controllable radiation slots, the system inherits the key capabilities of waveguide-based pinching antennas. Explicitly, these include reconfigurable line-of-sight (LoS) links, reduced path loss, and flexible deployment, while supporting a practical implementation of the pinching-antenna concept at low frequencies. A twin-stage propagation model is developed for characterizing both the guided transmission and the wireless radiation encountered over LoS and non-line-of-sight (NLoS) paths. Analytical results reveal strong local gain, complemented by rapid distance-dependent decay. Hence, we conceive a matching joint optimization framework, which maximizes throughput by harnessing game-theoretic association and convex power allocation. Simulation results demonstrate substantial performance gains over conventional fixed-antenna benchmarks.

Subject: Signal Processing

Publish: 2025-12-04 16:50:57 UTC


#5 HiPPO: Exploring A Novel Hierarchical Pronunciation Assessment Approach for Spoken Languages

Authors: Bi-Cheng Yan, Hsin-Wei Wang, Fu-An Chao, Tien-Hong Lo, Yung-Chang Hsu, Berlin Chen

Automatic pronunciation assessment (APA) seeks to quantify a second language (L2) learner's pronunciation proficiency in a target language by offering timely and fine-grained diagnostic feedback. Most existing efforts on APA have predominantly concentrated on highly constrained reading-aloud tasks (where learners are prompted to read a reference text aloud); however, assessing pronunciation quality in unscripted speech (or free-speaking scenarios) remains relatively underexplored. In light of this, we first propose HiPPO, a hierarchical pronunciation assessment model tailored for spoken languages, which evaluates an L2 learner's oral proficiency at multiple linguistic levels based solely on the speech uttered by the learner. To improve the overall accuracy of assessment, a contrastive ordinal regularizer and a curriculum learning strategy are introduced for model training. The former aims to generate score-discriminative features by exploiting the ordinal nature of regression targets, while the latter gradually ramps up the training complexity to facilitate the assessment task that takes unscripted speech as input. Experiments conducted on the Speechocean762 benchmark dataset validate the feasibility and superiority of our method over several cutting-edge baselines.

Subject: Audio and Speech Processing

Publish: 2025-12-04 16:32:32 UTC


#6 TripleC Learning and Lightweight Speech Enhancement for Multi-Condition Target Speech Extraction

Author: Ziling Huang

In our recent work, we proposed Lightweight Speech Enhancement Guided Target Speech Extraction (LGTSE) and demonstrated its effectiveness in multi-speaker-plus-noise scenarios. However, real-world applications often involve more diverse and complex conditions, such as one-speaker-plus-noise or two-speaker-without-noise. To address this challenge, we extend LGTSE with a Cross-Condition Consistency learning strategy, termed TripleC Learning. This strategy is first validated under the multi-speaker-plus-noise condition and then evaluated for its generalization across diverse scenarios. Moreover, building upon the lightweight front-end denoiser in LGTSE, which can flexibly process both noisy and clean mixtures and shows strong generalization to unseen conditions, we integrate TripleC Learning with a proposed parallel universal training scheme that organizes batches containing multiple scenarios for the same target speaker. By enforcing consistent extraction across different conditions, easier cases can assist harder ones, thereby fully exploiting diverse training data and fostering a robust universal model. Experimental results on the Libri2Mix three-condition tasks demonstrate that the proposed LGTSE with TripleC Learning achieves superior performance over condition-specific models, highlighting its strong potential for universal deployment in real-world speech applications.

Subject: Audio and Speech Processing

Publish: 2025-12-04 16:10:53 UTC


#7 Markov-Renewal Single-Photon LiDAR Simulator

Authors: Weijian Zhang, Prateek Chennuri, Hashan K. Weerasooriya, Bole Ma, Stanley H. Chan

Single-photon LiDAR (SP-LiDAR) simulators face a dilemma: fast but inaccurate Poisson models or accurate but prohibitively slow sequential models. This paper breaks that compromise. We present a simulator that achieves both fidelity and speed by focusing on the critical, yet overlooked, component of simulation: the photon count statistics. Our key contribution is a Markov-renewal process (MRP) formulation that, for the first time, analytically predicts the mean and variance of registered photon counts under dead time. To make this MRP model computationally tractable, we introduce a spectral truncation rule that efficiently computes the complex covariance statistics. By proving the shift-invariance of the process, we extend this per-pixel model to full histogram cube generation via a precomputed lookup table. Our method generates 3D cubes indistinguishable from the sequential gold standard, yet is orders of magnitude faster. This finally enables large-scale, physically faithful data generation for learning-based SP-LiDAR reconstruction.

Subject: Signal Processing

Publish: 2025-12-04 15:55:59 UTC


#8 Distributed Riemannian Optimization in Geodesically Non-convex Environments

Authors: Xiuheng Wang, Ricardo Borsoi, Cédric Richard, Ali H. Sayed

This paper studies the problem of distributed Riemannian optimization over a network of agents whose cost functions are geodesically smooth but possibly geodesically non-convex. Extending a well-known distributed optimization strategy called diffusion adaptation to Riemannian manifolds, we show that the resulting algorithm, Riemannian diffusion adaptation, provably exhibits several desirable behaviors when minimizing a sum of geodesically smooth non-convex functions over manifolds of bounded curvature. More specifically, we establish that the algorithm can approximately achieve network agreement in the sense that the Fréchet variance of the iterates among the agents is small. Moreover, the algorithm is guaranteed to converge to a first-order stationary point for general geodesically non-convex cost functions. When the global cost function additionally satisfies the Riemannian Polyak-Łojasiewicz (PL) condition, we also show that it converges linearly under a constant step size up to a steady-state error. Finally, we apply this algorithm to a decentralized robust principal component analysis (PCA) problem formulated on the Grassmann manifold and illustrate its convergence and performance through numerical simulations.
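For reference, the PL condition on a Riemannian manifold $\mathcal{M}$ is commonly stated as below. The symbols are notation assumed here rather than taken from the abstract: $f^\star$ is the minimum value of the global cost $f$, $\mu > 0$ is the PL constant, $\operatorname{grad} f$ is the Riemannian gradient, and $\|\cdot\|_x$ is the norm on the tangent space at $x$. The paper's exact formulation may differ.

```latex
\frac{1}{2}\,\bigl\| \operatorname{grad} f(x) \bigr\|_x^2 \;\ge\; \mu \,\bigl( f(x) - f^\star \bigr)
\quad \text{for all } x \in \mathcal{M}, \qquad \mu > 0 .
```

The condition bounds the suboptimality gap by the squared gradient norm without requiring geodesic convexity, which is what makes a linear rate possible in the non-convex setting.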

Subject: Signal Processing

Publish: 2025-12-04 15:43:55 UTC


#9 Analytical and Cross-Sectional Clinical Validity of a Smartphone-Based U-Turn Test in Multiple Sclerosis

Authors: Marta Płonka, Rafał Klimas, Dimitar Stanev, Lorenza Angelini, Natan Napiórkowski, Gabriela González Chan, Lisa Bunn, Paul S Glazier, Richard Hosking, Jenny Freeman, Jeremy Hobart, Mattia Zanon, Jonathan Marsden, Licinio Craveiro, Mike D Rinderknecht

The observational GaitLab study (ISRCTN15993728) enrolled adult people with multiple sclerosis (PwMS) with Expanded Disability Status Scale (EDSS) <=6.5. PwMS performed the U-Turn Test (UTT), a smartphone-based assessment of dynamic balance, in a gait laboratory (supervised setting) using 6 smartphones at different body locations and daily during a 2-week remote period (unsupervised setting) using 1 smartphone. In the supervised setting, the accuracy of detecting turns with smartphones was compared against turns detected with a motion capture system (mocap) using F1 scores. Agreement between turn speed measured with smartphones and mocap was assessed by intraclass correlation coefficient (ICC[3,1]) and bias. In the unsupervised setting, test-retest reliability was assessed by ICC(2,1), and correlations with clinical and patient-reported measures by Spearman rank correlation. Ninety-six PwMS were included. In the supervised setting, turns were detected with high accuracy (F1 scores >95% across smartphone wear locations). Smartphone-derived turn speed was comparable across the supervised (1.44 rad/s) and unsupervised settings (1.47 rad/s), and with mocap-derived turn speed (1.47 rad/s). ICC(3,1) revealed high agreement between smartphone- and mocap-derived turn speed (ICC[3,1]: 0.87-0.92 across smartphone wear locations). Bias was minimal (-0.04 to 0.11 rad/s). In the unsupervised setting, test-retest reliability (ICC[2,1]) was >0.90 when aggregating >=2 tests. The UTT correlated with Timed 25-Foot Walk gait speed, EDSS, Ambulation score, 12-item Multiple Sclerosis Walking Scale, and Activities-specific Balance Confidence scale (r=-0.79 to -0.61). The UTT measures turn speed accurately and reproducibly irrespective of smartphone wear location and settings. These findings affirm its potential as a valuable tool in multiple sclerosis trials.

Subject: Signal Processing

Publish: 2025-12-04 15:43:13 UTC


#10 Channel-Aware Multi-Domain Feature Extraction for Automatic Modulation Recognition in MIMO Systems

Authors: Yunpeng Qu, Yazhou Sun, Bingyu Hui, Jintao Wang, Jian Wang

Automatic modulation recognition (AMR) is a key technology in non-cooperative communication systems, aiming to identify the modulation scheme from signals without prior information. Deep learning (DL)-based methods have gained wide attention due to their excellent performance, but research mainly focuses on single-input single-output (SISO) systems, with limited exploration of multiple-input multiple-output (MIMO) systems. The confounding effects of multi-antenna channels can interfere with the statistical properties of MIMO signals, making identification particularly challenging. To overcome these limitations, we propose a Channel-Aware Multi-Domain feature extraction (CAMD) framework for AMR in MIMO systems. Our CAMD framework reconstructs the transmitted signal through an efficient channel compensation module and achieves a more robust representation capability against channel interference by extracting and integrating multi-domain features, including intra-antenna temporal correlations and inter-antenna channel correlations. We have verified our method on the widely used MIMOSig-Ref dataset, which covers complex mobile channel environments. Extensive experiments confirm the performance advantages of CAMD over previous state-of-the-art methods.

Subject: Signal Processing

Publish: 2025-12-04 15:28:11 UTC


#11 Small-Signal Stability Oriented Real-Time Operation of Power Systems with a High Penetration of Inverter-Based Resources

Authors: Francesca Rossi, Juan Carlos Olives-Camps, Eduardo Prieto-Araujo, Oriol Gomis-Bellmunt

This study proposes a control strategy to ensure the safe operation of modern power systems with high penetration of inverter-based resources (IBRs) within an optimal operation framework. The objective is to obtain operating points that satisfy the optimality conditions of a predefined problem while guaranteeing small-signal stability. The methodology consists of two stages. First, an offline analysis of a set of operating points is performed to derive a data-driven regression-based expression that captures a damping-based stability index as a function of the operating conditions. Second, an Online Feedback Optimization (OFO) controller is employed to drive the system toward an optimal operating point while maintaining a secure distance from the instability region. The proposed strategy is evaluated on an academic test case based on a modified version of the IEEE 9-bus system, in which synchronous generators are replaced by IBRs operating under both grid-following and grid-forming control modes. The results demonstrate the effectiveness of the method and are discussed in detail.

Subject: Systems and Control

Publish: 2025-12-04 15:17:37 UTC


#12 Stability-Guaranteed Dual Kalman Filtering for Electrochemical Battery State Estimation

Authors: Feng Guo, Guangdi Hu, Keyi Liao, Luis D. Couto, Khiem Trad, Ru Hong, Hamid Hamed, Mohammadhosein Safari

Accurate and stable state estimation is critical for battery management. Although dual Kalman filtering can jointly estimate states and parameters, the strong coupling between filters may cause divergence under large initialization errors or model mismatch. This paper proposes a Stability-Guaranteed Dual Kalman Filtering (SG-DKF) method. A Lyapunov-based analysis yields a sufficient stability condition, leading to an adaptive dead-zone rule that suspends parameter updates when the innovation exceeds a stability bound. Applied to an electrochemical battery model, SG-DKF achieves accuracy comparable to a dual EKF and reduces state-of-charge RMSE by over 45% under large initial state errors.
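The dead-zone idea can be illustrated with a minimal scalar sketch. This is an assumption-laden toy, not the SG-DKF update itself: the function name `gated_parameter_update`, the scalar `gain`, and the fixed `bound` are hypothetical, whereas the paper derives its bound from a Lyapunov analysis and adapts it.

```python
def gated_parameter_update(theta, innovation, gain, bound):
    """Apply a parameter correction only when the innovation is within `bound`.

    Returns the (possibly unchanged) parameter and a flag saying whether the
    update was applied. `bound` stands in for the stability bound; here it is
    a plain constant chosen by the caller (hypothetical).
    """
    if abs(innovation) > bound:
        # Innovation too large: suspend the parameter update for this step.
        return theta, False
    # Innovation acceptable: apply a simple linear correction.
    return theta + gain * innovation, True


if __name__ == "__main__":
    theta = 1.0
    for innovation in (0.2, 5.0, -0.4):
        theta, applied = gated_parameter_update(theta, innovation, gain=0.5, bound=2.0)
        print(innovation, applied, theta)
```

Freezing the parameter filter during large innovations keeps a transient state error from being absorbed into the parameter estimate, which is the coupling failure the abstract describes.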

Subject: Systems and Control

Publish: 2025-12-04 15:11:54 UTC


#13 Beampattern Synthesis for Discrete Phase RIS in Communication and Sensing Systems

Authors: Xiao Cai, Hei Victor Cheng, Daniel E. Lucani

Extensive research on Reconfigurable Intelligent Surfaces (RIS) has primarily focused on optimizing reflective coefficients for passive beamforming in specific target directions. This optimization typically assumes prior knowledge of the target direction, which is unavailable before the target is detected. To enhance direction estimation, it is critical to develop array pattern synthesis techniques that yield a wider beam by maximizing the received power over the entire target area. Although this challenge has been addressed with active antennas, RIS systems pose a unique challenge due to their inherent phase constraints, which can be continuous or discrete. This work addresses this challenge through a novel array pattern synthesis method tailored for discrete phase constraints in RIS. We introduce a penalty method that pushes these constraints to the boundary of the convex hull. Then, the Minorization-Maximization (MM) method is utilized to reformulate the problem into a convex one. Our numerical results show that our algorithm can generate a wide beam pattern comparable to that achievable with per-power constraints, with both the amplitudes and phases being adjustable. We compare our method with a traditional beam sweeping technique, showing a) a reduction of several orders of magnitude in the MSE of the Angle of Arrival (AOA) at low to medium signal-to-noise ratios (SNRs); and b) an 8 dB SNR reduction to achieve a high probability of detection.

Subjects: Signal Processing , Information Theory

Publish: 2025-12-04 15:11:20 UTC


#14 Cute but Cunning: Effective Closed-Form Alternatives to the Exact Lognormal Statistics

Authors: Carlos Rafael Nogueira da Silva, Maria Cecilia Luna Alvarado, Fernando Darío Almeida García, Michel Daoud Yacoub

The Lognormal distribution is a fundamental statistical model widely used in different fields of science, including biology, finance, economics, and engineering. In wireless communications, it is the primary statistic for large-scale fading modeling. However, its known analytical intractability presents persistent challenges for channel characterization and performance analysis. This paper introduces two effective and mathematically tractable surrogate models for the Lognormal distribution, constructed from the product of Nakagami-$m$ and Inverse Nakagami-$m$ (I-Nakagami-$m$) variates. These models yield asymptotically exact closed-form expressions for key performance metrics -- including the characteristic function, bit error rate, and Shannon's capacity -- and enable analytically tractable expressions for the probability density function and cumulative distribution function of the composite $α$-$μ$-Lognormal fading model. To facilitate implementation, a moment-matching framework is developed to map the Lognormal parameters to the surrogate model parameters. In addition, a random mixture approach is proposed to enhance convergence by exploiting the complementary approximation properties of the Nakagami-$m$ and I-Nakagami-$m$ distributions. The methodology is further extended to heterogeneous cascaded fading channels comprising arbitrary combinations of $α$-$μ$, $κ$-$μ$, and $η$-$μ$ variates, for which moment-based mappings to the equivalent Lognormal distributions are derived. Numerical results confirm the accuracy and efficiency of the proposed approach, positioning it as a practical and reliable alternative to exact Lognormal statistics.
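Any moment-matching scheme starts from the target moments. As a small sketch of that first step only, the raw moments of a Lognormal variate $X = e^{Y}$ with $Y \sim \mathcal{N}(\mu, \sigma^2)$ are $E[X^k] = \exp(k\mu + k^2\sigma^2/2)$. The function names below are hypothetical, and the paper's mapping from these moments to the surrogate (Nakagami-$m$ and I-Nakagami-$m$) parameters is not given in the abstract, so it is not attempted here.

```python
import math


def lognormal_moment(mu, sigma, k):
    """k-th raw moment of exp(Y) with Y ~ Normal(mu, sigma**2)."""
    return math.exp(k * mu + 0.5 * (k ** 2) * sigma ** 2)


def lognormal_variance(mu, sigma):
    """Variance computed from the first two raw moments."""
    m1 = lognormal_moment(mu, sigma, 1)
    m2 = lognormal_moment(mu, sigma, 2)
    return m2 - m1 ** 2


if __name__ == "__main__":
    # Targets that a surrogate model's parameters would be chosen to match.
    print(lognormal_moment(0.0, 1.0, 1), lognormal_variance(0.0, 1.0))
```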

Subject: Signal Processing

Publish: 2025-12-04 14:56:59 UTC


#15 Safe model-based Reinforcement Learning via Model Predictive Control and Control Barrier Functions

Authors: Kerim Dzhumageldyev, Filippo Airaldi, Azita Dabiri

Optimal control strategies are often combined with safety certificates to ensure both performance and safety in safety-critical systems. A prominent example is combining Model Predictive Control (MPC) with Control Barrier Functions (CBF). Yet, efficiently tuning the MPC parameters and choosing an appropriate class $\mathcal{K}$ function in the CBF are challenging and problem-dependent tasks. This paper introduces a safe model-based Reinforcement Learning (RL) framework where a parametric MPC controller incorporates a CBF constraint with a parameterized class $\mathcal{K}$ function and serves as a function approximator to learn improved safe control policies from data. Three variations of the framework are introduced, distinguished by the way the optimization problem is formulated and the class $\mathcal{K}$ function is parameterized, including neural architectures. Numerical experiments on a discrete double-integrator with static and dynamic obstacles demonstrate that the proposed methods improve performance while ensuring safety.
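A small sketch may help show what a CBF constraint looks like on a discrete double integrator. It uses the common discrete-time condition $h(x_{k+1}) \ge (1 - \gamma)\, h(x_k)$, which corresponds to a linear class $\mathcal{K}$ function with slope $\gamma \in (0, 1]$. Everything specific here is an assumption for illustration: the barrier $h(x) = p_{\max} - p$ (a position limit rather than the paper's obstacles), the time step, and the function names. The paper's learned, parameterized class $\mathcal{K}$ function is not reproduced.

```python
def step(state, u, dt=1.0):
    """Discrete double integrator: state = (position, velocity), input = acceleration."""
    p, v = state
    return (p + dt * v + 0.5 * dt * dt * u, v + dt * u)


def barrier(state, p_max=10.0):
    """Hypothetical barrier: nonnegative while the position is at or below p_max."""
    return p_max - state[0]


def cbf_satisfied(state, u, gamma=0.5, dt=1.0, p_max=10.0):
    """Check h(x_next) >= (1 - gamma) * h(x) for a linear class-K function."""
    h_now = barrier(state, p_max)
    h_next = barrier(step(state, u, dt), p_max)
    return h_next >= (1.0 - gamma) * h_now


if __name__ == "__main__":
    print(cbf_satisfied((0.0, 1.0), u=0.0))   # far from the limit
    print(cbf_satisfied((9.0, 1.0), u=0.0))   # about to reach the limit
    print(cbf_satisfied((9.0, 1.0), u=-2.0))  # braking
```

A smaller `gamma` forces the barrier value to decay more slowly, so the controller must react earlier; tuning that tradeoff is the role the abstract assigns to the learned class $\mathcal{K}$ function.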

Subject: Systems and Control

Publish: 2025-12-04 14:38:55 UTC


#16 Counterfactual Explanations for Power System Optimisation

Authors: Benjamin Fritz, Waqquas Bukhsh

Enhanced computational capabilities of modern decision-making software have allowed us to solve increasingly sophisticated optimisation problems. But in complex socio-economic and technical environments such as electricity markets, transparent operation is key to ensuring fair treatment of all parties involved, particularly regarding dispatch decisions. We address this issue by building on the concept of counterfactual explanations, answering questions such as "Why was this generator not dispatched?" by identifying minimum changes in the input parameters that would have changed the optimal solution. Both DC Optimal Power Flow and Unit Commitment problems are considered, wherein the variable parameters are the spatial and temporal demand profiles, respectively. The explanations obtained in this way allow users to identify the most important differences between the real and expected market outcomes and observe which constraints have led to the solution. The framework uses a bilevel optimisation problem to find the counterfactual demand scenarios. State-of-the-art methods are compared with data-driven heuristics on the basis of computational efficiency and explanation accuracy. Results show that leveraging historical data from previously solved instances can provide significant speed benefits and allows us to derive explanations in cases where conventional methods would not be tractable.

Subject: Systems and Control

Publish: 2025-12-04 14:16:19 UTC


#17 Constrained Control of PDE Traffic Flow via Spatial Control Barrier Functions

Authors: Brian Block, Stephanie Stockar

In this paper, a constrained control approach to variable speed limit (VSL) control for macroscopic partial differential equation (PDE) traffic models is developed. Control Lyapunov function (CLF) theory for ordinary differential equations (ODE) is extended to account for spatially and temporally varying states and control inputs. The stabilizing CLF is then unified with safety constraints through the introduction of spatially varying control barrier functions (sCBF). These methods are applied to in-domain VSL control of the Lighthill-Whitham-Richards (LWR) model to regulate traffic density to a desired profile while ensuring the density remains below prescribed limits enforced by the sCBF. Results show that incorporating constrained control minimally affects the stabilizing control input while successfully maintaining the density within the defined safe set.
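As a reading aid, the LWR model is a scalar conservation law for the traffic density $\rho(x,t)$. The sketch below writes it with a speed field $v(x,t)$ treated as the in-domain VSL input, together with one natural candidate for a spatially varying barrier. Both the way the control enters and the barrier $h$ with limit $\rho_{\max}(x)$ are assumptions made here for illustration; the paper's exact formulation may differ.

```latex
\partial_t \rho(x,t) + \partial_x \bigl( \rho(x,t)\, v(x,t) \bigr) = 0,
\qquad
h(x,t) = \rho_{\max}(x) - \rho(x,t) \;\ge\; 0 .
```

Keeping $h(x,t) \ge 0$ at every location corresponds to the density staying below its prescribed limit along the whole road segment.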

Subject: Systems and Control

Publish: 2025-12-04 14:06:36 UTC


#18 Movable Antenna Assisted Flexible Beamforming for Integrated Sensing and Communication in Vehicular Networks

Authors: Luyang Sun, Zhiqing Wei, Haotian Liu, Kan Yu, Zhendong Li, Zhiyong Feng

Integrated sensing and communication (ISAC) has been recognized as a key technology in sixth-generation wireless networks, and the additional spatial degrees of freedom obtained by movable antenna (MA) technology can significantly improve the performance of ISAC systems. This paper considers an ISAC-assisted vehicle-to-infrastructure (V2I) network, where extended Kalman filter-based prediction is combined with real-time optimization to jointly optimize the transmit antenna positions and the beamforming and power allocation vectors in dynamic environments. We propose two algorithms: a preprocessing-Schur complement-projected gradient ascent algorithm for scenarios without sensing quality of service (QoS) constraints, which explores the potential range of sensing performance to provide a reference and a warm start for subsequent constrained optimization; and a heuristic reflective projected dynamic particle swarm optimization algorithm for sensing QoS-constrained scenarios, which achieves substantial performance gains under non-convex constraints with a small number of iterations. Simulation results demonstrate that these approaches increase the communication sum-rate and lower the Cramér-Rao lower bound of motion parameter estimation, validating the effectiveness of MA-assisted beamforming in dynamic V2I ISAC networks.

Subject: Signal Processing

Publish: 2025-12-04 13:54:37 UTC


#19 Towards predicting binaural audio quality in listeners with normal and impaired hearing

Authors: Thomas Biberger, Stephan D. Ewert

Eurich et al. (2024) recently introduced the computationally efficient monaural and binaural audio quality model (eMoBi-Q). This model integrates both monaural and binaural auditory features and has been validated across six audio datasets encompassing quality ratings for music and speech, processed via algorithms commonly employed in modern hearing devices (e.g., acoustic transparency, feedback cancellation, and binaural beamforming) or presented via loudspeakers. In the current study, we expand eMoBi-Q to account for perceptual effects of sensorineural hearing loss (HL) on audio quality. For this, the model was extended by a nonlinear auditory filterbank. Given that altered loudness perception is a prevalent issue among listeners with hearing impairment, our goal is to incorporate loudness as a sub-dimension for predicting audio quality in both normal-hearing and hearing-impaired populations. While predicting loudness itself is important in the context of loudness-based hearing aid fitting, loudness as an audio quality sub-measure may be helpful for the selection of reliable auditory features in hearing-impaired listeners. The parameters of the filterbank and subsequent processing stages were informed by the physiologically based (binaural) loudness model proposed by Pieper et al. (2018). This study presents and discusses the initial implementation of the extended binaural quality model.

Subject: Audio and Speech Processing

Publish: 2025-12-04 13:38:44 UTC


#20 Pick-to-Learn for Systems and Control: Data-driven Synthesis with State-of-the-art Safety Guarantees

Authors: Dario Paccagnan, Daniel Marks, Marco C. Campi, Simone Garatti

Data-driven methods have become paramount in modern systems and control problems characterized by growing levels of complexity. In safety-critical environments, deploying these methods requires rigorous guarantees, a need that has motivated much recent work at the interface of statistical learning and control. However, many existing approaches achieve this goal at the cost of sacrificing valuable data for testing and calibration, or by constraining the choice of learning algorithm, thus leading to suboptimal performance. In this paper, we describe Pick-to-Learn (P2L) for Systems and Control, a framework that allows any data-driven control method to be equipped with state-of-the-art safety and performance guarantees. P2L enables the use of all available data to jointly synthesize and certify the design, eliminating the need to set aside data for calibration or validation purposes. In presenting a comprehensive version of P2L for systems and control, this paper demonstrates its effectiveness across a range of core problems, including optimal control, reachability analysis, safe synthesis, and robust control. In many of these applications, P2L delivers designs and certificates that outperform commonly employed methods, and shows strong potential for broad applicability in diverse practical settings.

Subjects: Systems and Control , Machine Learning

Publish: 2025-12-04 13:27:11 UTC


#21 Secret Key Generation on Aerial Rician Fading Channels Against a Curious Receiver

Authors: Mattia Piana, Stefano Tomasin

Secret key generation at the physical layer is expected to be a fundamental enabler for next-generation networks. We consider a network where the user equipment is a drone and propose a novel secret key generation solution for the case where the eavesdropper is another node belonging to the network (a curious device). We exploit drone mobility over realistic Rician fading channels. In our protocol, after a prior training phase, drone Alice chooses a trajectory of positions in space and transmits a message to Bob, on the ground, from each position. From the received messages, Bob estimates the channel gain, from which a secret key is extracted. The choice of the positions is made to maximize a lower bound on the secret key capacity. Numerical simulations are used to demonstrate the effectiveness of the proposed approach.

Subject: Signal Processing

Publish: 2025-12-04 12:47:08 UTC


#22 CIG-MAE: Cross-Modal Information-Guided Masked Autoencoder for Self-Supervised WiFi Sensing

Authors: Gang Liu, Yanling Hao, Yixuan Zou

Human Action Recognition using WiFi Channel State Information (CSI) has emerged as an attractive alternative to vision-based methods due to its ubiquity, device-agnostic nature, and inherent privacy-preserving capabilities. However, the high cost of manual annotation and the limited scale of publicly available CSI datasets restrict the performance of supervised approaches. Self-supervised learning (SSL) offers a promising avenue, but existing contrastive paradigms rely on data augmentations that conflict with the physical semantics of radio signals and require large-batch training, making them poorly suited for CSI. To overcome these challenges, we introduce CIG-MAE -- a Cross-modal Information-Guided Masked Autoencoder -- that reconstructs both the amplitude and phase of CSI using a symmetric dual-stream architecture with a high masking ratio. Specifically, we propose an Adaptive Information-Guided Masking strategy that dynamically allocates attention to time-frequency regions with high information density to improve learning efficiency, and incorporate a Barlow Twins regularizer to align cross-modal representations without negative samples. Experiments on three public datasets show that CIG-MAE consistently outperforms SOTA SSL methods and even surpasses a fully supervised baseline, demonstrating superior data efficiency, robustness, and representation generalization.

Subject: Signal Processing

Publish: 2025-12-04 12:06:48 UTC


#23 Pinching-Antenna System Design under Random LoS and NLoS Channels

Authors: Yanqing Xu, Yang Lu, Zhiguo Ding, Tsung-Hui Chang

Pinching antennas, realized through position-adjustable radiating elements along dielectric waveguides, have emerged as a promising flexible-antenna technology thanks to their ability to dynamically reshape large-scale channel conditions. However, most existing studies focus on idealized LoS-dominated environments, overlooking the stochastic nature of realistic wireless propagation. This paper investigates a more practical multiuser pinching-antenna system under a composite probabilistic channel model that captures distance-dependent LoS blockage and NLoS scattering. To account for both efficiency and reliability aspects of communication, two complementary design metrics are considered: an average signal-to-noise ratio (SNR) metric characterizing long-term throughput and fairness, and an outage-constrained metric ensuring a prescribed reliability level. Based on these metrics, we formulate two optimization problems: the first maximizes the max-min average SNR across users, while the second maximizes a guaranteed SNR threshold under per-user outage constraints. Although both problems are inherently nonconvex, we exploit their underlying monotonic structures and develop low-complexity, bisection-based algorithms that achieve globally optimal solutions using only simple scalar evaluations. Extensive simulations validate the effectiveness of the proposed methods and demonstrate that pinching-antenna systems significantly outperform conventional fixed-antenna designs even under random LoS and NLoS channels.
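The bisection idea behind such max-min designs can be illustrated with a small sketch. This is not the paper's algorithm or channel model: the feasibility oracle `toy_feasible`, the per-user gains, and the shared power budget are hypothetical and chosen only because the optimum has a closed form to compare against. The sketch assumes the feasibility test is monotone in the target level `t`, which is the property that makes scalar bisection sufficient.

```python
def bisect_max_feasible(feasible, lo, hi, tol=1e-9):
    """Return (approximately) the largest t in [lo, hi] with feasible(t) True.

    Assumes feasible is monotone: True up to some threshold, False beyond it,
    and that feasible(lo) is True.
    """
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if feasible(mid):
            lo = mid   # target is achievable, try a higher one
        else:
            hi = mid   # target is too ambitious, back off
    return lo

def toy_feasible(t, gains, power_budget):
    # Hypothetical model: user k reaches SNR t with power t / gains[k];
    # the target is feasible if the total required power fits the budget.
    return sum(t / g for g in gains) <= power_budget

gains = [1.0, 2.0, 4.0]
power_budget = 7.0
t_star = bisect_max_feasible(
    lambda t: toy_feasible(t, gains, power_budget),
    lo=0.0,
    hi=power_budget * max(gains),
)
# Closed-form optimum for this toy model, used only as a sanity check.
t_closed_form = power_budget / sum(1.0 / g for g in gains)
```

Each iteration costs one scalar feasibility evaluation, so the number of evaluations grows only logarithmically with the required precision.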

Subjects: Signal Processing, Information Theory

Publish: 2025-12-04 12:03:23 UTC


#24 Multi Task Denoiser Training for Solving Linear Inverse Problems

Authors: Clément Bled, François Pitié

Plug-and-Play Priors (PnP) and Regularisation by Denoising (RED) have established that image denoisers can effectively replace traditional regularisers in linear inverse problem solvers for tasks like super-resolution, demosaicing, and inpainting. It is now well established in the literature that a denoiser's residual links to the gradient of the image log prior (Miyasawa and Tweedie), enabling iterative, gradient ascent-based image generation (e.g., diffusion models), as well as new methods for solving inverse problems. Building on this, we propose enhancing Kadkhodaie and Simoncelli's gradient-based inverse solvers by fine-tuning the denoiser within the iterative solving process itself. Training the denoiser end-to-end across the solver framework and simultaneously across multiple tasks yields a single, versatile denoiser optimised for inverse problems. We demonstrate that even a simple baseline model fine-tuned this way achieves an average PSNR improvement of +1.34 dB across six diverse inverse problems while reducing the required iterations. Furthermore, we analyse the fine-tuned denoiser's properties, finding that its optimisation objective implicitly shifts from minimising standard denoising error (MMSE) towards approximating an ideal prior gradient specifically tailored for guiding inverse recovery.
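For reference, the Miyasawa/Tweedie relation mentioned above can be written as follows, assuming additive white Gaussian noise of variance $\sigma^2$; the symbols $x$, $y$, $n$, and $\hat{x}$ are notation introduced here, not taken from the abstract.

```latex
% Observation model: clean image x corrupted by Gaussian noise n.
y = x + n, \qquad n \sim \mathcal{N}(0, \sigma^2 I)

% The MMSE denoiser is the posterior mean, and its residual is
% proportional to the gradient of the log-density of the noisy image:
\hat{x}(y) = \mathbb{E}[x \mid y] = y + \sigma^2 \nabla_y \log p(y)
\quad\Longrightarrow\quad
\hat{x}(y) - y = \sigma^2 \nabla_y \log p(y)
```

Here $p(y)$ is the density of the noisy observation, that is, the image prior blurred by the Gaussian noise, so a step along the denoiser residual is a gradient-ascent step on that smoothed log prior.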

Subject: Image and Video Processing

Publish: 2025-12-04 11:57:15 UTC


#25 A Unified Low-rank ADI Framework with Shared Linear Solves for Simultaneously Solving Multiple Lyapunov, Sylvester, and Riccati Equations

Authors: Umair Zulfiqar, Zhong-Yi Huang

It is known in the literature that the low-rank ADI method for Lyapunov equations is a Petrov-Galerkin projection algorithm that implicitly performs model order reduction. In this paper, we show that the low-rank ADI methods for Sylvester and Riccati equations are also Petrov-Galerkin projection algorithms that implicitly perform model order reduction. By observing that the ADI methods for Lyapunov, Sylvester, and Riccati equations differ only in pole placement and not in their interpolatory nature, we show that the shifted linear solves, which constitute the bulk of the computational cost, can be shared. The pole-placement step involves only small-scale operations and is therefore inexpensive. We propose a unified ADI framework that requires only two shifted linear solves per iteration to simultaneously solve six Lyapunov equations, one Sylvester equation, and ten Riccati equations, thus substantially increasing the return on investment for the computational cost spent on the linear solves. All operations needed to extract the individual solutions from these shared linear solves are small-scale and inexpensive. Since all ADI methods implicitly perform model order reduction when solving these matrix equations, we show that the resulting reduced-order models can be obtained as an additional byproduct. These models not only interpolate the original transfer function at the mirror images of the ADI shifts but also preserve important system properties such as stability, minimum-phase property, positive-realness, bounded-realness, and passivity. Consequently, the proposed unified ADI framework also serves as a recursive, interpolation-based model order reduction method, which can preserve several important properties of the original model in the reduced-order model.

Subjects: Systems and Control, Numerical Analysis

Publish: 2025-12-04 11:09:45 UTC