Electrical Engineering and Systems Science | Cool Papers

#1 Improving the Estimation of Ship Length via ISAR [PDF¹] [Copy] [Kimi¹] [REL]

A method for estimating the aspect angle of ships at sea from an ISAR is developed. The ISAR AutoTrack (IAT) algorithm uses the information from the adaptive motion compensation velocity to improve the tracker estimation of the ship aspect angle and thus to improve the estimation of ship length. The IAT is based on classical methods of autofocus for synthetic aperture radar. The average mocomp velocity yields the error in the in-range component of the ship velocity; the linear time trend of the velocity determines the cross-range component of the ship velocity. The IAT has two methods for implementing the algorithm, the Search and Analytical methods. Both methods benefit from an intelligent smoothing process that removes system errors, random noise, and ocean waves. The goal of the IAT is to measure ship length to within 10 percent over all azimuth angles and ranges relative to the aircraft and for (unsigned) aspect angles from 5 to 85 degrees. Using the IAT allows a major reduction in the radar resources dedicated to tracking; and since the IAT creates its estimates during the ISAR time window it is unaffected by ship maneuvers. Recommendations for further development and testing of the IAT are presented.

Subject: Signal Processing

Publish: 2026-03-02 18:50:58 UTC

#2 Orchestrating Multimodal DNN Workloads in Wireless Neural Processing [PDF] [Copy] [Kimi] [REL]

Authors: Sai Xu, Kai-Kit Wong, Yanan Du, Hyundong Shin

In edge inference, wireless resource allocation and accelerator-level deep neural network (DNN) scheduling have yet to be co-optimized in an end-to-end manner. The lack of coordination between wireless transmission and accelerator-level DNN execution prevents efficient overlap, leading to higher end-to-end inference latency. To address this issue, this paper investigates multimodal DNN workload orchestration in wireless neural processing (WNP), a paradigm that integrates wireless transmission and multi-core accelerator execution into a unified end-to-end pipeline. First, we develop a unified communication-computation model for multimodal DNN execution and formulate the corresponding optimization problem. Second, we propose O-WiN, a framework that orchestrates DNN workloads in WNP through two tightly coupled stages: simulation-based optimization and runtime execution. Third, we develop two algorithms, RTFS and PACS. RTFS schedules communication and computation sequentially, whereas PACS interleaves them to enable pipeline parallelism by overlapping wireless data transfer with accelerator-level DNN execution. Simulation results demonstrate that PACS significantly outperforms RTFS under high modality heterogeneity by better masking wireless latency through communication-computation overlap, thereby highlighting the effectiveness of communication-computation pipelining in accelerating multimodal DNN execution in WNP.

Subjects: Signal Processing , Machine Learning

Publish: 2026-03-02 17:25:43 UTC

#3 TCG CREST System Description for the DISPLACE-M Challenge [PDF¹] [Copy] [Kimi] [REL]

Authors: Nikhil Raghav, Md Sahidullah

This report presents the TCG CREST system description for Track 1 (Speaker Diarization) of the DISPLACE-M challenge, focusing on naturalistic medical conversations in noisy rural-healthcare scenarios. Our study evaluates the impact of various voice activity detection (VAD) methods and advanced clustering algorithms on overall speaker diarization (SD) performance. We compare and analyze two SD frameworks: a modular pipeline utilizing SpeechBrain with ECAPA-TDNN embeddings, and a state-of-the-art (SOTA) hybrid end-to-end neural diarization system, Diarizen, built on top of a pre-trained WavLM. With these frameworks, we explore diverse clustering techniques, including agglomerative hierarchical clustering (AHC), and multiple novel variants of spectral clustering, such as SC-adapt, SC-PNA, and SC-MK. Experimental results demonstrate that the Diarizen system provides an approximate $39\%$ relative improvement in the diarization error rate (DER) on the post-evaluation analysis of Phase~I compared to the SpeechBrain baseline. Our best-performing submitted system employing the Diarizen baseline with AHC employing a median filtering with a larger context window of $29$ achieved a DER of 10.37\% on the development and 9.21\% on the evaluation sets, respectively. Our team ranked sixth out of the 11 participating teams after the Phase~I evaluation.

Subjects: Audio and Speech Processing , Machine Learning

Publish: 2026-03-02 16:12:47 UTC

#4 A System-of-Systems Convergence Paradigm for Societal Challenges of the Anthropocene [PDF] [Copy] [Kimi] [REL]

Authors: Megan S. Harris, Mohammad Mahdi Naderi, Ehsanoddin Ghorbanichemazkati, Sina Jangjoo, Emily Lapan, Seyed Amirreza Hosseini, Fabian Schipfer, Stephen Craig, Enayat Moallemi, Inas Khayal, Laura M. Arpan, Tian Tang, John C. Little, Amro M. Farid

Modern societal challenges, such as climate change, urbanization, and water resource management, demand integrated, multi-discipline, multi-problem approaches to frame and address their complexity. Unfortunately, current methodologies often operate within disciplinary silos, leading to fragmented insights and missed opportunities for convergence. A critical barrier to cross-disciplinary integration lies in the disparate ontologies that shape how different fields conceptualize and communicate knowledge. To address these limitations, this paper proposes a system-of-systems (SoS) convergence paradigm grounded in a meta-cognition map, a framework that integrates five complementary domains: real-world observations, systems thinking, visual modeling, mathematics, and computing. The paradigm is based on the Systems Modeling Language (SysML), offering a standardized, domain-neutral approach for representing and analyzing complex systems. The proposed methodology is demonstrated through a case study of the Chesapeake Bay Watershed, a socio-environmental system requiring coordination across land use, hydrology, economic and policy domains. By modeling this system with SysML, the study illustrates practical strategies for navigating interdisciplinary challenges and highlights the potential of agile SoS modeling to support large-scale, multi-dimensional decision-making.

Subject: Systems and Control

Publish: 2026-03-02 15:27:13 UTC

#5 The Chebyshev Polynomial Series Frequency Modulation Model for Waveform Design and Analysis [PDF] [Copy] [Kimi] [REL]

Authors: Stephen P. Blackstock, Amaro Tuninetti, Dieter Vanderelst, Laura N. Kloepper, Michael R. Haberman

Polynomial phase signals (PPS) are a staple of waveform design and analysis in sonar, radar, and communications fields. They also find application in the modeling of bioacoustic emissions, especially those of echolocating animals such as bats and odontocetes. This work presents a novel PPS waveform formulation that exploits some special properties of Chebyshev polynomials, such as orthogonality, recurrence relations, and equivalence to trigonometric functions. The result is the Chebyshev Polynomial Frequency Modulation (CPSFM) family of waveforms, which prove useful in the modeling of bioacoustic signals and the approximation of non-polynomial-phase signals such as hyperbolic chirps. We demonstrate that the CPSFM model admits compact analytic expressions for fundamental continuous-time signal processing functions such as the Fourier transform, the convolution and correlation operations, and the ambiguity function. Derivations for these expressions using CPSFM are presented, along with their application to the analysis of biosonar emissions of Mexican free-tailed bats.

Subject: Signal Processing

Publish: 2026-03-02 15:25:43 UTC

#6 A Hetero-functional Graph State Estimator for Watershed Systems: Application to the Chesapeake Bay [PDF] [Copy] [Kimi] [REL]

Authors: Megan S. Harris, John C. Little, Amro M. Farid

Regional watersheds are complex systems of systems encompassing hydrology, land-use decision-making, estuarine ecological feedbacks, and overlapping governance jurisdictions. Their effective management underlies many modern societal challenges and therefore requires models that capture interdependencies between natural and institutional systems. Regional-specific models such as the Chesapeake Assessment Scenario Tool, used in this paper's case study, provide valuable nutrient estimates but rely on structurally opaque watershed routing that limits integration into broader systems-level analyses. This paper introduces a modeling framework for watershed systems. First, a region-independent reference architecture is developed. Second, the Weighted Least Squares Error Hetero-functional Graph State Estimator, an extension of Hetero-functional Graph Theory (HFGT), is adapted to estimate nutrient flows from uncertain data. The framework is demonstrated through instantiation in the Chesapeake Bay Watershed. By establishing a shared ontology grounded in Systems Modeling Language and HFGT, the approach enables integration of economic and governance systems to support sustainable watershed management.

Subject: Systems and Control

Publish: 2026-03-02 14:49:04 UTC

#7 PAC Finite-Time Safety Guarantees for Stochastic Systems with Unknown Disturbance Distributions [PDF] [Copy] [Kimi] [REL]

Authors: Taoran Wu, Dominik Wagner, C. -H. Luke Ong, Bai Xue

We investigate the problem of establishing finite-time probabilistic safety guarantees for discrete-time stochastic dynamical systems subject to unknown disturbance distributions, using barrier certificate methods. Our approach develops a data-driven safety certification framework that relies only on a finite collection of independent and identically distributed (i.i.d.) disturbance samples. Within this framework, we propose a certification procedure such that, with confidence at least $1-δ$ over the sampled disturbances, if the output of the certification procedure is accepted, the probability that the system remains within a prescribed safe set over a finite horizon is at least $1-ε$. A key challenge lies in formally characterizing the probably approximately correct (PAC) generalization behavior induced by finite samples. To address this, we derive PAC generalization bounds using tools from VC dimension, scenario optimization, and Rademacher complexity. These results illuminate the fundamental trade-offs between sample size, model complexity, and safety tolerance, providing both theoretical insight and practical guidance for designing reliable, data-driven safety certificates in discrete-time stochastic systems.

Subject: Systems and Control

Publish: 2026-03-02 14:31:28 UTC

#8 ScreenAnt: Transparent On-Screen Antennas for 6G [PDF] [Copy] [Kimi] [REL]

Authors: Shun Zhuge, Qing Wang

6G will require on-device antenna systems to operate at ultra-high frequency bands, achieve robust beamforming on the compact user devices, and be blockage-robust. Conventional edge-mounted antennas on devices have limited apertures, suffer from the 'death grip' caused by user-induced blockage, and have poor scalability at mmWave and sub-THz bands. To address these issues, motivated by the rapid evolution of transparent materials and antennas, we propose ScreenAnt in this work--which integrates a transparent antenna array onto the screens of future mobile devices. Specifically, we propose using a transparent on-screen uniform planar array and develop a framework to model its electromagnetic property, spatial configuration, and blockage robustness under realistic user-induced blockage. We also design a gradient-ascent-based algorithm to efficiently optimize power and phase control of on-screen antennas to maximize ScreenAnt's spectral efficiency. Our thorough simulations show that the proposed ScreenAnt can increase the uplink spectral efficiency by over 50% compared to edge-mounted antennas at 28 GHz, and by more than 150% at 300 GHz. ScreenAnt also demonstrates strong robustness against user-induced blockage, paving the way for practical and high-capacity 6G user device designs.

Subject: Signal Processing

Publish: 2026-03-02 14:24:52 UTC

#9 Dynamic Connectivity and Local Frequency Strength under Stochastic Variations [PDF] [Copy] [Kimi] [REL]

Authors: Bruno Pinheiro, Daniel Dotta

This paper introduces a novel metric, termed the Generalized Fiedler Vector (GFV), to evaluate the \textit{dynamic connectivity} in power systems. The proposed metric leverages the network connectivity, represented by the system Laplacian matrix, together with the nodal inertia distribution, following a formulation previously developed by the first author. By capturing the interplay between system topology and dynamic properties, the GFV provides valuable insights for the optimal siting of stochastic generation to mitigate its impact on local and system-wide frequency variability. The effectiveness of the proposed approach is demonstrated through Monte Carlo simulations performed on the IEEE 68-bus test system.

Subject: Systems and Control

Publish: 2026-03-02 14:22:35 UTC

#10 Guaranteed Image Classification via Goal-oriented Joint Semantic Source and Channel Coding [PDF] [Copy] [Kimi] [REL]

Authors: Wenchao Wu, Min Qiu, Yansha Deng, Jinhong Yuan

To enable critical applications such as remote diagnostics, image classification must be guaranteed under bandwidth constraints and unreliable wireless channels through joint source and channel coding (JSCC) design. However, most existing JSCC methods focus on minimizing image distortion, implicitly assuming that all image regions contribute equally to classification performance, thereby overlooking their varying importance for the task. In this paper, we propose a goal-oriented joint semantic source and channel coding (G-JSSCC) framework that applies \emph{various} levels of source coding compression and channel coding protection across image regions based on their semantic importance. Specifically, we design a semantic information extraction method that identifies and ranks various image regions based on their contributions to classification, where the contribution is measured by the shapely value from explainable artificial intelligence (AI). Based on that, we design a semantic source coding and a semantic channel coding method, which allocates higher-quality compression and stronger error protection to image regions of great semantic importance. In addition, we define a new metric, termed coding efficiency, to evaluate the effectiveness of the source and channel coding in the classification task. Simulations show that our proposed G-JSSCC framework improves classification probability by 2.70 times, reduces transmission cost by 38%, and enhances coding efficiency by 5.91 times, compared to the benchmark scheme using uniform compression and an idealized channel code to uniformly protect the whole image.

Subject: Image and Video Processing

Publish: 2026-03-02 13:51:12 UTC

#11 Plug-and-play forward backward algorithm to restore Landsat images: A preliminary step to uncover the history of surface waters [PDF] [Copy] [Kimi] [REL]

Authors: Pierre Audisio, Barbara Belletti, Nelly Pustelnik

The temporal and spatial analysis of river dynamics is a key factor for studying and understanding human impacts on floodplains. To assess the changes taking place, it is necessary to have high-resolution images with a large spatial coverage and a high temporal revisit frequency over the long term. Satellite imagery meets several of these criteria. For instance, Sentinel data provide high-resolution images but only after 2015. Therefore, to study water surface evolution prior to this date, it is necessary to rely on other satellite images such as Landsat, which offers longer historical coverage, albeit with lower spatial resolution. In this study, we aim to increase the spatial resolution of Landsat data from 30 to 10 meters (resolution of Sentinel images). To achieve this goal, we develop an innovative single image super-resolution method based on a plug-and-play approach.

Subject: Signal Processing

Publish: 2026-03-02 13:49:40 UTC

#12 Multiresolution Adaptive Block-Coordinate Forward-Backward for Image Reconstruction [PDF] [Copy] [Kimi] [REL]

Authors: Edgar Desainte-Maréville, Marion Foare, Paulo Gonçalves, Nelly Pustelnik, Elisa Riccietti

Classical first-order optimization methods for imaging inverse problems scale poorly with image resolution. Wavelet based multilevel strategies can accelerate convergence under strong blur, but their fixed coarse-to-fine schedules lose effectiveness in moderate-blur or noise-dominated regimes. In this work, we propose an adaptive multiresolution block coordinate Forward-Backward algorithm for image restoration. Multiresolution block selection is driven by the local magnitude of the proximal update via a stochastic non-smooth Gauss-Southwell rule applied to the wavelet decomposition of the image. This adaptive selection strategy dynamically balances updates across scales, emphasizing coarse or fine blocks according to the degradation regime. As a result, the proposed method automatically adapts to varying blur and noise levels without relying on a predefined hierarchical update scheme.

Subjects: Signal Processing , Optimization and Control

Publish: 2026-03-02 13:41:20 UTC

#13 Quantum-PROBE: Rydberg Atomic Receiver-Based Multi-AoA Estimation with RF Lens [PDF] [Copy] [Kimi] [REL]

Authors: Hong-Bae Jeon, Kaibin Huang, Chan-Byoung Chae

This paper presents the Quantum-Power pROfile Based Estimation (PROBE) framework, a Rydberg Atomic Receiver (RARE)-based multi-user angle-of-arrival (AoA) estimation approach equipped with a radio-frequency (RF) lens front end. We establish a physics-consistent analytical model showing that magnitude-only RARE measurements, processed via the beam-propagation method (BPM) and snapshot-wise power accumulation, can be rigorously characterized as a nonnegative superposition of AoA-dependent, lens-induced spatial power profiles. This formulation reveals a structured and interpretable power-domain dictionary that enables multi-user AoA recovery without explicit phase reconstruction. Building on this foundation, we develop two complementary recovery strategies: (i) a principled non-negative least absolute shrinkage and selection operator (NN-LASSO)-based solver that estimates a sparse nonnegative angular representation via an accelerated proximal-gradient method followed by cluster-based AoA decoding, and (ii) a low-complexity successive interference cancellation (SIC) algorithm that iteratively identifies and removes dominant power-profile components through cosine-similarity matching. Simulation results demonstrate that the proposed Quantum-PROBE framework consistently outperforms representative RARE- and RF-based benchmarks across diverse system configurations, while offering a clear accuracy-complexity tradeoff between the NN-LASSO and SIC variants for practical quantum sensing deployments.

Subject: Signal Processing

Publish: 2026-03-02 13:34:51 UTC

#14 Critical Clearing Time Enhancement of Droop-Controlled Grid-Forming Inverters with Adaptive Function-Based Parameters [PDF] [Copy] [Kimi] [REL]

Authors: Dewan Mahnaaz Mahmud, Vinu Thomas, Bogdan Marinescu, Mickaël Hilairet

With the increasing penetration of renewable energy sources, grid-forming (GFM) inverters are becoming essential for voltage and frequency regulation. However, the transient stability of GFM inverter is critically affected by the current limiters that are embedded with the standard control schemes. This paper proposes a novel adaptive function to enhance the transient stability of droop-controlled GFM inverters. The proposed method autonomously adjusts the active power reference and the droop gain based on the terminal voltage of the inverter. Also, the acceleration of the phase angle is prevented, leading to the maximization of critical clearing time (CCT). The proposed method is benchmarked against two state-of-the-art GFM inverter CCT enhancement methods. Effectiveness of the proposed method is validated through electromagnetic transient (EMT) simulations in MATLAB/Simulink\textsuperscript{\textregistered}.

Subject: Systems and Control

Publish: 2026-03-02 13:04:22 UTC

#15 Near-Field Focusing Operators for Planar Multi-Static Microwave Imaging Using Back-Projection in the Spatial Domain [PDF] [Copy] [Kimi] [REL]

Authors: Matthias M. Saurer, Marius Brinkmann, Han Na, Quanfeng Wang, Thomas Eibert

Based on a plane-wave expansion of the observation data in quasi-planar multi-static scattering scenarios, an improved formalism for image creation utilizing back-projection in the spatial domain is derived. The underlying integral expressions for different focusing operators are derived analytically leading to magnitude correction factors, which are mostly relevant for reconstructing microwave images when the distance from the scattering object to the aperture plane is small. It is shown that the derived imaging procedure is superior to the traditional back-projection only compensating the phase delay of the measurement signals and validate our findings based on simulated as well as measured data. Since the derived focusing operators correspond to a low-pass filtering of the spatial images, the resulting modified multi-static back-projection algorithms effectively suppress imaging artifacts as well.

Subject: Image and Video Processing

Publish: 2026-03-02 12:44:28 UTC

#16 Control Plane for Reconfigurable Intelligent Surfaces [PDF] [Copy] [Kimi] [REL]

Authors: Fabio Saggese, Victor Croisfelt, Kyriakos Stylianopoulos, George C. Alexandropoulos, Petar Popovski

Research on reconfigurable intelligent surfaces (RISs) has predominantly focused on purely physical (PHY)-layer aspects, particularly, on how signals are dynamically shaped by a controllable wireless propagation environment. However, integrating RISs as system-level network elements requires the development of an RIS-compatible control plane. In this article, we explore design options for such a control plane across two key dimensions: i) the allocation of spectral resources for the control plane (in- or out-of-band), and ii) the rate selection for the data plane (multiplexing or diversity). While our analysis is necessarily simplified, it reveals the fundamental trade-offs inherent in these design choices, which are crucial for integrating RIS technology into future networks.

Subject: Signal Processing

Publish: 2026-03-02 12:19:04 UTC

#17 Goal-Oriented Access Optimization for ISAC-Enabled Digital Twins [PDF] [Copy] [Kimi] [REL]

Authors: Fabio Saggese, Federico Chiariotti, Shashi Raj Pandey, Henk Wymeersch, Luca Sanguinetti, Petar Popovski

The digital twins (DTs) of physical systems and environments enable real-time remote tracking, control, and learning, but require low-latency transmission of updates and sensory data to maintain alignment with their physical counterparts. In this context, augmenting sensory data with the network's own integrated sensing and communication (ISAC)capabilities can expand the DT's awareness of the environment by allowing it to precisely non-radar locate measurements from mobile nodes. However, this integration increases the complexity of the communication system, and can only be supported through intelligent resource allocation and access optimization. In this work, we propose a two-step goal-oriented approach to solve this problem: we design a push-based random access in which sensors with a high Value of Information (VoI) inform the network of their access requirements, followed by a pull-based scheduled transmission of the actual sensory data. This design allows to combine the ISAC and reliable transmission requirements and maximize the VoI of the information delivered to the DT, significantly outperforming existing schemes.

Subject: Signal Processing

Publish: 2026-03-02 12:06:49 UTC

#18 Detection of weak signals under arbitrary noise distributions [PDF²] [Copy] [Kimi¹] [REL]

Authors: J. Zschetzsche, M. Weimar, O. Lang, S. Schuster, A. Haberl, S. Schertler, B. Lehner, J. Reisinger, M. Huemer, S. Rotter

Detecting weak signals buried in complex, non-Gaussian noise is a fundamental challenge in science and engineering, with applications ranging from radar systems and communications to industrial monitoring and gravitational wave detection. The Rao detector, a key concept in this domain, achieves asymptotically optimal performance as the number of measurements increases, but requires precise knowledge of the data's statistical properties, often relying on simplified noise models. We propose a hybrid framework that combines a lightweight neural network with the Rao detection framework to address this limitation. The neural network, trained on noise-only data, learns the optimal multivariate nonlinearity, transforming noisy data to enhance signal detectability. The newly introduced LRao detector then fully extracts the signal information, achieving asymptotically optimal performance even under challenging noise conditions. Validated on both simulated and real-world magnetic sensor data, our method significantly outperforms conventional approaches. By bridging data-driven techniques with model-based signal processing, it offers a robust and interpretable solution for signal detection across diverse applications.

Subjects: Signal Processing , Statistics Theory

Publish: 2026-03-02 11:02:27 UTC

#19 Cramer-Rao Bounds for Target Parameter Estimation in a Bi-Static IRS-Assisted Radar Configuration [PDF] [Copy] [Kimi] [REL]

Authors: Sanjeeva Reddy S, Vinod Veera Reddy

Non-Line-of-Sight (NLoS) sensing and detection of low-observable (stealth) targets are challenging for conventional radar due to blockage and severe propagation loss. Intelligent Reflective Surface (IRS)-assisted radar can extend the field-of-view (FOV), but common architectures rely on the four-hop radar--IRS--target--IRS--radar link, whose attenuation limits estimation performance. This paper proposes an alternative architecture, that exploits the target-scattered component received at a spatially separated IRS and redirected back to a mono-static radar receiver. The geometry provides bi-static/multi-static-like diversity using a passive panel, while retaining a mono-static front-end and avoiding inter-node time synchronization concerns. We develop a signal model for the proposed configuration and recast it into a compact, parameterized form that is suitable for angle estimation. Using this reformulation, we derive the Fisher Information Matrix and the associated Cramér--Rao Lower Bounds (CRLB) for target azimuth and elevation angles with respect to the IRS. Numerical evaluations quantify the impact of various signal-model parameters on the achievable bounds. These results provide insights on the parameter-estimation limits within the FOV against SNR, snapshots and IRS elements.

Subject: Signal Processing

Publish: 2026-03-02 09:47:58 UTC

#20 Predictive Lane-Change and Routing Coordination in Bus-Priority Mixed Traffic Corridors [PDF] [Copy] [Kimi] [REL]

Authors: Tanlu Liang, Ting Bai, Andreas A. Malikopoulos

In this paper, we investigate the coordination of vehicle maneuvers in mixed-traffic corridors where connected and automated vehicles, human-driven vehicles, and buses interact under dedicated bus lane operations. We develop a segment-based network coordination framework that jointly optimizes lane-change and routing decisions of connected and automated vehicles to improve dedicated lane utilization while preserving bus priority. The proposed framework incorporates a predictive bus-protection mechanism that restricts vehicle access to protected lane segments within a monitoring horizon, together with a utility-driven lane-change strategy that accounts for anticipated travel time gains, downstream routing feasibility, and lane-change stability. By explicitly coupling network-level routing decisions with lane-level interaction control, the method proactively mitigates conflicts on dedicated lanes before congestion effects materialize. The proposed approach is evaluated through microscopic traffic simulations in SUMO using a realistic urban corridor. Simulation results demonstrate that the framework enhances bus schedule adherence and reduces average travel times for both automated and human-driven vehicles, while maintaining stable lane-change behavior without increasing maneuver frequency.

Subject: Systems and Control

Publish: 2026-03-02 08:43:06 UTC

#21 A Block Least Mean Square Method for Fiber Longitudinal Power Profile Monitoring [PDF] [Copy] [Kimi] [REL]

Authors: Paolo Serena, Chiara Lasagni, Alberto Bononi, Fabien Boitier, Joana Girard-Jollet

We propose a block least mean square (LMS) algorithm to monitor the longitudinal power profile of a fiber-optic link through receiver-based digital data from a coherent detector. Compared to the benchmark least squares (LS) method, the proposed algorithm does not require large matrix inversions or batch processing, thus allowing the received data to be processed in blocks of minimum size by an overlap-save algorithm, reducing complexity and latency. We propose an efficient implementation of the method with a stochastic gradient update leveraging a key computation in the frequency domain, offering computational savings over state-of-the-art monitoring techniques. We test the proposal in different scenarios by means of numerical simulations.

Subject: Signal Processing

Publish: 2026-03-02 08:34:15 UTC

#22 Battery Discharge Modeling for Electric Vehicles: A Hybrid Physics-based Residual Learning Approach [PDF] [Copy] [Kimi] [REL]

Authors: Praharshitha Aryasomayajula, Ting Bai, Andreas A. Malikopoulos

The growing integration of electric vehicle (EV) fleets into transportation services and energy systems requires accurate modeling of battery discharge and state-of-charge (SoC) evolution to ensure reliable vehicle operation and grid coordination. Existing approaches face a trade-off between interpretable but simplified physics-based models and data-driven methods that demand large datasets and may lack physical consistency. In this paper, we propose a hybrid physics-based residual learning framework for EV battery discharge modeling. A vehicle dynamics model based on force-balance equations provides an interpretable baseline estimate of energy consumption and SoC evolution, capturing aerodynamic drag, rolling resistance, and regenerative braking. A neural network residual learner then corrects discrepancies caused by complex factors such as traffic conditions and driver behavior. Experimental results on $1,500$ trip scenarios demonstrate that the proposed approach reduces the mean absolute percentage error to approximately $0.8\%$, significantly outperforming physics-only models while preserving physical interpretability and computational efficiency.

Subject: Systems and Control

Publish: 2026-03-02 08:14:15 UTC

#23 MR-Compass: Inertial Navigation-Driven Motion Correction for Brain MRI [PDF] [Copy] [Kimi] [REL]

Authors: Musa Tunc Arslan, Fatih Calakli, Joshua Auger, Hongli Fan, Alan J Macy, Simon K Warfield

Inertial sensors can track object kinematics, however, unbounded drift from integrating noisy signals makes them impractical for MRI motion correction at millimeter resolution and minute-long scans. We introduce MR-Compass, which exploits the MRI system's static magnetic and gravitational fields to estimate 3-DOF orientation at 2 kHz directly, without integration, eliminating random-walk. The remaining 3-DOF translation is recovered via phase correlation from the MRI data. We experimentally validate the efficacy of the method retrospectively using a 3D radial koosh-ball sequence and prospectively using 2D EPI fMRI during large volunteer motions. MR-Compass followed by phase-correlation achieved a mean accuracy of 0.6$^o$ and 0.4 pixels across all experiments. Image quality improved when motion correction was applied in all volunteer scans for both retrospective and prospective correction cases. MR-Compass was effective in measuring head motion in the MRI scanner with high accuracy at unprecedented sample rates, and enabled both retrospective and prospective reconstruction to improve image quality by aligning the k-space data appropriately and by reducing the motion related artifacts.

Subjects: Image and Video Processing , Signal Processing , Medical Physics

Publish: 2026-03-02 08:13:03 UTC

#24 Investigating Group Relative Policy Optimization for Diffusion Transformer based Text-to-Audio Generation [PDF] [Copy] [Kimi] [REL]

Authors: Yi Gu, Yanqing Liu, Chen Yang, Sheng Zhao

Text-to-audio (T2A) generation has advanced considerably in recent years, yet existing methods continue to face challenges in accurately rendering complex text prompts, particularly those involving intricate audio effects, and achieving precise text-audio alignment. While prior approaches have explored data augmentation, explicit timing conditioning, and reinforcement learning, overall synthesis quality remains constrained. In this work, we experiment with reinforcement learning to further enhance T2A generation quality, building on diffusion transformer (DiT)-based architectures. Our method first employs a large language model (LLM) to generate high-fidelity, richly detailed audio captions, substantially improving text-audio semantic alignment, especially for ambiguous or underspecified prompts. We then apply Group Relative Policy Optimization (GRPO), a recently introduced reinforcement learning algorithm, to fine-tune the T2A model. Through systematic experimentation with diverse reward functions (including CLAP, KL, FAD, and their combinations), we identify the key drivers of effective RL in audio synthesis and analyze how reward design impacts final audio quality. Experimental results demonstrate that GRPO-based fine-tuning yield substantial gains in synthesis fidelity and prompt adherence.

Subjects: Audio and Speech Processing , Sound

Publish: 2026-03-02 07:44:55 UTC

#25 A Unified Fractional Spectral Framework for Spatiotemporal Graph Signals: Bi-Fractional Transform and Geodesic Coupling [PDF] [Copy] [Kimi] [REL]

Authors: Mingzhi Wang, Manjun Cui, Feiyue Zhao, Yangfan He, Zhichao Zhang

Graph signal processing extends spectral analysis to data supported on irregular domains. Existing fractional transforms for two-dimensional graph signals, including the two-dimensional graph fractional Fourier transform (GFRFT), typically impose a shared fractional order across dimensions, which limits adaptivity to heterogeneous spatiotemporal spectra. To address this limitation, we propose the two-dimensional graph bi-fractional Fourier transform, which assigns independent fractional orders to the factor graphs of a Cartesian product, enabling decoupled spectral control while preserving separability, unitarity, and invertibility. To further resolve the basis ambiguity in temporal fractional analysis, we develop a geodesic-coupled GFRFT by constructing a coupling path along the principal geodesic on the unitary manifold, thereby unifying graph-induced and discrete temporal bases with guaranteed unitarity and a closed-form inverse. Building on these transforms, we derive a differentiable Wiener-type filtering framework with a hybrid optimization strategy: the fractional orders are learned end-to-end from data, while the coupling parameter is fixed as a structural regularizer. Experiments on real-world time-varying graph datasets and dynamic image restoration tasks demonstrate consistent gains over state-of-the-art fractional transforms and competitive learning-based baselines.

Subject: Signal Processing

Publish: 2026-03-02 05:49:32 UTC