2025-04-03 | | Total: 58
Mild traumatic brain injuries (mTBI) are a highly prevalent condition with heterogeneous outcomes between individuals. A key factor governing brain tissue deformation and the risk of mTBI is the rotational kinematics of the head. Instrumented mouthguards are a widely accepted method for measuring rotational head motions, owing to their robust sensor-skull coupling. However, wearing mouthguards is not feasible in all situations, especially for long-term data collection. Therefore, alternative wearable devices are needed. In this study, we present an improved design and data processing scheme for an instrumented headband. Our instrumented headband utilizes an array of inertial measurement units (IMUs) and a new data-processing scheme based on continuous wavelet transforms to address sources of error in the IMU measurements. The headband performance was evaluated in the laboratory on an anthropomorphic test device, which was impacted with a soccer ball to replicate soccer heading. When comparing the measured peak rotational velocities (PRV) and peak rotational accelerations (PRA) between the reference sensors and the headband for impacts to the front of the head, the correlation coefficients (r) were 0.80 and 0.63, and the normalized root mean square error (NRMSE) values were 0.20 and 0.28, respectively. However, when considering all impact locations, r dropped to 0.42 and 0.34 and NRMSE increased to 0.5 and 0.41 for PRV and PRA, respectively. This new instrumented headband improves upon previous headband designs in reconstructing the rotational head kinematics resulting from frontal soccer ball impacts, providing a potential alternative to instrumented mouthguards.
Spiking Nonlinear Opinion Dynamics (S-NOD) is an excitable decision-making model inspired by the spiking dynamics of neurons. S-NOD enables the design of agile decision-making that can rapidly switch between decision options in response to a changing environment. In S-NOD, decisions are represented by continuous time, yet discrete, opinion spikes. Here, we extend previous analysis of S-NOD and explore its potential as a nonlinear controller with a tunable balance between robustness and responsiveness. We identify and provide necessary conditions for the bifurcation that determines the onset of periodic opinion spiking. We leverage this analysis to characterize the tunability of the input-output threshold for opinion spiking as a function of the model basal sensitivity and the modulation of opinion spiking frequency as a function of input magnitude past threshold. We conclude with a discussion on S-NOD as a new neuromorphic control block and its extension to distributed spiking controllers.
Radar-based vital sign monitoring (VSM) systems have become valuable for non-contact health monitoring by detecting physiological activities, such as respiration and heartbeat, remotely. However, the conventional phased array used in VSM is vulnerable to privacy breaches, as an eavesdropper can extract sensitive vital sign information by analyzing the reflected radar signals. In this paper, we propose a novel approach to protect privacy in radar-based VSM by modifying the radar transmitter hardware, specifically by strategically selecting the transmit antennas from the available antennas in the transmit array. By dynamically selecting which antennas connect or disconnect to the radio frequency chain, the transmitter introduces additional phase noise to the radar echoes, generating false frequencies in the power spectrum of the extracted phases at the eavesdropper's receiver. The antenna activation pattern is designed to maximize the variance of the phases introduced by antenna selection, which effectively makes the false frequencies dominate the spectrum, obscuring the actual vital sign frequencies. Meanwhile, the authorized receiver, having knowledge of the antenna selection pattern, can compensate for the phase noise and accurately extract the vital signs. Numerical experiments are conducted to validate the effectiveness of the proposed approach in enhancing privacy while maintaining vital sign monitoring.
Non-collocated vibration absorption (NCVA) concept using delayed resonator for in-situ tuning is analyzed and experimentally validated. There are two critical contributions of this work. One is on the scalable analytical pathway for verifying the concept of resonant substructure as the basis of the ideal vibration absorption. The second is to experimentally validate the spatial and spectral tunability of NCVA structures for the first time. For both novelties arbitrarily large dimensions of interconnected mass-spring-damper chains are considered. Following the state of the art on NCVA, control synthesis is performed over the resonant substructure comprising the delayed resonator and a part of the primary structure involved in the vibration absorption. The experimental validation of the proposed NCVA concept is performed on a mechatronic setup with three interconnected cart-bodies. Based on the spectral analysis, an excitation frequency is selected for which a stable vibration suppression can be achieved sequentially for all the three bodies, one collocated and two non-collocated. The experimental results closely match the simulations for complete vibration suppression at the targeted bodies, and thus validating the crucial spatial tunability characteristic as well as the traditional spectral tuning.
Certifying safety in dynamical systems is crucial, but barrier certificates - widely used to verify that system trajectories remain within a safe region - typically require explicit system models. When dynamics are unknown, data-driven methods can be used instead, yet obtaining a valid certificate requires rigorous uncertainty quantification. For this purpose, existing methods usually rely on full-state measurements, limiting their applicability. This paper proposes a novel approach for synthesizing barrier certificates for unknown systems with latent states and polynomial dynamics. A Bayesian framework is employed, where a prior in state-space representation is updated using input-output data via a targeted marginal Metropolis-Hastings sampler. The resulting samples are used to construct a candidate barrier certificate through a sum-of-squares program. It is shown that if the candidate satisfies the required conditions on a test set of additional samples, it is also valid for the true, unknown system with high probability. The approach and its probabilistic guarantees are illustrated through a numerical simulation.
Real-time optimal control remains a fundamental challenge in robotics, especially for nonlinear systems with stringent performance requirements. As one of the representative trajectory optimization algorithms, the iterative Linear Quadratic Regulator (iLQR) faces limitations due to their inherently sequential computational nature, which restricts the efficiency and applicability of real-time control for robotic systems. While existing parallel implementations aim to overcome the above limitations, they typically demand additional computational iterations and high-performance hardware, leading to only modest practical improvements. In this paper, we introduce Quattro, a transformer-accelerated iLQR framework employing an algorithm-hardware co-design strategy to predict intermediate feedback and feedforward matrices. It facilitates effective parallel computations on resource-constrained devices without sacrificing accuracy. Experiments on cart-pole and quadrotor systems show an algorithm-level acceleration of up to 5.3× and 27× per iteration, respectively. When integrated into a Model Predictive Control (MPC) framework, Quattro achieves overall speedups of 2.8× for the cart-pole and 17.8× for the quadrotor compared to the one that applies traditional iLQR. Transformer inference is deployed on FPGA to maximize performance, achieving up to 27.3× speedup over commonly used computing devices, with around 2 to 4× power reduction and acceptable hardware overhead.
The increasing global prevalence of mental disorders, such as depression and PTSD, requires objective and scalable diagnostic tools. Traditional clinical assessments often face limitations in accessibility, objectivity, and consistency. This paper investigates the potential of multimodal machine learning to address these challenges, leveraging the complementary information available in text, audio, and video data. Our approach involves a comprehensive analysis of various data preprocessing techniques, including novel chunking and utterance-based formatting strategies. We systematically evaluate a range of state-of-the-art embedding models for each modality and employ Convolutional Neural Networks (CNNs) and Bidirectional LSTM Networks (BiLSTMs) for feature extraction. We explore data-level, feature-level, and decision-level fusion techniques, including a novel integration of Large Language Model (LLM) predictions. We also investigate the impact of replacing Multilayer Perceptron classifiers with Support Vector Machines. We extend our analysis to severity prediction using PHQ-8 and PCL-C scores and multi-class classification (considering co-occurring conditions). Our results demonstrate that utterance-based chunking significantly improves performance, particularly for text and audio modalities. Decision-level fusion, incorporating LLM predictions, achieves the highest accuracy, with a balanced accuracy of 94.8% for depression and 96.2% for PTSD detection. The combination of CNN-BiLSTM architectures with utterance-level chunking, coupled with the integration of external LLM, provides a powerful and nuanced approach to the detection and assessment of mental health conditions. Our findings highlight the potential of MMML for developing more accurate, accessible, and personalized mental healthcare tools.
Compounding error, where small prediction mistakes accumulate over time, presents a major challenge in learning-based control. For example, this issue often limits the performance of model-based reinforcement learning and imitation learning. One common approach to mitigate compounding error is to train multi-step predictors directly, rather than relying on autoregressive rollout of a single-step model. However, it is not well understood when the benefits of multi-step prediction outweigh the added complexity of learning a more complicated model. In this work, we provide a rigorous analysis of this trade-off in the context of linear dynamical systems. We show that when the model class is well-specified and accurately captures the system dynamics, single-step models achieve lower asymptotic prediction error. On the other hand, when the model class is misspecified due to partial observability, direct multi-step predictors can significantly reduce bias and thus outperform single-step approaches. These theoretical results are supported by numerical experiments, wherein we also (a) empirically evaluate an intermediate strategy which trains a single-step model using a multi-step loss and (b) evaluate performance of single step and multi-step predictors in a closed loop control setting.
Hidden Markov Models (HMMs) provide a rigorous framework for inference in dynamic environments. In this work, we study the alpha-HMM algorithm motivated by the optimal online filtering formulation in settings where the true state evolves as a Markov chain with equal exit probabilities. We quantify the dynamics of the algorithm in stationary environments, revealing a trade-off between inference and adaptation, showing how key parameters and the quality of observations affect performance. Comprehensive theoretical analysis on the nonlinear dynamical system that governs the evolution of the log-belief ratio over time and numerical experiments demonstrate that the proposed approach effectively balances adaptation and inference performance.
There is an increasing need for effective control of systems with complex dynamics, particularly through data-driven approaches. System Level Synthesis (SLS) has emerged as a powerful framework that facilitates the control of large-scale systems while accounting for model uncertainties. SLS approaches are currently limited to linear systems and time-varying linear control policies, thus limiting the class of achievable control strategies. We introduce a novel closed-loop parameterization for time-varying affine control policies, extending the SLS framework to a broader class of systems and policies. We show that the closed-loop behavior under affine policies can be equivalently characterized using past system trajectories, enabling a fully data-driven formulation. This parameterization seamlessly integrates affine policies into optimal control problems, allowing for a closed-loop formulation of general Model Predictive Control (MPC) problems. To the best of our knowledge, this is the first work to extend SLS to affine policies in both model-based and data-driven settings, enabling an equivalent formulation of MPC problems using closed-loop maps. We validate our approach through numerical experiments, demonstrating that our model-based and data-driven affine SLS formulations achieve performance on par with traditional model-based MPC.
This paper presents a novel method to optimize thermal balance in parabolic trough collector (PTC) plants. It uses a market-based system to distribute flow among loops combined with an artificial neural network (ANN) to reduce computation and data requirements. This auction-based approach balances loop temperatures, accommodating varying thermal losses and collector efficiencies. Validation across different thermal losses, optical efficiencies, and irradiance conditions-sunny, partially cloudy, and cloudy-show improved thermal power output and intercept factors compared to a no-allocation system. It demonstrates scalability and practicality for large solar thermal plants, enhancing overall performance. The method was first validated through simulations on a realistic solar plant model, then adapted and successfully tested in a 50 MW solar trough plant, demonstrating its advantages. Furthermore, the algorithms have been implemented, commissioned, and are currently operating in 13 commercial solar trough plants.
Multivariable parametric models are critical for designing, controlling, and optimizing the performance of engineered systems. The main objective of this paper is to develop a parametric identification strategy that delivers accurate and physically relevant models of multivariable systems using time-domain data. The introduced approach adopts an additive model structure, offering a parsimonious and interpretable representation of many physical systems, and employs a refined instrumental variable-based estimation algorithm. The developed identification method enables the estimation of parametric continuous-time additive models and is applicable to both open and closed-loop controlled systems. The performance of the estimator is demonstrated through numerical simulations and experimentally validated on a flexible beam system.
Identifying controlled safety invariant sets (CSISs) is essential in safety-critical applications. This paper tackles the problem of identifying CSISs for black-box discrete-time systems, where the model is unknown and only limited simulation data is accessible. Traditionally, a CSIS is defined as a subset of a safe set, encompassing initial states for which a control input exists that keeps the system within the set at the next time step-this is referred to as the one-step invariance property. However, the requirement for one-step invariance can be equivalently translated into a stricter condition of ``always-invariance'', meaning that there exist control inputs capable of keeping the system within this set indefinitely. Such a condition may prove overly stringent or impractical for black-box systems, where predictions can become unreliable beyond a single time step or a limited number of finite time steps. To overcome the challenges posed by black-box systems, we reformulate the one-step invariance property in a ``Probably Approximately Correct'' (PAC) sense. This approach allows us to assess the probability that a control input exists to keep the system within the CSIS at the next time step, with a predefined level of confidence. If the system successfully remains within the set at the next time step, we can then reapply the invariance evaluation to the new state, thereby facilitating a recursive assurance of invariance. Our method employs barrier functions and scenario optimization, resulting in a linear programming method to estimate PAC CSISs. Finally, the effectiveness of our approach is demonstrated on several examples.
Spike sorting is a fundamental step in analyzing extracellular recordings, enabling the isolation of individual neuronal activity, yet it remains a challenging problem due to overlapping signals and recording instabilities, including electrode drift. While numerous algorithms have been developed to address these challenges, many struggle to balance accuracy and computational efficiency, limiting their applicability to largescale datasets. In response, we introduce SpikeSift, a novel spike sorting algorithm designed to mitigate drift by partitioning recordings into short, relatively stationary segments, with spikes subsequently sorted within each. To preserve neuronal identity across segment boundaries, a computationally efficient alignment process merges clusters without relying on continuous trajectory estimation. In contrast to conventional methods that separate spike detection from clustering, SpikeSift integrates these processes within an iterative detect-andsubtract framework, enhancing clustering accuracy while maintaining computational efficiency. Evaluations on intracellularly validated datasets and biophysically realistic MEArec simulations confirm that SpikeSift maintains high sorting accuracy even in the presence of electrode drift, providing a scalable and computationally efficient solution for large-scale extracellular recordings
We establish mathematical bounds on the chain, ABCD and immittance matrices of a multiconductor transmission line, based on the Telegrapher's equation. Closed-form expressions for those matrices are also presented. Existing results that hold on the imaginary axis are extended to the complex plane, without reliance on a simultaneous diagonalizability assumption that is ubiquitous in the literature. Therefore, the results remain valid even when line symmetry breaks down, as in the case of electrical faults. The system-theoretic properties established here are of general relevance to control, power systems, and signal processing involving multiconductor transmission lines.
Scaled Relative Graphs (SRGs) provide a novel graphical frequency domain method for the analysis of nonlinear systems. In this paper, we use the restriction of the SRG to particular input spaces to compute frequency-dependent gain bounds for incrementally stable nonlinear systems. This leads to a nonlinear (NL) generalization of the Bode diagram, where the sinusoidal, harmonic, and subharmonic inputs are considered separately. When applied to the analysis of the NL loop transfer and sensitivity, we define a notion of bandwidth for both the open-loop and closed-loop, compatible with the LTI definitions. We illustrate the power of our method on the analysis of a DC motor with a parasitic nonlinearity, verifying our results in simulations.
Nuclear instance segmentation plays a vital role in disease diagnosis within digital pathology. However, limited labeled data in pathological images restricts the overall performance of nuclear instance segmentation. To tackle this challenge, we propose a novel data augmentation framework Instance Migration Diffusion Model (IM-Diffusion), IM-Diffusion designed to generate more varied pathological images by constructing diverse nuclear layouts and internuclear spatial relationships. In detail, we introduce a Nuclear Migration Module (NMM) which constructs diverse nuclear layouts by simulating the process of nuclear migration. Building on this, we further present an Internuclear-regions Inpainting Module (IIM) to generate diverse internuclear spatial relationships by structure-aware inpainting. On the basis of the above, IM-Diffusion generates more diverse pathological images with different layouts and internuclear spatial relationships, thereby facilitating downstream tasks. Evaluation on the CoNSeP and GLySAC datasets demonstrate that the images generated by IM-Diffusion effectively enhance overall instance segmentation performance. Code will be made public later.
Accurate segmentation of lesions plays a critical role in medical image analysis and diagnosis. Traditional segmentation approaches that rely solely on visual features often struggle with the inherent uncertainty in lesion distribution and size. To address these issues, we propose STPNet, a Scale-aware Text Prompt Network that leverages vision-language modeling to enhance medical image segmentation. Our approach utilizes multi-scale textual descriptions to guide lesion localization and employs retrieval-segmentation joint learning to bridge the semantic gap between visual and linguistic modalities. Crucially, STPNet retrieves relevant textual information from a specialized medical text repository during training, eliminating the need for text input during inference while retaining the benefits of cross-modal learning. We evaluate STPNet on three datasets: COVID-Xray, COVID-CT, and Kvasir-SEG. Experimental results show that our vision-language approach outperforms state-of-the-art segmentation methods, demonstrating the effectiveness of incorporating textual semantic knowledge into medical image analysis. The code has been made publicly on https://github.com/HUANGLIZI/STPNet.
In integrated sensing and communication (ISAC) systems, pilot signals play a crucial role in enhancing sensing performance due to their strong autocorrelation properties and high transmission power. However, conventional interleaved pilots inherently constrain the maximum unambiguous range and reduce the accuracy of channel impulse response (CIR) estimation compared to continuous orthogonal frequency-division multiple access (OFDMA) signals. To address this challenge, we propose a novel overlapped block-pilot structure for uplink OFDMA-based ISAC systems, called phase-shifted ISAC (PS-ISAC) pilot allocation. The proposed method leverages a cyclic prefix (CP)-based phase-shifted pilot design, enabling efficient multi-transmitter pilot separation at the receiver. Simulation results confirm that the proposed scheme enhances CIR separation, reduces computational complexity, and improves mean square error (MSE) performance under practical power constraints. Furthermore, we demonstrate that utilizing continuous pilot resources maximizes the unambiguous range.
While unmanned aerial vehicles (UAVs) with flexible mobility are envisioned to enhance physical layer security in wireless communications, the efficient security design that adapts to such high network dynamics is rather challenging. The conventional approaches extended from optimization perspectives are usually quite involved, especially when jointly considering factors in different scales such as deployment and transmission in UAV-related scenarios. In this paper, we address the UAV-enabled multi-user secure communications by proposing a deep graph reinforcement learning framework. Specifically, we reinterpret the security beamforming as a graph neural network (GNN) learning task, where mutual interference among users is managed through the message-passing mechanism. Then, the UAV deployment is obtained through soft actor-critic reinforcement learning, where the GNN-based security beamforming is exploited to guide the deployment strategy update. Simulation results demonstrate that the proposed approach achieves near-optimal security performance and significantly enhances the efficiency of strategy determination. Moreover, the deep graph reinforcement learning framework offers a scalable solution, adaptable to various network scenarios and configurations, establishing a robust basis for information security in UAV-enabled communications.
Achieving more powerful semantic representations and semantic understanding is one of the key problems in improving the performance of semantic communication systems. This work focuses on enhancing the semantic understanding of the text data to improve the effectiveness of semantic exchange. We propose a novel semantic communication system for text transmission, in which the semantic understanding is enhanced by coarse-to-fine processing. Especially, a dual attention mechanism is proposed to capture both the coarse and fine semantic information. Numerical experiments show the proposed system outperforms the benchmarks in terms of bilingual evaluation, sentence similarity, and robustness under various channel conditions.
We introduce behavioral inequalities as a way to model dynamical systems defined by inequalities among their variables of interest. We claim that such a formulation enables the representation of safety-aware dynamical systems, systems with bounds on disturbances, practical design limits and operational boundaries, etc. We develop a necessary and sufficient condition for the existence of solutions to such behavioral inequalities and provide a parametrization of solutions when they exist. Finally, we show the efficacy of the proposed method in two practical examples.
This paper presents an enhanced electric vehicle demand response system based on large language models, aimed at optimizing the application of vehicle-to-grid technology. By leveraging an large language models-driven multi-agent framework to construct user digital twins integrated with multidimensional user profile features, it enables deep simulation and precise prediction of users' charging and discharging decision-making patterns. Additionally, a data- and knowledge-driven dynamic incentive mechanism is proposed, combining a distributed optimization model under network constraints to optimize the grid-user interaction while ensuring both economic viability and security. Simulation results demonstrate that the approach significantly improves load peak-valley regulation and charging/discharging strategies. Experimental validation highlights the system's substantial advantages in load balancing, user satisfaction and grid stability, providing decision-makers with a scalable V2G management tool that promotes the sustainable, synergistic development of vehicle-grid integration.
The integration of Maritime Autonomous Surface Ships (MASS) into global maritime operations represents a transformative shift in the shipping industry, promising enhanced safety, efficiency, and cost-effectiveness. However, the widespread adoption of autonomous ships necessitates a robust regulatory framework and rigorous certification processes to address the unique challenges posed by these advanced technologies. This paper proposes a gradual, multi-stage approach to the certification and integration of MASS, beginning with small-scale trials in controlled environments and progressing to large-scale international operations. Key considerations include the development of reliable control systems, cybersecurity measures, sensor technologies, and redundancy mechanisms to ensure safe and efficient navigation. Additionally, the paper explores the economic and environmental implications of autonomous shipping, as well as the evolving legal frameworks for liability and compensation in the event of collisions. By adopting a cautious and methodical approach, the maritime industry can mitigate risks and pave the way for the safe and sustainable integration of autonomous ships into global trade.
The performance of deep learning-based multi-channel speech enhancement methods often deteriorates when the geometric parameters of the microphone array change. Traditional approaches to mitigate this issue typically involve training on multiple microphone arrays, which can be costly. To address this challenge, we focus on uniform circular arrays and propose the use of a spatial filter bank to extract features that are approximately invariant to geometric parameters. These features are then processed by a two-stage conformer-based model (TSCBM) to enhance speech quality. Experimental results demonstrate that our proposed method can be trained on a fixed microphone array while maintaining effective performance across uniform circular arrays with unseen geometric configurations during applications.