Processing math: 100%

Electrical Engineering and Systems Science

2024-11-21 | | Total: 76

#1 Comparative Analysis of Machine Learning and Deep Learning Models for Classifying Squamous Epithelial Cells of the Cervix [PDF] [Copy] [Kimi] [REL]

Authors: Subhasish Das, Satish K Panda, Madhusmita Sethy, Prajna Paramita Giri, Ashwini K Nanda

The cervix is the narrow end of the uterus that connects to the vagina in the female reproductive system. Abnormal cell growth in the squamous epithelial lining of the cervix leads to cervical cancer in females. A Pap smear is a diagnostic procedure used to detect cervical cancer by gently collecting cells from the surface of the cervix with a small brush and analyzing their changes under a microscope. For population-based cervical cancer screening, visual inspection with acetic acid is a cost-effective method with high sensitivity. However, Pap smears are also suitable for mass screening due to their higher specificity. The current Pap smear analysis method is manual, time-consuming, labor-intensive, and prone to human error. Therefore, an artificial intelligence (AI)-based approach for automatic cell classification is needed. In this study, we aimed to classify cells in Pap smear images into five categories: superficial-intermediate, parabasal, koilocytes, dyskeratotic, and metaplastic. Various machine learning (ML) algorithms, including Gradient Boosting, Random Forest, Support Vector Machine, and k-Nearest Neighbor, as well as deep learning (DL) approaches like ResNet-50, were employed for this classification task. The ML models demonstrated high classification accuracy; however, ResNet-50 outperformed the others, achieving a classification accuracy of 93.06%. This study highlights the efficiency of DL models for cell-level classification and their potential to aid in the early diagnosis of cervical cancer from Pap smear images.

Subjects: Image and Video Processing ; Computer Vision and Pattern Recognition

Publish: 2024-11-20 18:37:01 UTC


#2 Efficient Brain Imaging Analysis for Alzheimer's and Dementia Detection Using Convolution-Derivative Operations [PDF] [Copy] [Kimi] [REL]

Authors: Yasmine Mustafa, Mohamed Elmahallawy, Tie Luo

Alzheimer's disease (AD) is characterized by progressive neurodegeneration and results in detrimental structural changes in human brains. Detecting these changes is crucial for early diagnosis and timely intervention of disease progression. Jacobian maps, derived from spatial normalization in voxel-based morphometry (VBM), have been instrumental in interpreting volume alterations associated with AD. However, the computational cost of generating Jacobian maps limits its clinical adoption. In this study, we explore alternative methods and propose Sobel kernel angle difference (SKAD) as a computationally efficient alternative. SKAD is a derivative operation that offers an optimized approach to quantifying volumetric alterations through localized analysis of the gradients. By efficiently extracting gradient amplitude changes at critical spatial regions, this derivative operation captures regional volume variations Evaluation of SKAD over various medical datasets demonstrates that it is 6.3x faster than Jacobian maps while still maintaining comparable accuracy. This makes it an efficient and competitive approach in neuroimaging research and clinical practice.

Subjects: Image and Video Processing ; Computer Vision and Pattern Recognition ; Neural and Evolutionary Computing ; Performance

Publish: 2024-11-20 17:38:34 UTC


#3 Efficient and Physically-Consistent Modeling of Reconfigurable Electromagnetic Structures [PDF] [Copy] [Kimi] [REL]

Authors: Alexander Stutz-Tirri, Georg Schwan, Christoph Studer

Reconfigurable electromagnetic structures (REMSs), such as reconfigurable reflectarrays (RRAs) or reconfigurable intelligent surfaces (RISs), hold significant potential to improve wireless communication and sensing systems. Even though several REMS modeling approaches have been proposed in recent years, the literature lacks models that are both computationally efficient and physically consistent. As a result, algorithms that control the reconfigurable elements of REMSs (e.g., the phase shifts of an RIS) are often built on simplistic models that are inaccurate. To enable physically accurate REMS-parameter tuning, we present a new framework for efficient and physically consistent modeling of general REMSs. Our modeling method combines a circuit-theoretic approach with a new formalism that describes a REMS's interaction with the electromagnetic (EM) waves in its far-field region. Our modeling method enables efficient computation of the entire far-field radiation pattern for arbitrary configurations of the REMS reconfigurable elements once a single full-wave EM simulation of the non-reconfigurable parts of the REMS has been performed. The predictions made by the proposed framework align with the physical laws of classical electrodynamics and model effects caused by inter-antenna coupling, non-reciprocal materials, polarization, ohmic losses, matching losses, influence of metallic housings, noise from low-noise amplifiers, and noise arising in or received by antennas. In order to validate the efficiency and accuracy of our modeling approach, we (i) compare our modeling method to EM simulations and (ii) conduct a case study involving a planar RRA that enables simultaneous multiuser beam- and null-forming using a new, computationally efficient, and physically accurate parameter tuning algorithm.

Subjects: Signal Processing ; Information Theory

Publish: 2024-11-20 17:21:36 UTC


#4 Why Anticipatory Sensing Matters in Commercial ACC Systems under Cut-In Scenarios: A Perspective from Stochastic Safety Analysis [PDF] [Copy] [Kimi] [REL]

Authors: Hao Zhang, Sixu Li, Zihao Li, Mohammad Anis, Dominique Lord, Yang Zhou

This study presents an analytical solution for the vehicle state evolution of Adaptive Cruise Control (ACC) systems under cut-in scenarios, incorporating sensing delays and anticipation using the Lambert W function. The theoretical analysis demonstrates that the vehicle state evolution and the corresponding safety of ACC in cut-in situations are influenced by multiple factors, including the original leading vehicle's state, the initial conditions of the cut-in vehicle, subsequent cut-in maneuvers, sensing delays, and the ACC's anticipation capabilities. To quantitatively assess these influences, a series of numerical experiments were conducted to perform a stochastic safety analysis of ACC systems, accounting for embedded sensing delays and anticipation, using empirically calibrated control parameters from real-world data. The experiments revealed that the impact of sensing delays on ACC is multifaceted. Specifically, sensing delays negatively affect ACC stability, with the severity increasing as the delay lengthens. Furthermore, collision risk in cut-in scenarios becomes more significant with sensing delays, particularly when the cut-in vehicle is slower than the following vehicle and when cut-ins are aggressive. However, anticipation plays a crucial role in mitigating these risks. Even with a 0.6-second anticipation, collision risk can be reduced by 91% in highly adverse scenarios. Finally, both sensing delays and anticipation have effects that intensify with their duration. An anticipation period of 2 seconds effectively ensures safety in aggressive cut-in conditions, even in the presence of sensing delays.

Subject: Systems and Control

Publish: 2024-11-20 17:05:10 UTC


#5 Issues with Input-Space Representation in Nonlinear Data-Based Dissipativity Estimation [PDF] [Copy] [Kimi] [REL]

Authors: Ethan LoCicero, Alex Penne, Leila Bridgeman

In data-based control, dissipativity can be a powerful tool for attaining stability guarantees for nonlinear systems if that dissipativity can be inferred from data. This work provides a tutorial on several existing methods for data-based dissipativity estimation of nonlinear systems. The interplay between the underlying assumptions of these methods and their sample complexity is investigated. It is shown that methods based on delta-covering result in an intractable trade-off between sample complexity and robustness. A new method is proposed to quantify the robustness of machine learning-based dissipativity estimation. It is shown that this method achieves a more tractable trade-off between robustness and sample complexity. Several numerical case studies demonstrate the results.

Subjects: Systems and Control ; Optimization and Control

Publish: 2024-11-20 15:43:58 UTC


#6 Adversarial Diffusion Compression for Real-World Image Super-Resolution [PDF2] [Copy] [Kimi1] [REL]

Authors: Bin Chen, Gehui Li, Rongyuan Wu, Xindong Zhang, Jie Chen, Jian Zhang, Lei Zhang

Real-world image super-resolution (Real-ISR) aims to reconstruct high-resolution images from low-resolution inputs degraded by complex, unknown processes. While many Stable Diffusion (SD)-based Real-ISR methods have achieved remarkable success, their slow, multi-step inference hinders practical deployment. Recent SD-based one-step networks like OSEDiff and S3Diff alleviate this issue but still incur high computational costs due to their reliance on large pretrained SD models. This paper proposes a novel Real-ISR method, AdcSR, by distilling the one-step diffusion network OSEDiff into a streamlined diffusion-GAN model under our Adversarial Diffusion Compression (ADC) framework. We meticulously examine the modules of OSEDiff, categorizing them into two types: (1) Removable (VAE encoder, prompt extractor, text encoder, etc.) and (2) Prunable (denoising UNet and VAE decoder). Since direct removal and pruning can degrade the model's generation capability, we pretrain our pruned VAE decoder to restore its ability to decode images and employ adversarial distillation to compensate for performance loss. This ADC-based diffusion-GAN hybrid design effectively reduces complexity by 73% in inference time, 78% in computation, and 74% in parameters, while preserving the model's generation capability. Experiments manifest that our proposed AdcSR achieves competitive recovery quality on both synthetic and real-world datasets, offering up to 9.3× speedup over previous one-step diffusion-based methods. Code and models will be made available.

Subjects: Image and Video Processing ; Computer Vision and Pattern Recognition

Publish: 2024-11-20 15:13:36 UTC


#7 RTSR: A Real-Time Super-Resolution Model for AV1 Compressed Content [PDF1] [Copy] [Kimi] [REL]

Authors: Yuxuan Jiang, Jakub Nawała, Chen Feng, Fan Zhang, Xiaoqing Zhu, Joel Sole, David Bull

Super-resolution (SR) is a key technique for improving the visual quality of video content by increasing its spatial resolution while reconstructing fine details. SR has been employed in many applications including video streaming, where compressed low-resolution content is typically transmitted to end users and then reconstructed with a higher resolution and enhanced quality. To support real-time playback, it is important to implement fast SR models while preserving reconstruction quality; however most existing solutions, in particular those based on complex deep neural networks, fail to do so. To address this issue, this paper proposes a low-complexity SR method, RTSR, designed to enhance the visual quality of compressed video content, focusing on resolution up-scaling from a) 360p to 1080p and from b) 540p to 4K. The proposed approach utilizes a CNN-based network architecture, which was optimized for AV1 (SVT)-encoded content at various quantization levels based on a dual-teacher knowledge distillation method. This method was submitted to the AIM 2024 Video Super-Resolution Challenge, specifically targeting the Efficient/Mobile Real-Time Video Super-Resolution competition. It achieved the best trade-off between complexity and coding performance (measured in PSNR, SSIM and VMAF) among all six submissions. The code will be available soon.

Subjects: Image and Video Processing ; Computer Vision and Pattern Recognition

Publish: 2024-11-20 14:36:06 UTC


#8 IoT-Based Coma Patient Monitoring System [PDF] [Copy] [Kimi] [REL]

Authors: Hailemicael Lulseged Yimer, Hailegabriel Dereje Degefa, Marco Cristani, Federico Cunico

Continuous monitoring of coma patients is essential but challenging, especially in developing countries with limited resources, staff, and infrastructure. This paper presents a low-cost IoT-based system designed for such environments. It uses affordable hardware and robust software to monitor patients without constant internet access or extensive medical personnel. The system employs cost-effective sensors to track vital signs, including heart rate, body temperature, blood pressure, eye movement, and body position. An energy-efficient microcontroller processes data locally, synchronizing with a central server when network access is available. A locally hosted app provides on-site access to patient data, while a GSM module sends immediate alerts for critical events, even in areas with limited cellular coverage. This solution emphasizes ease of deployment, minimal maintenance, and resilience to power and network disruptions. Using open-source software and widely available hardware, it offers a scalable, adaptable system for resource-limited settings. At under $30, the system is a sustainable, cost-effective solution for continuous patient monitoring, bridging the gap until more advanced healthcare infrastructure is available.

Subjects: Systems and Control ; Networking and Internet Architecture

Publish: 2024-11-20 14:16:47 UTC


#9 Abstracted Model Reduction: A General Framework for Efficient Interconnected System Reduction [PDF] [Copy] [Kimi] [REL]

Authors: Luuk Poort, Lars A. L. Janssen, Bart Besselink, Rob H. B. Fey, Nathan van de Wouw

This paper introduces the concept of abstracted model reduction: a framework to improve the tractability of structure-preserving methods for the complexity reduction of interconnected system models. To effectively reduce high-order, interconnected models, it is usually not sufficient to consider the subsystems separately. Instead, structure-preserving reduction methods should be employed, which consider the interconnected dynamics to select which subsystem dynamics to retain in reduction. However, structure-preserving methods are often not computationally tractable. To overcome this issue, we propose to connect each subsystem model to a low-order abstraction of its environment to reduce it both effectively and efficiently. By means of a high-fidelity structural-dynamics model from the lithography industry, we show, on the one hand, significantly increased accuracy with respect to standard subsystem reduction and, on the other hand, similar accuracy to direct application of expensive structure-preserving methods, while significantly reducing computational cost. Furthermore, we formulate a systematic approach to automatically determine sufficient abstraction and reduction orders to preserve stability and guarantee a given frequency-dependent error specification. We apply this approach to the lithography equipment use case and show that the environment model can indeed be reduced by over 80\% without significant loss in the accuracy of the reduced interconnected model.

Subject: Systems and Control

Publish: 2024-11-20 14:16:46 UTC


#10 Multipath Mitigation Technology-integrated GNSS Direct Position Estimation Plug-in Module [PDF] [Copy] [Kimi] [REL]

Authors: Sergio Vicenzo, Bing Xu

Direct position estimation (DPE) is an effective solution to the MP issue at the signal processing level. Unlike two-step positioning (2SP) receivers, DPE directly solves for the receiver position, velocity, and time (PVT) in the navigation domain, without the estimation of intermediate measurements, thus allowing it to provide more robust and accurate PVT estimates in the presence of multipath (MP) and weak signals. But GNSS positioning with DPE is mostly left unapplied commercially, and continuing research into DPE has remained relatively stagnant over the past few years. To encourage further research on DPE by the GNSS community, we propose a DPE plug-in module that can be integrated into the conventional 2SP software-defined receivers (SDRs). Programmed in MATLAB, the proposed DPE plug-in module is aimed for better understanding and familiarity of a practical implementation of DPE. Its plug-in module architecture allows it to be incorporated with 2SP MATLAB SDRs, both vector tracking and scalar tracking with minimum changes, making it easy to use, and provides greater flexibility for researchers using various 2SP SDRs. Since the proposed DPE implementation makes use of tracking observables from 2SP to propagate the channel, we propose to further improve the performance of DPE against MP through using MP-compensated observables generated from Multipath Mitigation Technology (MMT)-aided tracking. Referred to as Multipath Mitigation Technology (MMT)-integrated DPE, it is proposed as a variant of DPE that is better suit for urban environment applications. Results show that while in MP-only conditions, an MMT-integrated 2SP has similar performance with MMT-integrated DPE, the proposed MMT-integrated DPE manages to show great superiority against non-line-of-sight (NLOS), making it the preferable option for applications in urban environments.

Subject: Signal Processing

Publish: 2024-11-20 14:11:48 UTC


#11 Moving Horizon Estimation for Simultaneous Localization and Mapping with Robust Estimation Error Bounds [PDF] [Copy] [Kimi] [REL]

Authors: Jelena Trisovic, Alexandre Didier, Simon Muntwiler, Melanie N. Zeilinger

This paper presents a robust moving horizon estimation (MHE) approach with provable estimation error bounds for solving the simultaneous localization and mapping (SLAM) problem. We derive sufficient conditions to guarantee robust stability in ego-state estimates and bounded errors in landmark position estimates, even under limited landmark visibility which directly affects overall system detectability. This is achieved by decoupling the MHE updates for the ego-state and landmark positions, enabling individual landmark updates only when the required detectability conditions are met. The decoupled MHE structure also allows for parallelization of landmark updates, improving computational efficiency. We discuss the key assumptions, including ego-state detectability and Lipschitz continuity of the landmark measurement model, with respect to typical SLAM sensor configurations, and introduce a streamlined method for the range measurement model. Simulation results validate the considered method, highlighting its efficacy and robustness to noise.

Subjects: Systems and Control ; Robotics

Publish: 2024-11-20 13:26:11 UTC


#12 Analytic Design of Flat-Wire Inductors for High-Current and Compact DC-DC Converters [PDF] [Copy] [Kimi] [REL]

Authors: Sajjad Mohammadi, James L. Kirtley, Alireza Namadmalan

This paper presents analytic study and design considerations of flat wire inductors with distributed gaps for high-power and compact DC-DC Converters. The focus is eddy current loss components within the conductors due to fringing and leakage fluxes. A magnetic equivalent circuit (MEC) is proposed in which eddy currents are modeled by MMFs opposing the primary flux as well as frequency dependent reluctances, which finally leads to a frequency dependent inductance describing the behavior of the inductor at high frequencies. Three formulations for DC resistance depending on the required accuracy are developed. Calculations of the AC resistance based on vector potential obtained from FEM are provided. To provide an insight into the optimized design of such inductors, components of the magnetic flux and induced eddy currents along with sensitivity of the main inductor quantities such as DCR, ESR, loss components and inductance values to the design parameters are investigated. Finally, an inductor is prototyped and experimentally tested to verify the design.

Subject: Systems and Control

Publish: 2024-11-20 13:21:41 UTC


#13 Mutual Information-oriented ISAC Beamforming Design under Statistical CSI [PDF] [Copy] [Kimi] [REL]

Authors: Shanfeng Xu, Yanshuo Cheng, Siqiang Wang, Xinyi Wang, Zhong Zheng, Zesong Fei

Existing integrated sensing and communication (ISAC) beamforming design were mostly designed under perfect instantaneous channel state information (CSI), limiting their use in practical dynamic environments. In this paper, we study the beamforming design for multiple-input multiple-output (MIMO) ISAC systems based on statistical CSI, with the weighted mutual information (MI) comprising sensing and communication perspectives adopted as the performance metric. In particular, the operator-valued free probability theory is utilized to derive the closed-form expression for the weighted MI under statistical CSI. Subsequently, an efficient projected gradient ascent (PGA) algorithm is proposed to optimize the transmit beamforming matrix with the aim of maximizing the weighted MI.Numerical results validate that the derived closed-form expression matches well with the Monte Carlo simulation results and the proposed optimization algorithm is able to improve the weighted MI significantly. We also illustrate the trade-off between sensing and communication MI.

Subject: Signal Processing

Publish: 2024-11-20 13:16:47 UTC


#14 A CSI Feedback Framework based on Transmitting the Important Values and Generating the Others [PDF] [Copy] [Kimi] [REL]

Authors: Zhilin Du, Zhenyu Liu, Haozhen Li, Shilong Fan, Xinyu Gu, Lin Zhang

The application of deep learning (DL)-based channel state information (CSI) feedback frameworks in massive multiple-input multiple-output (MIMO) systems has significantly improved reconstruction accuracy. However, the limited generalization of widely adopted autoencoder-based networks for CSI feedback challenges consistent performance under dynamic wireless channel conditions and varying communication overhead constraints. To enhance the robustness of DL-based CSI feedback across diverse channel scenarios, we propose a novel framework, ITUG, where the user equipment (UE) transmits only a selected portion of critical values in the CSI matrix, while a generative model deployed at the BS reconstructs the remaining values. Specifically, we introduce a scoring algorithm to identify important values based on amplitude and contrast, an encoding algorithm to convert these values into a bit stream for transmission using adaptive bit length and a modified Huffman codebook, and a Transformer-based generative network named TPMVNet to recover the untransmitted values based on the received important values. Experimental results demonstrate that the ITUG framework, equipped with a single TPMVNet, achieves superior reconstruction performance compared to several high-performance autoencoder models across various channel conditions.

Subject: Signal Processing

Publish: 2024-11-20 13:08:58 UTC


#15 Efficient Localization with Base Station-Integrated Beyond Diagonal RIS [PDF] [Copy] [Kimi] [REL]

Authors: Mahmoud Raeisi, Hui Chen, Henk Wymeersch, Ertugrul Basar

This paper introduces a novel approach to efficient localization in next-generation communication systems through a base station (BS)-enabled passive beamforming utilizing beyond diagonal reconfigurable intelligent surfaces (BD-RISs). Unlike conventional diagonal RISs (D-RISs), which suffer from limited beamforming capability, a BD-RIS provides enhanced control over both phase and amplitude, significantly improving localization accuracy. By conducting a comprehensive Cramér-Rao lower bound (CRLB) analysis across various system parameters in both near-field and far-field scenarios, we establish the BD-RIS structure as a competitive alternative to traditional active antenna arrays. Our results reveal that BD-RISs achieve near active antenna arrays performance in localization precision, overcoming the limitations of D-RISs and underscoring its potential for high-accuracy positioning in future communication networks. This work envisions the use of BD-RIS for enabling passive beamforming-based localization, setting the stage for more efficient and scalable localization strategies in sixth-generation networks and beyond.

Subject: Signal Processing

Publish: 2024-11-20 13:06:04 UTC


#16 EEG Signal Denoising Using pix2pix GAN: Enhancing Neurological Data Analysis [PDF] [Copy] [Kimi] [REL]

Authors: Haoyi Wang, Xufang Chen, Yue Yang, Kewei Zhou, Meining Lv, Dongrui Wang, Wenjie Zhang

Electroencephalography (EEG) is essential in neuroscience and clinical practice, yet it suffers from physiological artifacts, particularly electromyography (EMG), which distort signals. We propose a deep learning model using pix2pixGAN to remove such noise and generate reliable EEG signals. Leveraging the EEGdenoiseNet dataset, we created synthetic datasets with controlled EMG noise levels for model training and testing across a signal-to-noise ratio (SNR) from -7 to 2. Our evaluation metrics included RRMSE and Pearson's CC, assessing both time and frequency domains, and compared our model with others. The pix2pixGAN model excelled, especially under high noise conditions, showing significant improvements in lower RRMSE and higher CC values. This demonstrates the model's superior accuracy and stability in purifying EEG signals, offering a robust solution for EEG analysis challenges and advancing clinical and neuroscience applications.

Subject: Signal Processing

Publish: 2024-11-20 12:55:14 UTC


#17 Unified Performance Control for Non-Square Nonlinear Systems with Relaxed Controllability [PDF] [Copy] [Kimi] [REL]

Authors: Bing Zhou, Kai Zhao, Yongduan Song

In this paper, we investigate the problem of unified prescribed performance tracking for a class of non-square strict-feedback nonlinear systems in the presence of actuator faults under relaxed controllability conditions. By using a skillful matrix decomposition and introducing some feasible auxiliary matrices, a more generalized controllability condition than the current state of the art is constructed, which can be applied to both square and non-square nonlinear systems subject to actuator faults and unknown yet time-varying control gain. Incorporating the relaxed controllability conditions and the uniform performance specifications into the backstepping design procedure, a prescribed performance fault-tolerant controller is developed that can achieve different performance demands without modifying the controller structure, which is more flexible and practical. In addition, the destruction of the system stability by unknown auxiliary matrices and unknown nonlinearities is circumvented by embedding the available core information of the state-dependent uncertainties into the design procedure. Both theoretical analysis and numerical simulation demonstrate the effectiveness and benefits of the proposed method.

Subject: Systems and Control

Publish: 2024-11-20 12:10:29 UTC


#18 OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater Imaging [PDF] [Copy] [Kimi] [REL]

Authors: Rajini Makam, Dhatri Shankari T M, Sharanya Patil, Suresh Sundram

Underwater environments pose significant challenges due to the selective absorption and scattering of light by water, which affects image clarity, contrast, and color fidelity. To overcome these, we introduce OceanLens, a method that models underwater image physics-encompassing both backscatter and attenuation-using neural networks. Our model incorporates adaptive backscatter and edge correction losses, specifically Sobel and LoG losses, to manage image variance and luminance, resulting in clearer and more accurate outputs. Additionally, we demonstrate the relevance of pre-trained monocular depth estimation models for generating underwater depth maps. Our evaluation compares the performance of various loss functions against state-of-the-art methods using the SeeThru dataset, revealing significant improvements. Specifically, we observe an average of 65% reduction in Grayscale Patch Mean Angular Error (GPMAE) and a 60% increase in the Underwater Image Quality Metric (UIQM) compared to the SeeThru and DeepSeeColor methods. Further, the results were improved with additional convolution layers that capture subtle image details more effectively with OceanLens. This architecture is validated on the UIEB dataset, with model performance assessed using Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM) metrics. OceanLens with multiple convolutional layers achieves up to 12-15% improvement in the SSIM.

Subject: Image and Video Processing

Publish: 2024-11-20 11:47:30 UTC


#19 Energy-based features and bi-LSTM neural network for EEG-based music and voice classification [PDF] [Copy] [Kimi] [REL]

Authors: Isaac Ariza, Ana M. Barbancho, Lorenzo J. Tardon, Isabel Barbancho

The human brain receives stimuli in multiple ways; among them, audio constitutes an important source of relevant stimuli for the brain regarding communication, amusement, warning, etc. In this context, the aim of this manuscript is to advance in the classification of brain responses to music of diverse genres and to sounds of different nature: speech and music. For this purpose, two different experiments have been designed to acquiere EEG signals from subjects listening to songs of different musical genres and sentences in various languages. With this, a novel scheme is proposed to characterize brain signals for their classification; this scheme is based on the construction of a feature matrix built on relations between energy measured at the different EEG channels and the usage of a bi-LSTM neural network. With the data obtained, evaluations regarding EEG-based classification between speech and music, different musical genres, and whether the subject likes the song listened to or not are carried out. The experiments unveil satisfactory performance to the proposed scheme. The results obtained for binary audio type classification attain 98.66% of success. In multi-class classification between 4 musical genres, the accuracy attained is 61.59%, and results for binary classification of musical taste rise to 96.96%.

Subjects: Signal Processing ; Sound ; Audio and Speech Processing

Publish: 2024-11-20 11:23:13 UTC


#20 Identification of Black-Box Inverter-Based Resource Control Using Hammerstein-Wiener Models [PDF] [Copy] [Kimi] [REL]

Authors: Aldin Dželo, Amer Mešanović, Mirsad Cosovic

The development of more complex inverter-based resources (IBRs) control is becoming essential as a result of the growing share of renewable energy sources in power systems. Given the diverse range of control schemes, grid operators are typically provided with black-box models of IBRs from various equipment manufacturers. As such, they are integrated into simulation models of the entire power system for analysis, and due to their nature, they can only be simulated in the time domain. Other system analysis approaches, like eigenvalue analysis, cannot be applied, making the comprehensive analysis of defined systems more challenging. This work introduces an approach for identification of three-phase IBR models for grid-forming and grid-following inverters using Hammerstein-Wiener models. To this end, we define a simulation framework for the identification process, and select suitable evaluation metrics for the results. Finally, we evaluate the approach on generic grid-forming and grid-following inverter models showing good identification results.

Subject: Systems and Control

Publish: 2024-11-20 11:19:59 UTC


#21 Intensity-Spatial Dual Masked Autoencoder for Multi-Scale Feature Learning in Chest CT Segmentation [PDF1] [Copy] [Kimi] [REL]

Authors: Yuexing Ding, Jun Wang, Hongbing Lyu

In the field of medical image segmentation, challenges such as indistinct lesion features, ambiguous boundaries,and multi-scale characteristics have long revailed. This paper proposes an improved method named Intensity-Spatial Dual Masked AutoEncoder (ISD-MAE). Based on the tissue-contrast semi-masked autoencoder, a Masked AutoEncoder (MAE) branch is introduced to perform intensity masking and spatial masking operations on chest CT images for multi-scale feature learning and segmentation tasks. The model utilizes a dual-branch structure and contrastive learning to enhance the ability to learn tissue features and boundary details. Experiments are conducted on multiple 2D and 3D datasets. The results show that ISD-MAE significantly outperforms other methods in 2D pneumonia and mediastinal tumor segmentation tasks. For example, the Dice score reaches 90.10% on the COVID19 LESION dataset, and the performance is relatively stable. However, there is still room for improvement on 3D datasets. In response to this, improvement directions are proposed, including optimizing the loss function, using enhanced 3D convolution blocks, and processing datasets from multiple perspectives.Our code is available at:https://github.com/prowontheus/ISD-MAE.

Subjects: Image and Video Processing ; Computer Vision and Pattern Recognition

Publish: 2024-11-20 10:58:47 UTC


#22 Coexistence of Real-Time Source Reconstruction and Broadband Services Over Wireless Networks [PDF] [Copy] [Kimi] [REL]

Authors: Anup Mishra, Nikolaos Pappas, Čedomir Stefanović, Onur Ayan, Xueli An, Yiqun Wu, Petar Popovski, Israel Leyva-Mayorga

Achieving a flexible and efficient sharing of wireless resources among a wide range of novel applications and services is one of the major goals of the sixth-generation of mobile systems (6G). Accordingly, this work investigates the performance of a real-time system that coexists with a broadband service in a frame-based wireless channel. Specifically, we consider real-time remote tracking of an information source, where a device monitors its evolution and sends updates to a base station (BS), which is responsible for real-time source reconstruction and, potentially, remote actuation. To achieve this, the BS employs a grant-free access mechanism to serve the monitoring device together with a broadband user, which share the available wireless resources through orthogonal or non-orthogonal multiple access schemes. We analyse the performance of the system with time-averaged reconstruction error, time-averaged cost of actuation error, and update-delivery cost as performance metrics. Furthermore, we analyse the performance of the broadband user in terms of throughput and energy efficiency. Our results show that an orthogonal resource sharing between the users is beneficial in most cases where the broadband user requires maximum throughput. However, sharing the resources in a non-orthogonal manner leads to a far greater energy efficiency.

Subject: Signal Processing

Publish: 2024-11-20 10:48:09 UTC


#23 Experimental Assessment of Human Blockage at sub-THz and mmWave Frequency Bands [PDF] [Copy] [Kimi] [REL]

Authors: Juan E. Galeote-Cazorla, Alejandro Ramírez-Arroyo, José-María Molina-García-Pardo, María-Teresa Martínez-Inglés, Juan F. Valenzuela Valdés

The fifth generation (5G) of mobile communications relies on extremely high data transmissions using a large variety of frequency bands, such as FR1 (sub-6 GHz) and FR2 (mmWave). Future mobile communications envisage using electromagnetic spectrum beyond FR2, i.e. above 100 GHz, known as sub-THz band. These new frequencies open up challenging scenarios where communications shall rely on a major contribution such as the line-of-sight (LoS) component. To the best of the authors' knowledge, for the first time in literature this work studies the human blockage effects over an extremely wide frequency band from 75 GHz to 215 GHz given: (i) the distance between the blocker and the antennas and (ii) the body orientation. Furthermore, the obtained results are modeled with the classical path loss models and compared to 3GPP alternatives. The average losses increase from 42 dB to 56 dB when frequency rises from 75 GHz to 215 GHz. In terms of distance, a 18 dB increment in the received power is found when the Tx--Rx separation is increased from 1 m to 2.5 m. Finally, the blocker orientation induces variations of up to 4.6 dB.

Subject: Signal Processing

Publish: 2024-11-20 10:46:42 UTC


#24 Coexistence of Radar and Communication with Rate-Splitting Wireless Access [PDF] [Copy] [Kimi] [REL]

Authors: Anup Mishra, Israel Leyva-Mayorga, Petar Popovski

This work investigates the coexistence of sensing and communication functionalities in a base station (BS) serving a communication user in the uplink and simultaneously detecting a radar target with the same frequency resources. To address inter-functionality interference, we employ rate-splitting (RS) at the communication user and successive interference cancellation (SIC) at the joint radar-communication receiver at the BS. This approach is motivated by RS's proven effectiveness in mitigating inter-user interference among communication users. Building on the proposed system model based on RS, we derive inner bounds on performance in terms of ergodic data information rate for communication and ergodic radar estimation information rate for sensing. Additionally, we present a closed-form solution for the optimal power split in RS that maximizes the communication user's performance. The bounds achieved with RS are compared to conventional methods, including spectral isolation and full spectral sharing with SIC. We demonstrate that RS offers a superior performance trade-off between sensing and communication functionalities compared to traditional approaches. Pertinently, while the original concept of RS deals only with digital signals, this work brings forward RS as a general method for including non-orthogonal access for sensing signals. As a consequence, the work done in this paper provides a systematic and parametrized way to effectuate non-orthogonal sensing and communication waveforms.

Subject: Signal Processing

Publish: 2024-11-20 10:41:16 UTC


#25 Quantitative Fairness -- A Framework For The Design Of Equitable Cybernetic Societies [PDF] [Copy] [Kimi] [REL]

Authors: Kevin Riehl, Michail Makridis, Anastasios Kouvelas

Advancements in computer science, artificial intelligence, and control systems of the recent have catalyzed the emergence of cybernetic societies, where algorithms play a significant role in decision-making processes affecting the daily life of humans in almost every aspect. Algorithmic decision-making expands into almost every industry, government processes critical infrastructure, and shapes the life-reality of people and the very fabric of social interactions and communication. Besides the great potentials to improve efficiency and reduce corruption, missspecified cybernetic systems harbor the threat to create societal inequities, systematic discrimination, and dystopic, totalitarian societies. Fairness is a crucial component in the design of cybernetic systems, to promote cooperation between selfish individuals, to achieve better outcomes at the system level, to confront public resistance, to gain trust and acceptance for rules and institutions, to perforate self-reinforcing cycles of poverty through social mobility, to incentivize motivation, contribution and satisfaction of people through inclusion, to increase social-cohesion in groups, and ultimately to improve life quality. Quantitative descriptions of fairness are crucial to reflect equity into algorithms, but only few works in the fairness literature offer such measures; the existing quantitative measures in the literature are either too application-specific, suffer from undesirable characteristics, or are not ideology-agnostic. Therefore, this work proposes a quantitative, transactional, distributive fairness framework, which enables systematic design of socially feasible decision-making systems. Moreover, it emphasizes the importance of fairness and transparency when designing algorithms for equitable, cybernetic societies.

Subject: Systems and Control

Publish: 2024-11-20 10:33:02 UTC