2024-10-29 | | Total: 47
Despite their wide-scale deployment and ability to make accurate high-frequency voltage measurements, communication network limitations have largely precluded the use of smart meters for real-time monitoring purposes in electric distribution systems. Although smart meter communication networks have limited bandwidth available per meter, they also have the ability to dedicate higher bandwidth to varying subsets of meters. Using this capability to enable real-time monitoring from smart meters, this paper proposes an online bandwidth-constrained sensor sampling algorithm that takes advantage of the graphical structure inherent in the power flow equations. The key idea is to use a spectral bandit framework where the estimated parameters are the graph Fourier transform coefficients of the nodal voltages. The structure provided by this framework promotes a sampling policy that strategically accounts for electrical distance. Maxima of sub-Gaussian random variables model the policy rewards, which relaxes distributional assumptions common in prior work. The scheme is implemented on a synthetic electrical network to dynamically identify meters exposing violations of voltage magnitude limits, illustrating the effectiveness of the proposed method.
The volatility of renewable energy sources and fluctuations in real-time electricity demand present significant challenges to traditional unit commitment (UC) methods, often causing system constraint violations. Conventional optimization algorithms face substantial difficulties in responding quickly to these variations, frequently requiring the relaxation of constraints or producing infeasible solutions. To address these challenges, a robust two-stage UC framework based on quantum reinforcement learning (QRL) is proposed in this work, which improves both decision-making speed and solution feasibility. In the first stage, the day-ahead scheduling of thermal generators is optimized. In the second stage, real-time adjustments are made to account for changes in renewable generation and load, with microgrids integrated to reduce the impact of uncertainties on the power system. Both stages are formulated as Markov decision processes (MDPs), and QRL is used to efficiently solve the problem. QRL provides key advantages, including more effective navigation of the high-dimensional solution space and faster convergence compared to classical methods, thus enhancing the robustness and computational efficiency of UC operations. The proposed QRL-based two-stage UC framework is validated using the IEEE RTS 24-bus system. Results demonstrate the effectiveness of the approach, showing improved solution feasibility and computational speed compared to conventional UC methods.
Extracting dynamic models from data is of enormous importance in understanding the properties of unknown systems. In this work, we employ Lipschitz neural networks, a class of neural networks with a prescribed upper bound on their Lipschitz constant, to address the problem of data-efficient nonlinear system identification. Under the (fairly weak) assumption that the unknown system is Lipschitz continuous, we propose a method to estimate the approximation error bound of the trained network and the bound on the difference between the simulated trajectories by the trained models and the true system. Empirical results show that our method outperforms classic fully connected neural networks and Lipschitz regularized networks through simulation studies on three dynamical systems, and the advantage of our method is more noticeable when less data is used for training.
The decarbonization of road freight transport is crucial for reducing greenhouse gas emissions (GHG) and achieving climate neutrality goals. This study develops a comprehensive Total Cost of Ownership (TCO) model to evaluate the economic viability and strategic pathways for decarbonizing road freight transport. The model integrates vehicles with infrastructures, encompassing costs associated with acquisition, operation, maintenance, energy consumption, environmental impacts, and end-of-life considerations. Our analysis covers medium- and heavy-duty vehicles across eight powertrain types, with variants on battery sizes and fuel cell powers, incorporating key financial parameters, technological advancements, and policy incentives. Data sources include real-world fleet data and costs gathered from expert interviews, cross-referenced with multiple public resources. Findings indicate that zero-emission and near-zero-emission vehicles, though currently more expensive, will become cost-competitive with diesel vehicles by leveraging advancements in battery, fuel cell, and hydrogen technologies.
In response to the increasing complexity of electricity markets due to low-carbon requirements and the integration of sustainable energy sources, this paper proposes a dynamic quantum computing enhanced bilevel optimization model for electricity market operations. The upper level focuses on market mechanism optimization using Reinforcement Learning (RL), specifically Proximal Policy Optimization (PPO), while the lower level models the bidding strategies of Generating Companies (GENCOs) using a Multi-Agent Deep Q-Network (MADQN) enhanced with quantum computing through a Variational Quantum Circuit (VQC). The three main contributions of this work are: (1) establishing a dynamic bilevel model with timely feedback between the upper and lower levels; (2) parameterizing and optimizing market mechanisms to derive the most effective designs; and (3) introducing quantum computing into the context of electricity markets to more realistically simulate market operations. The proposed model is tested on the IEEE 30-bus system with six GENCOs, demonstrating its effectiveness in capturing the complexities of modern electricity markets.
Deep reinforcement learning (DRL) is emerging as a promising method for adaptive robotic motion and complex task automation, effectively addressing the limitations of traditional control methods. However, ensuring safety throughout both the learning process and policy deployment remains a key challenge due to the risky exploration inherent in DRL, as well as the discrete nature of actions taken at intervals. These discontinuities, despite being part of a continuous action space, can lead to abrupt changes between successive actions, causing instability and unsafe intermediate states. To address these challenges, this paper proposes an integrated framework that combines DRL with a jerk-bounded trajectory generator (JBTG) and a robust low-level control strategy, significantly enhancing the safety, stability, and reliability of robotic manipulators. The low-level controller ensures the precise execution of DRL-generated commands, while the JBTG refines these motions to produce smooth, continuous trajectories that prevent abrupt or unsafe actions. The framework also includes pre-calculated safe velocity zones for smooth braking, preventing joint limit violations and ensuring compliance with kinematic constraints. This approach not only guarantees the robustness and safety of the robotic system but also optimizes motion control, making it suitable for practical applications. The effectiveness of the proposed framework is demonstrated through its application to a highly complex heavy-duty manipulator.
In this paper we address the optimal planification of general purpose tasks that includes a wide spectrum of situations: from project management of human teams to the coordination of an automated assembly line or the automated inspection of power grids. There exists many methods for planification. However, the vast majority of such methods are conceived for very specific problems or situations. The main consequences of this is that no general planification method exists and the rigidity that prevents the extension of current methods to new cases and applications. To address this, we propose a new truly general method ultimately based on the generalization of the Travelling Salesman Problem (TSP) that we call the Heterogeneous Multiworker Task Planification Problem (HMWTPP). The HMWTPP is then used to model and solve several classical problems included in the TSPLIB \cite{tsplib} library for validation. We then solve an example of an assembly line to show the capabilities and flexibility of the HMWTPP. To conclude, we adapt the HMWTPP to the planification of unmanned aerial vehicles (UAVs), specifically to the automated inspection of power grids. This adaptation was validated by solving real-life cases for power grids in ATLAS Flight Test Center at Villacarrillo, Spain.
Solar-powered base stations are a promising approach to sustainable telecommunications infrastructure. However, the successful deployment of solar-powered base stations requires precise prediction of the energy harvested by photovoltaic (PV) panels vs. anticipated energy expenditure in order to achieve affordable yet reliable deployment and operation. This paper introduces an innovative approach to predict energy harvesting by utilizing a novel conditional Long Short-Term Memory (Cond-LSTM) neural network architecture. Compared with LSTM and Transformer models, the Cond-LSTM model reduced the normalized root mean square error (nRMSE) by 69.6% and 42.7%, respectively. We also demonstrate the generalizability of our model across different scenarios. The proposed approach would not only facilitate an accurate cost-optimal PV-battery configuration that meets the outage probability requirements, but also help with site design in regions that lack historical solar energy data.
This paper explores the observability and estimation capability of dynamical systems using predominantly relative measurements of the system's state-space variables, with minimal to no reliance on absolute measurements of these variables. We concentrate on linear time-invariant systems, in which the observation matrix serves as the algebraic representation of a graph object. This graph object encapsulates the availability of relative measurements. Utilizing algebraic graph theory and abstract linear algebra (geometric) tools, we establish a link between the structure of the graph of relative measurements and the system-theoretic observability subspace of linear systems. Special emphasis is given to multi-agent networked systems whose dynamics are governed by the linear consensus protocol. We demonstrate the importance of absolute information and its placement to the system's dynamics in achieving full-state estimation. Finally, the analysis shifts to the synthesis of a distributed observer with relative measurements for single integrator dynamics, exemplifying the relevance of the preceding analytical findings. We support our theoretical analysis with numerical simulations.
A wide variety of data can be represented using third-order tensors, spanning applications in chemometrics, psychometrics, and image processing. However, traditional data-driven frameworks are not naturally equipped to process tensors without first unfolding or flattening the data, which can result in a loss of crucial higher-order structural information. In this article, we introduce a novel framework for the data-driven analysis of T-product-based dynamical systems (TPDSs), where the system evolution is governed by the T-product between a third-order dynamic tensor and a third-order state tensor. In particular, we examine the data informativity of TPDSs concerning system identification, stability, controllability, and stabilizability and illustrate significant computational improvements over traditional approaches by leveraging the unique properties of the T-product. The effectiveness of our framework is demonstrated through numerical examples.
In this paper, a novel approach for wireless localization is proposed and experimentally validated that leverages space-time coded reconfigurable intelligent surfaces (RIS). It is demonstrated that applying proper single-bit codes to each RIS element, enables accurate determination of the direction of arrival (AOA) at the receiver. Moreover, we introduce different scenarios that such technique can be used for localization. By incorporating RIS, a passive component, the method significantly reduces the complexity found in previous localization techniques. Additionally, the use of 1-bit codes minimizes hardware requirements, offering a reliable, low-cost solution for localization in advanced telecommunications networks.
This paper proposes a Risk-Averse Just-In-Time (RAJIT) operation scheme for Ammonia-Hydrogen-based Micro-Grids (AHMGs) to boost electricity-hydrogen-ammonia coupling under uncertainties. First, an off-grid AHMG model is developed, featuring a novel multi-mode ammonia synthesis process and a hydrogen-ammonia dual gas turbine with tunable feed-in ratios. Subsequently, a state-behavior mapping strategy linking hydrogen storage levels with the operation modes of ammonia synthesis is established to prevent cost-ineffective shutdowns. The proposed model substantially improves operational flexibility but results in a challenging nonlinear fractional program. Based upon this model, a data-driven RAJIT scheme is developed for the real-time rolling optimization of AHMGs. Unlike conventional one-size-fits-all schemes using one optimization method throughout, the data driven RAJIT intelligently switches between cost-effective deterministic optimization and risk-averse online-learning distributionally robust optimization depending on actual risk profiles, thus capitalizing on the respective strengths of these two optimization methods. To facilitate the solution of the resulting nonlinear program, we develop an equivalent-reformulation-based solution methodology by leveraging a constraint-tightening technique. Numerical simulations demonstrate that the proposed scheme guarantees safety and yields an overall cost reduction up to 14.6% compared with several state-of-the-art methods.
Defense hardening can effectively enhance the resilience of distribution networks against extreme weather disasters. Currently, most existing hardening strategies focus on reducing load shedding. However, for electricity-hydrogen distribution networks (EHDNs), the leakage risk of hydrogen should be controlled to avoid severe incidents such as explosions. To this end, this paper proposes an optimal hardening strategy for EHDNs under extreme weather, aiming to minimize load shedding while limiting the leakage risk of hydrogen pipelines. Specifically, modified failure uncertainty models for power lines and hydrogen pipelines are developed. These models characterize not only the effect of hardening, referred to as decision-dependent uncertainties (DDUs), but also the influence of disaster intensity correlations on failure probability distributions. Subsequently, a hardening decision framework is established, based on the two-stage distributionally robust optimization incorporating a hydrogen leakage chance constraint (HLCC). To enhance the computational efficiency of HLCC under discrete DDUs, an efficient second-order-cone transformation is introduced. Moreover, to address the intractable inverse of the second-order moment under DDUs, lifted variables are adopted to refine the main-cross moments. These reformulate the hardening problem as a two-stage mixed-integer second-order-cone programming, and finally solved by the column-and-constraint generation algorithm. Case studies demonstrate the effectiveness and superiority of the proposed method.
Distributed optimization finds many applications in machine learning, signal processing, and control systems. In these real-world applications, the constraints of communication networks, particularly limited bandwidth, necessitate implementing quantization techniques. In this paper, we propose distributed optimization dynamics over multi-agent networks subject to logarithmically quantized data transmission. Under this condition, data exchange benefits from representing smaller values with more bits and larger values with fewer bits. As compared to uniform quantization, this allows for higher precision in representing near-optimal values and more accuracy of the distributed optimization algorithm. The proposed optimization dynamics comprise a primary state variable converging to the optimizer and an auxiliary variable tracking the objective function's gradient. Our setting accommodates dynamic network topologies, resulting in a hybrid system requiring convergence analysis using matrix perturbation theory and eigenspectrum analysis.
Neural Control Barrier Functions (NCBFs) have shown significant promise in enforcing safety constraints on nonlinear autonomous systems. State-of-the-art exact approaches to verifying safety of NCBF-based controllers exploit the piecewise-linear structure of ReLU neural networks, however, such approaches still rely on enumerating all of the activation regions of the network near the safety boundary, thus incurring high computation cost. In this paper, we propose a framework for Synthesis with Efficient Exact Verification (SEEV). Our framework consists of two components, namely (i) an NCBF synthesis algorithm that introduces a novel regularizer to reduce the number of activation regions at the safety boundary, and (ii) a verification algorithm that exploits tight over-approximations of the safety conditions to reduce the cost of verifying each piecewise-linear segment. Our simulations show that SEEV significantly improves verification efficiency while maintaining the CBF quality across various benchmark systems and neural network structures. Our code is available at https://github.com/HongchaoZhang-HZ/SEEV.
We consider a susceptible-infected-susceptible (SIS) epidemic model in which a large group of individuals decide whether to adopt partially effective protection without being aware of their individual infection status. Each individual receives a signal which conveys noisy information about its infection state, and then decides its action to maximize its expected utility computed using its posterior probability of being infected conditioned on the received signal. We first derive the static signal which minimizes the infection level at the stationary Nash equilibrium under suitable assumptions. We then formulate an optimal control problem to determine the optimal dynamic signal that minimizes the aggregate infection level along the solution trajectory. We compare the performance of the dynamic signaling scheme with the optimal static signaling scheme, and illustrate the advantage of the former through numerical simulations.
Multi-agent cyber-physical systems are present in a variety of applications. Agent decision-making can be affected due to errors induced by uncertain, dynamic operating environments or due to incorrect actions taken by an agent. When an erroneous decision that leads to a violation of safety is identified, assigning responsibility to individual agents is a key step toward preventing future accidents. Current approaches to carrying out such investigations require human labor or high degree of familiarity with operating environments. Automated strategies to assign responsibility can achieve a significant reduction in human effort and associated cognitive burden. In this paper, we develop an automated procedure to assign responsibility for safety violations to actions of any single agent in a principled manner. We base our approach on reasoning about safety violations in road safety. Given a safety violation, we use counterfactual reasoning to create alternative scenarios, showing how different outcomes could have occurred if certain actions had been replaced by others. We introduce the degree of responsibility (DoR) metric for each agent. The DoR, using the Shapley value, quantifies each agent's contribution to the safety violation, providing a basis to explain and justify decisions. We also develop heuristic techniques and methods based on agent interaction structures to improve scalability as agent numbers grow. We examine three safety violation cases from the National Highway Traffic Safety Administration (NHTSA). We run experiments using CARLA urban driving simulator. Results show the DoR improves the explainability of decisions and accountability for agent actions and their consequences.
Optimal Power Flow (OPF) is essential for efficient planning and real-time operation in power systems but is NP-hard and non-convex, leading to significant computational challenges. Neural networks (NNs) offer computational speedups in solving OPF but face issues like dependency on large datasets, scalability limitations, and inability to enforce physical constraints, compromising solution reliability. To overcome these limitations, this paper proposes hybrid Quantum Neural Networks (QNNs) that integrate quantum computing principles into neural network architectures. Leveraging quantum mechanics properties such as superposition and entanglement, QNNs can capture complex input-output relationships more effectively and learn from small or noisy datasets. To further enhance the performance of QNNs and explore the role of the classical (non-quantum) components in hybrid architectures, we apply residual learning and incorporate physics-informed layers into the hybrid QNN designs. These techniques aim to improve training efficiency, generalization capability, and adherence to physical laws. Simulation results demonstrate that these enhanced hybrid QNNs outperform conventional NNs in solving OPF problems, even when trained on imperfect data. This work provides valuable insights into the design and optimization of hybrid QNNs, highlighting their potential to address complex optimization challenges in power systems.
Structural health monitoring of aerostructures often faces challenges identifying damage, especially in complex systems. Multi-input multi-output modal parameter identification methods are known to offer enhanced insight compared to single-input multi-output testing, as they allow for the identification of additional out-of-plane modes. The improved Loewner Framework presents a computationally efficient approach to extracting these modal parameters, focusing on natural frequencies and mode shapes as indicators of structural health. To address the challenges of damage detection, a numerical case study involving a cantilever beam with variable cross-sections is used to simulate various damage scenarios. Additionally, a full-scale experimental dataset from the BAE Hawk T1A trainer jet aircraft is employed for SHM for the first time. The modified total modal assurance criterion (MTMAC) is proposed as a standalone metric for assessing damage severity, while the coordinate modal assurance criterion (COMAC) is applied for localising damage. Benchmarking against methods such as least-squares complex exponential (LSCE) and stochastic subspace identification with the canonical variate analysis (SSI-CVA) demonstrates the effectiveness of the improved Loewner Framework in accurately identifying even small changes in modal parameters. The MTMAC and COMAC are shown to be valuable tools for, respectively, damage quantification and localisation.
We propose networked policy gradient play for solving Markov potential games including continuous action and state spaces. In the decentralized algorithm, agents sample their actions from parametrized and differentiable policies that depend on the current state and other agents' policy parameters. During training, agents estimate their gradient information through two consecutive episodes, generating unbiased estimators of reward and policy score functions. Using this information, agents compute the stochastic gradients of their policy functions and update their parameters accordingly. Additionally, they update their estimates of other agents' policy parameters based on the local estimates received through a time-varying communication network. In Markov potential games, there exists a potential value function among agents with gradients corresponding to the gradients of local value functions. Using this structure, we prove the almost sure convergence of joint policy parameters to stationary points of the potential value function. We also show that the convergence rate of the networked policy gradient algorithm is $\mathcal{O}(1/\epsilon^2)$. Numerical experiments on a dynamic multi-agent newsvendor problem verify the convergence of local beliefs and gradients. It further shows that networked policy gradient play converges as fast as independent policy gradient updates, while collecting higher rewards.
Demand-responsive connector (DRC) services are increasingly recognized for their convenience, comfort, and efficiency, offering seamless integrations between travelers' origins/destinations and major transportation hubs such as rail stations. Past analytical models for DRC optimization often failed to distinguish between two commonly used DRC operating strategies: (i) the "fully-flexible routing" strategy, where a vehicle serves only the requests received before its dispatch through an optimal tour, and (ii) the "semi-flexible routing" strategy, where a vehicle follows a predefined path through a swath to serve requests received en route. Additionally, these models often adopted oversimplified approaches for estimating local tour lengths and capturing the stochastic nature of demand. This paper distinctly identifies and analyzes the two DRC operating strategies, developing analytical models for each that accurately incorporate the second-order effects of stochastic demand and utilize refined local tour length formulas. Numerical experiments demonstrate that our models reduce cost estimation errors to within 2% for fully-flexible routing and to 0.25% for semi-flexible routing, a significant improvement over the previous errors of 8-12% and 6.3%, respectively. These enhanced models allow for more precise determination of critical demand densities for selecting between the two DRC strategies and the fixed-route feeder service. Our extensive numerical analysis offers many insights, particularly highlighting the transition from fully-flexible to semi-flexible routing as demand and region size increase, before ultimately shifting to fixed-route service. Additionally, zoning is identified as pivotal in DRC service design, with fully-flexible routing favoring square-shaped zones and semi-flexible routing preferring elongated rectangular zones.
Given a set of measurements, observability characterizes the distinguishability of a system's initial state, whereas constructability focuses on the final state in a trajectory. In the presence of process and/or measurement noise, the Fisher information matrices with respect to the initial and final states$\unicode{x2013}$equivalent to the stochastic observability and constructability Gramians$\unicode{x2013}$bound the performance of corresponding estimators through the Cramér-Rao inequality. This letter establishes a connection between stochastic observability and constructability of discrete-time linear systems and provides a more numerically stable way for calculating the stochastic observability Gramian. We define a dual system and show that the dual system's stochastic constructability is equivalent to the original system's stochastic observability, and vice versa. This duality enables the interchange of theorems and tools for observability and constructability. For example, we use this result to translate an existing recursive formula for the stochastic constructability Gramian into a formula for recursively calculating the stochastic observability Gramian for both time-varying and time-invariant systems, and we show the convergence of this sequence for the latter. Finally, we illustrate the robustness of our formula compared to existing (non-recursive) formulas through a numerical example.
Deep reinforcement learning (DRL) holds significant promise for managing voltage control challenges in simulated power grid environments. However, its real-world application in power system operations remains underexplored. This study rigorously evaluates DRL's performance and limitations within actual operational contexts by utilizing detailed experiments across the IEEE 14-bus system, Illinois 200-bus system, and the ISO New England node-breaker model. Our analysis critically assesses DRL's effectiveness for grid control from a system operator's perspective, identifying specific performance bottlenecks. The findings provide actionable insights that highlight the necessity of advancing AI technologies to effectively address the growing complexities of modern power systems. This research underscores the vital role of DRL in enhancing grid management and reliability.
This study introduces an advanced transient stability assessment (TSA) method for power systems, addressing the challenges of sample class imbalance and data noise through a novel CatBoost algorithm framework. By implementing a Gradient Harmonizing Mechanism (GHM), this method adjusts the gradient norm distribution across samples by incorporating a coordination parameter for each, thus optimizing the gradient weights for various sample types. This enhancement enables more effective training of the CatBoost algorithm, reducing the negative impacts of class imbalance and noise, and enhancing algorithmic performance. Additionally, the feature importance functionality of the CatBoost framework guides the placement of phasor measurement units, promoting economical operation of the power system. Numerical results from the New England 10-machine 39-bus system demonstrate the superior versatility, reduced application cost, and lower maintenance expenses of the proposed method compared to existing techniques.
Despite being the subject of study for several years, excessive vibration persists in the machining of metal parts. In this context, the Stability Lobe Diagram (SLD) is presented as a viable tool to mitigate this problem as a function of axial depth of cut and spindle speed. However, its accurate construction is subject to the consideration of multiple parameters and models, whose application may be affected by certain inherent uncertainties. In turn, this impacts its accuracy, especially in the stability and instability regions. The present study aims to characterize these uncertainties, analyze their influence on the SLD, and propose strategies for their reduction. Ultimately, the goal is to facilitate the user's decision-making when choosing the trajectory generation parameters.