https://papers.cool/arxiv/math.NANumerical Analysis2024-11-01T00:00:00+00:00python-feedgenCool Papers - Immersive Paper Discoveryhttps://papers.cool/arxiv/2410.23359Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks2024-11-01T00:00:00+00:00Axel KlawonnMartin LanserJanine WeberIn many modern computer application problems, the classification of image data plays an important role. Among many different supervised machine learning models, convolutional neural networks (CNNs) and linear discriminant analysis (LDA) as well as sophisticated variants thereof are popular techniques. In this work, two different domain decomposed CNN models are experimentally compared for different image classification problems. Both models are loosely inspired by domain decomposition methods and in addition, combined with a transfer learning strategy. The resulting models show improved classification accuracies compared to the corresponding, composed global CNN model without transfer learning and besides, also help to speed up the training process. Moreover, a novel decomposed LDA strategy is proposed which also relies on a localization approach and which is combined with a small neural network model. In comparison with a global LDA applied to the entire input data, the presented decomposed LDA approach shows increased classification accuracies for the considered test problems.https://papers.cool/arxiv/2410.23440Learning Lipschitz Operators with respect to Gaussian Measures with Near-Optimal Sample Complexity2024-11-01T00:00:00+00:00Ben AdcockMichael GriebelGregor MaierOperator learning, the approximation of mappings between infinite-dimensional function spaces using ideas from machine learning, has gained increasing research attention in recent years. Approximate operators, learned from data, hold promise to serve as efficient surrogate models for problems in computational science and engineering, complementing traditional numerical methods. However, despite their empirical success, our understanding of the underpinning mathematical theory is in large part still incomplete. In this paper, we study the approximation of Lipschitz operators in expectation with respect to Gaussian measures. We prove higher Gaussian Sobolev regularity of Lipschitz operators and establish lower and upper bounds on the Hermite polynomial approximation error. We further consider the reconstruction of Lipschitz operators from $m$ arbitrary (adaptive) linear samples. A key finding is the tight characterization of the smallest achievable error for all possible (adaptive) sampling and reconstruction maps in terms of $m$. It is shown that Hermite polynomial approximation is an optimal recovery strategy, but we have the following curse of sample complexity: No method to approximate Lipschitz operators based on finitely many samples can achieve algebraic convergence rates in $m$. On the positive side, we prove that a sufficiently fast spectral decay of the covariance operator of the Gaussian measure guarantees convergence rates which are arbitrarily close to any algebraic rate in the large data limit $m \to \infty$. Finally, we focus on the recovery of Lipschitz operators from finitely many point samples. We consider Christoffel sampling and weighted least-squares approximation, and present an algorithm which provably achieves near-optimal sample complexity.https://papers.cool/arxiv/2410.23467Gradient-free training of recurrent neural networks2024-11-01T00:00:00+00:00Erik Lien BolagerAna CukarskaIryna BurakZahra MonfaredFelix DietrichRecurrent neural networks are a successful neural architecture for many time-dependent problems, including time series analysis, forecasting, and modeling of dynamical systems. Training such networks with backpropagation through time is a notoriously difficult problem because their loss gradients tend to explode or vanish. In this contribution, we introduce a computational approach to construct all weights and biases of a recurrent neural network without using gradient-based methods. The approach is based on a combination of random feature networks and Koopman operator theory for dynamical systems. The hidden parameters of a single recurrent block are sampled at random, while the outer weights are constructed using extended dynamic mode decomposition. This approach alleviates all problems with backpropagation commonly related to recurrent networks. The connection to Koopman operator theory also allows us to start using results in this area to analyze recurrent neural networks. In computational experiments on time series, forecasting for chaotic dynamical systems, and control problems, as well as on weather data, we observe that the training time and forecasting accuracy of the recurrent neural networks we construct are improved when compared to commonly used gradient-based methods.https://papers.cool/arxiv/2410.23647Acoustic wave diffraction by a quadrant of sound-soft scatterers2024-11-01T00:00:00+00:00Matthew NethercoteAnastasia KisilRaphael AssierMotivated by research in metamaterials, we consider the challenging problem of acoustic wave scattering by a doubly periodic quadrant of sound-soft scatterers arranged in a square formation, which we have dubbed the quarter lattice. This leads to a Wiener--Hopf equation in two complex variables with three unknown functions for which we can reduce and solve exactly using a new analytic method. After some suitable truncations, the resulting linear system is inverted using elementary matrix arithmetic and the solution can be numerically computed. This solution is also critically compared to a numerical least squares collocation approach and to our previous method where we decomposed the lattice into semi-infinite rows or columns.https://papers.cool/arxiv/2410.23385A Framework for the Solution of Tree-Coupled Saddle-Point Systems2024-11-01T00:00:00+00:00Christoph HansknechtBernhard HeinzelreiterJohn W. PearsonAndreas PotschkaWe consider the solution of saddle-point systems with a tree-based block structure, introducing a parallelizable direct method for their solution. As our key contribution, we then propose several structure-exploiting preconditioners to be used during applications of the MINRES and GMRES algorithms and analyze their properties. We adapt several concepts originating in the field of multigrid methods, obtaining a variety of problem-adapted multi-level methods. We analyze the complexity of all algorithms, and derive a number of results on eigenvalues of the preconditioned system and convergence of iterative methods. We validate our theoretical findings through a range of numerical experiments.https://papers.cool/arxiv/2410.23650An asymptotic-preserving IMEX PN method for the gray model of the radiative transfer equation2024-11-01T00:00:00+00:00Jinxue FuJuan ChengWeiming LiTao XiongYanli WangAn asymptotic-preserving (AP) implicit-explicit PN numerical scheme is proposed for the gray model of the radiative transfer equation, where the first- and second-order numerical schemes are discussed for both the linear and nonlinear models. The AP property of this numerical scheme is proved theoretically and numerically, while the numerical stability of the linear model is verified by Fourier analysis. Several classical benchmark examples are studied to validate the efficiency of this numerical scheme.https://papers.cool/arxiv/2410.23681Convergent analysis of algebraic multigrid method with data-driven parameter learning for non-selfadjoint elliptic problems2024-11-01T00:00:00+00:00Juan ZhangJunyue LuoIn this paper, we apply the practical GADI-HS iteration as a smoother in algebraic multigrid (AMG) method for solving second-order non-selfadjoint elliptic problem. Additionally, we prove the convergence of the derived algorithm and introduce a data-driven parameter learing method called Gaussian process regression (GPR) to predict optimal parameters. Numerical experimental results show that using GPR to predict parameters can save a significant amount of time cost and approach the optimal parameters accurately.https://papers.cool/arxiv/2410.23707Non-Hydrostatic Model for Simulating Moving Bottom-Generated Waves: A Shallow Water Extension with Quadratic Vertical Pressure Profile2024-11-01T00:00:00+00:00Kemal FirdausJörn BehrensWe formulate a depth-averaged non-hydrostatic model to solve wave equations with generation by a moving bottom. This model is built upon the shallow water equations, which are widely used in tsunami wave modelling. An extension leads to two additional unknowns to be solved: vertical momentum and non-hydrostatic pressure. We show that a linear vertical velocity assumption turns out to give us a quadratic pressure relation, which is equivalent to Boussinesq-type equations. However, this extension involves a time derivative of an unknown parameter, rendering the solution by a projection method ambiguous. In this study, we derive an alternative form of the elliptic system of equations to avoid such ambiguity. The new set of equations satisfies the desired solubility property, while also consistently representing the non-flat moving topography wave generation. Validations are performed using several test cases based on previous experiments and a high-fidelity simulation. First, we show the efficiency of our model in solving a vertical movement, which represents an undersea earthquake-generated tsunami. Following that, we demonstrate the accuracy of the model for landslide-generated waves. Finally, we compare the performance of our novel set of equations with the linear and simplified quadratic pressure profiles.https://papers.cool/arxiv/2410.23816A theoretical analysis of mass scaling techniques2024-11-01T00:00:00+00:00Yannis VoetEspen SandeAnnalisa BuffaMass scaling is widely used in finite element models of structural dynamics for increasing the critical time step of explicit time integration methods. While the field has been flourishing over the years, it still lacks a strong theoretical basis and mostly relies on numerical experiments as the only means of assessment. This contribution thoroughly reviews existing methods and connects them to established linear algebra results to derive rigorous eigenvalue bounds and condition number estimates. Our results cover some of the most successful mass scaling techniques, unraveling for the first time well-known numerical observations.https://papers.cool/arxiv/2410.23865A Primal Staggered Discontinuous Galerkin Method on Polytopal Meshes2024-11-01T00:00:00+00:00L. ChenX. HuangE. ParkR. WangThis paper introduces a novel staggered discontinuous Galerkin (SDG) method tailored for solving elliptic equations on polytopal meshes. Our approach utilizes a primal-dual grid framework to ensure local conservation of fluxes, significantly improving stability and accuracy. The method is hybridizable and reduces the degrees of freedom compared to existing approaches. It also bridges connections to other numerical methods on polytopal meshes. Numerical experiments validate the method's optimal convergence rates and computational efficiency.https://papers.cool/arxiv/2410.23945A Derivative-Orthogonal Wavelet Multiscale Method for 1D Elliptic Equations with Rough Diffusion Coefficients2024-11-01T00:00:00+00:00Qiwei FengBin HanIn this paper, we investigate 1D elliptic equations $-\nabla\cdot (a\nabla u)=f$ with rough diffusion coefficients $a$ that satisfy $0<a_{\min}\le a\le a_{\max}<\infty$ and $f\in L_2(\Omega)$. To achieve an accurate and robust numerical solution on a coarse mesh of size $H$, we introduce a derivative-orthogonal wavelet-based framework. This approach incorporates both regular and specialized basis functions constructed through a novel technique, defining a basis function space that enables effective approximation. We develop a derivative-orthogonal wavelet multiscale method tailored for this framework, proving that the condition number $\kappa$ of the stiffness matrix satisfies $\kappa\le a_{\max}/a_{\min}$, independent of $H$. For the error analysis, we establish that the energy and $L_2$-norm errors of our method converge at first-order and second-order rates, respectively, for any coarse mesh $H$. Specifically, the energy and $L_2$-norm errors are bounded by $2 a_{\min}^{-1/2} \|f\|_{L_2(\Omega)} H$ and $4 a_{\min}^{-1}\|f\|_{L_2(\Omega)} H^2$. Moreover, the numerical approximated solution also possesses the interpolation property at all grid points. We present a range of challenging test cases with continuous, discontinuous, high-frequency, and high-contrast coefficients $a$ to evaluate errors in $u, u'$ and $a u'$ in both $l_2$ and $l_\infty$ norms. We also provide a numerical example that both coefficient $a$ and source term $f$ contain discontinuous, high-frequency and high-contrast oscillations. Additionally, we compare our method with the standard second-order finite element method to assess error behaviors and condition numbers when the mesh is not fine enough to resolve coefficient oscillations. Numerical results confirm the bounded condition numbers and convergence rates, affirming the effectiveness of our approach.https://papers.cool/arxiv/2410.23973Decoupled structure-preserving discretization of incompressible MHD equations with general boundary conditions2024-11-01T00:00:00+00:00Yi ZhangArtur PalhaAndrea BrugnoliDeepesh ToshniwalMarc GerritsmaIn the framework of a mixed finite element method, a structure-preserving formulation for incompressible MHD equations with general boundary conditions is proposed. A leapfrog-type temporal scheme fully decouples the fluid part from the Maxwell part by means of staggered discrete time sequences and, in doing so, partially linearizes the system. Conservation and dissipation properties of the formulation before and after the decoupling are analyzed. We demonstrate optimal spatial and second-order temporal error convergence and conservation and dissipation properties of the proposed method using manufactured solutions, and apply it to the benchmark Orszag-Tang and lid-driven cavity test cases.https://papers.cool/arxiv/2410.23999A Power Method for Computing Singular Value Decomposition2024-11-01T00:00:00+00:00Doulaye DembeleThe singular value decomposition (SVD) allows to put a matrix as a product of three matrices: a matrix with the left singular vectors, a matrix with the positive-valued singular values and a matrix with the right singular vectors. There are two main approaches allowing to get the SVD result: the classical method and the randomized method. The analysis of the classical approach leads to accurate singular values. The randomized approach is especially used for high dimensional matrix and is based on the approximation accuracy without computing necessary all singular values. In this paper, the SVD computation is formalized as an optimization problem and a use of the gradient search algorithm. That results in a power method allowing to get all or the first largest singular values and their associated right vectors. In this iterative search, the accuracy on the singular values and the associated vector matrix depends on the user settings. Two applications of the SVD are the principal component analysis and the autoencoder used in the neural network models.https://papers.cool/arxiv/2410.24107Phase-field modeling of ductile fracture across grain boundaries in polycrystals2024-11-01T00:00:00+00:00Kim Louisa AuthJim BrouzoulisMagnus EkhIn this study, we address damage initiation and micro-crack formation in ductile failure of polycrystalline metals. We show how our recently published thermodynamic framework for ductile phase-field fracture of single crystals can be extended to polycyrstalline structures. A key feature of this framework is that is accounts for size effects by adopting gradient-enhanced (crystal) plasticity. Gradient-enhanced plasticity requires the definition of boundary conditions representing the plastic slip transmission resistance of the boundaries. In this work, we propose a novel type of micro-flexible boundary condition for gradient-plasticity, which couples the slip transmission resistance with the phase-field damage such that the resistance locally changes during the fracturing process. The formulation permits to maintain the effect of grain boundaries as obstacles for plastic slip during plastification, while also accounting for weakening of their resistance during the softening phase. In numerical experiments, the new damage-dependent boundary condition is compared to classical micro-free and micro-hard boundary conditions in polycrystals and it is demonstrated that it indeed produces a response that transitions from micro-hard to micro-free as the material fails. We show that the formulation maintains resistance to slip transmission during hardening, but can generate micro-cracks across grain boundaries during the fracture process. We further show examples of how the model can be used to simulate void coalescence and three-dimensional crack fronts in polycrystals.https://papers.cool/arxiv/2410.24138Nonlinear Two-Level Schwarz Methods: A Parallel Implementation in FROSch2024-11-01T00:00:00+00:00Alexander HeinleinKyrill HoAxel KlawonnMartin LanserOwing to the ability of nonlinear domain decomposition methods to improve the nonlinear convergence behavior of Newton's method, they have experienced a rise in popularity recently in the context of problems for which Newton's method converges slowly or not at all. This article introduces a novel parallel implementation of a two-level nonlinear Schwarz solver based on the FROSch (Fast and Robust Overlapping Schwarz) solver framework, part of Sandia's Trilinos library. First, an introduction to the key concepts underlying two-level nonlinear Schwarz methods is given, including a brief overview of the coarse space used to build the second level. Next, the parallel implementation is discussed, followed by preliminary parallel results for a scalar nonlinear diffusion problem and a 2D nonlinear plane-stress Neo-Hooke elasticity problem with large deformations.