2026-04-21 | | Total: 18
Financial misinformation poses significant threats to financial market stability and individuals' investment decisions. The multilingual environment and the inherent complexity of financial information present substantial challenges for Multilingual Financial Misinformation Detection (MFMD). Existing LLM-based approaches for financial misinformation detection primarily focus on English and a single financial misinformation detection task, which limits their ability to capture multilingual contexts and complex features. In this paper, we propose MFMDQwen, the first open-source LLM designed for MFMD tasks. Furthermore, we introduce MFMD4Instruction, the first instruction dataset supporting MFMD with LLMs, covering English, Chinese, Greek, and Bengali. We also construct MFMDBench, a benchmark dataset for evaluating the MFMD capabilities of LLMs. Experimental results on MFMDBench demonstrate that our model outperforms existing open-source LLMs. The project is available at https://github.com/lzw108/FMD.
High-fidelity, scalable market simulation is a key instrument for mechanism evaluation, stress testing, and counterfactual policy analysis. Yet existing simulators rarely achieve \emph{mechanism fidelity} beyond single-asset intraday settings, \emph{microstructure fidelity} against historical limit order books (LOB), and \emph{computational tractability} at market scale in a single system. This paper presents \textit{EvoMarket}, a discrete-event, multi-agent financial market simulator designed for intervention-oriented experiments in multi-asset and cross-day environments. EvoMarket couples a high-throughput execution core (optimized LOB data structures, hierarchical scheduling under propagation delays, and asynchronous per-asset matching) with explicit institutional mechanisms (market calendars, opening call auctions, price limits, and T+1 settlement). To avoid expensive black-box calibration, EvoMarket introduces an Oracle-guided in-run self-calibration mechanism that interprets microstructure discrepancy as missing order flow and synthesizes corrective orders at recording checkpoints. Experiments on China A-share order-flow and LOB data show close replay alignment over five trading days, fidelity gains from budgeted in-run calibration across depth levels, broad agent order-space coverage, and scalable performance under increasing input order rates and market breadth. We further demonstrate cross-asset linkage and event-study style intervention evaluation that produces structured dependence and interpretable event-time responses.
The matrix-free gather-batched-GEMM-scatter pattern eliminates global stiffness assembly for three-dimensional SIMP topology optimization, but the conventional three-stage implementation forces avoidable DRAM traffic between stages. We present a single fused CUDA kernel, implemented through CuPy's runtime compilation interface, that performs gather, per-element stiffness multiplication, and scatter accumulation in one pass. On a single RTX 4090 (24 GB), the fused path reaches a problem-size-dependent 4.6-7.3x end-to-end SIMP wall-time speedup across 216k-4.9M cantilever elements and 4.4x on the 499,125-element torsion benchmark. Against the same-precision FP32 three-stage baseline, the fused path still yields 2.3-4.6x on cantilever and 2.8x on torsion. Isolated CUDA-event cantilever-operator measurements reach 8.9-13.8x per matvec call, while separate instrumented board-power traces at 216k and 1M show 3.2-4.9x lower energy than matched FP64 runs. A separate bridge stress test shows the same FP32-versus-FP64 three-stage trend under one distributed-load case; direct fused-kernel bridge benchmarks are not reported. We also evaluate a BF16 WMMA variant: a separate PyTorch BF16 GEMM proxy on matching tensor shapes yields 14.3x, but direct condition-number estimates of 6.1e5-2.3e6 across 64k-512k uniform-density test states imply BF16 conditioning products of 2.4e3-9.1e3, far above the 256 threshold, observed alongside BF16 iterative-refinement stagnation at the two tested inner tolerances.
In the complex domain of microfluidics systems, analysing fluid flow patterns through random-shaped circular microchannels is significantly challenging task. Conventional approach of solving such problems using computational fluid dynamics often incapable due to their intensive computational requirements and high simulation times. In this study, addressing these limitations, we introduce $μ$-FlowNet, a deep learning framework based on the adaptable U-Net autoencoders. This model provides a data-driven approach that enhances the prediction and mapping of random-shaped circular microchannels and their corresponding fluid flow patterns. The datasets required for the training of the model is generated by performing extensive simulations using conventional approach of computational fluid dynamics methods. The datasets are then pre-processed and accessed the required spatial and temporal features that are essential for the training. We have trained three different models based on U-Net framework namely, standard U-Net, T-Net, and U-Net with attention mechanism to compare the prediction accuracy and loss. The accuracy of the $μ$-FlowNet is compared using metrics of dice score and intersection over union and it shows that U-Net with attention mechanism shows the highest dice score and IoU of 0.9317 and 0.8731, respectively and shows the highest structural similarity as compared to standard U-Net and T-Net. This show that U-Net with attention mechanism serves best model to map the fluid flow pattern with random datasets on testing.
Understanding the origin of optimization difficulty in high-dimensional combinatorial spaces remains a fundamental problem. Existing perspectives typically characterize difficulty in terms of properties of states, their connectivity, or distributions over states. However, search algorithms operate as stochastic processes evolving over time, and optimization is inherently a trajectory-level phenomenon. This motivates a shift from state-based to trajectory-based analysis. In this work, we adopt a trajectory-based perspective and analyze search dynamics through the evolution of a distance process. We identify a structural mechanism, which we term entropy-driven drift. This mechanism systematically biases trajectories toward high-entropy regions. This drift arises from asymmetry in local transitions induced by the underlying graph structure, independent of objective variation. In the absence of objective variation, trajectories that reach the target are atypical under the induced dynamics, leading to a discrepancy between rapid mixing and slow hitting. We formalize this mechanism in a canonical combinatorial setting with a highly symmetric underlying graph, where the symmetry allows explicit characterization of the induced drift. The mechanism highlights entropy-driven drift as a source of optimization difficulty and provides a trajectory-level framework for understanding search dynamics in combinatorial spaces.
Large language models (LLMs) hold great promise for business applications, yet business analysis remains inherently complex, demanding rigorous reasoning and the integration of diverse knowledge sources. Existing benchmarks typically target narrow tasks and thus leave a fundamental question unanswered: how can LLMs be reliably applied in business, and how are these applications grounded in underlying theoretical capabilities? To address this gap, we introduce BizCompass, a benchmark explicitly designed to connect theoretical foundations with practical business knowledge and applications. At the knowledge level, BizCompass covers four core domains--finance, economics, statistics, and operations management. At the application level, it structures tasks around three representative roles: the analyst, the trader, and the consultant. This dual-axis design not only exposes performance differences across realistic scenarios but also diagnoses which foundational capabilities enable or constrain success. We systematically evaluate both open-source and commercial LLMs, revealing how theoretical knowledge translates into practical performance in business. The results provide actionable insights for model selection and training optimization in real-world business contexts. All datasets and evaluation code are publicly released to support reproducibility and future research: https://bizcompass.dev.ypemc.com.
Polycube structures provide parametric domains for all-hexahedral (all-hex) mesh generation and analysis-suitable volumetric spline construction in isogeometric analysis (IGA). Recent learning-based polycube pipelines have improved automation, yet several challenges remain when handling complex CAD geometries. These challenges include the limited diversity of primitive geometries, restricted grid configurations, and the increasing cost of genus-guided context search during inference as both the primitive set and the grid size grow. In this paper, we present {Scalable DDPM-Polycube}, an extended diffusion-based polycube construction method that addresses these limitations. First, we expand the primitive set from two primitive geometries to three by introducing a blind-hole cube primitive, thereby improving the representation of local hole-like features that do not change the global genus. Second, we extend the grid configuration from the previous $2\times 1$ setting to an enlarged three-dimensional grid configuration, which increases representational capacity and reduces mapping distortion for complex geometries. Third, we develop a genus-guided context generation strategy together with a hierarchical verification procedure, enabling robust context generation in both user-guided and automated modes. Once a valid polycube structure is generated, it is used for parametric mapping, all-hex control mesh generation, and volumetric spline construction. Experimental results demonstrate that scalable DDPM-Polycube improves the generality, scalability, and automation of diffusion-based polycube generation, and supports hex mesh generation and volumetric spline construction for IGA applications on complex geometries.
Standard risk models reduce the rich dependence structure of financial markets to scalar volatility estimates, discarding the topological information encoded in cross-asset correlation networks. We present ORCA (Online Regime Correlation Analyzer), an end-to-end framework that fuses spectral graph theory, random matrix theory, and supervised machine learning to deliver calibrated probability estimates for both rally and crash events over a ten-day forward horizon. ORCA constructs rolling correlation matrices from 24 diversified exchange-traded instruments using three parallel estimators at different time scales, and extracts 127 spectral features (absorption ratios, eigenvalue entropy, effective rank, spectral gap, eigenvector concentration, and graph-topological descriptors at multiple correlation thresholds), concatenated with 79 traditional price-derived indicators to form a 206-dimensional feature vector. A depth-limited Random Forest with balanced sub-sample weighting is evaluated under a strict eight-fold walk-forward protocol with ten-day anti-leakage gaps spanning fifteen years of daily US market data. ORCA achieves a Balanced Crisis Detection AUC (BCD-AUC, the geometric mean of rally and crash AUC) of 0.741, ranking first against all baselines. Ablation studies show that spectral features contribute +10.3 percentage points of AUC for crash detection and +5.2 for rally detection over traditional features alone, with SHAP analysis revealing that graph-topological descriptors (clustering coefficient, edge density, and dominant-eigenvalue percentile rank) are the three most important crash predictors. A backtested walk-forward strategy mapping the joint rally-crash signal to dynamic equity exposure with risk-on/risk-off rotation achieves a Sharpe ratio of 1.13, a CAGR of 15.6%, and a maximum drawdown of only -7.5%, versus 3.7% CAGR and -33.7% drawdown for buy-and-hold.
Bitcoin transaction fees will become more important as the block subsidy declines, but fee formation is hard to study with blockchain data alone because the relevant queueing environment is unobserved. We develop and estimate a structural model of Bitcoin fee choice that treats the mempool as a market for scarce blockspace. We assemble a novel, high-frequency mempool panel, from a self-run Bitcoin node that records transaction arrivals, exits, block inclusion, fee-bumping events, and congestion snapshots. We characterize the fee market as a Vickery-Clarke-Groves mechanism and derive an equation to estimate fees. In the first-stage we estimate a monotone delay technology linking fee-rate priority and network state to expected confirmation delay. We then estimate how fees respond to that delay technology and to transaction characteristics. We find that congestion is the main determinant of delay; that the marginal value of priority is priced in fees, which is increasing in the gradient of confirmation time reduction per movement up in the fee queue; and that transactor choice of RBF, CPFP, and block conditions have economically important effects on fees.
Snow depth plays a central role in seasonal snowpack characterization and the terrestrial water cycle, yet remains challenging to estimate at high spatial resolution. Recent studies have shown that repeat-pass interferometric synthetic aperture radar (InSAR) measurements combined with physics-based models can enable effective snow water equivalent (SWE) retrieval. However, the performance of these methods depends strongly on measurement accuracy and modeling assumptions. Building on the success of InSAR-based approaches, we develop a robust learning-based model that directly learns the relationship between measured InSAR observables and snow depth. The model is trained on a single SnowEx Idaho site and evaluated across independent years and geographically distinct regions. Results demonstrate strong temporal and spatial transferability. In temporal transfer experiments, the proposed approach achieves a Pearson correlation of 0.81 with lidar snow depth, compared to a correlation of approximately 0.47 reported for physics-based Sentinel-1 SWE retrievals over the same site.
Can we learn the physics of matter in motion directly from images and video--and trust it? Answering this question requires integrating experiments, physics-based simulation, and data across traditionally separate disciplines. Much of this knowledge is visual and temporal rather than textual: images and videos encode structure, dynamics, and causality that equations alone cannot fully capture. Recent generative models produce compelling visual content, yet they rely on observational data and often lack physical validity. Here we show that generative video models gain scientific value when they couple visual data with experiments and high-fidelity simulations. Using deformation mechanics as a testbed, we study three systems of increasing complexity--rubber compression, can crushing, and cardiac motion--and identify regimes in which visual learning succeeds, fails, and requires mechanistic supervision. When physics manifests in visible kinematics, generative models recover measurable quantities such as surface strain; when internal state variables dominate, visual plausibility no longer ensures physical admissibility. We propose that this convergence defines a new frontier, the Generative Sciences of Matter and Motion, which unifies Simulogenics, Physiogenics, and Materiogenics. These physics-grounded foundation models can turn visual generation into a scientific instrument for inference, prediction, and design of matter in motion.
This paper develops a geospatial framework for climate risk stress testing in California with applications to banking and climate-exposed sectors such as agriculture, real estate, and tourism. The study integrates physical hazard mapping, sector-specific exposure analysis, and scenario-based financial risk assessment to evaluate how wildfires, drought, flooding, extreme heat, and transition risks may affect regional economic activity and financial stability. The framework is intended to support portfolio monitoring, climate scenario analysis, and institutional readiness under emerging disclosure and risk-management standards. In addition, the paper provides a survey-based implementation guide for benchmarking current climate-risk practices and data needs across industry and academic stakeholders.
Shell structures are pivotal in the fields of architecture and engineering, due to their aesthetic appeal and structural efficiency. Recently, 3D concrete printing has reignited the interest in these structures. But, as printed concrete cannot be reinforced with steel, structures built in this way must be designed to withstand primarily pure compression: they must be funicular shells. Nevertheless, a fundamental challenge remains unsolved since Robert Hooke's discovered the catenary arch in 1675: it is not known whether the concept of a funicular polygon can be generalised to three-dimensional structures. Generative Adversarial Networks (GANs), have shown remarkable success in generating realistic data samples matching the distribution of the training data and have been shown to produce highly convincing synthetic images. This work proposes a physics-informed generative adversarial framework for the design of funicular shell structures. The approach employs a modified Deep Convolutional Generative Adversarial architecture physically guided by an auxiliary discriminator to generate realistic and structurally efficient shell geometries. Specifically, the model is constrained by the membrane factor to penalize geometries dominated by bending. An additional discriminator is also employed allowing the model to deal with more complex structures. Results show that the developed model is stable and capable of generating physically optimal, previously unseen, funicular shells with smooth forms and high membrane factor distributions.
Digital marketplaces processing billions of dollars annually represent critical infrastructure in sociotechnical ecosystems, yet their performance optimization lacks principled measurement frameworks that can inform algorithmic governance decisions regarding market efficiency and fairness from complex market data. By looking at orderbook data from double auction markets alone, because bids and asks do not represent true maximum willingnesses to buy and true minimum willingnesses to sell, there is little an economist can say about the market's actual performance in terms of allocative efficiency. We turn to experimental data to address this issue, `inverting' the standard induced value approach of double auction experiments. Our aim is to predict key market features relevant to market efficiency, particularly allocative efficiency, using orderbook data only -- specifically bids, asks and price realizations, but not the induced reservation values -- as early as possible. Since there is no established model of strategically optimal behavior in these markets, and because orderbook data is highly unstructured, non-stationary and non-linear, we propose quantile-based normalization techniques that help us build general predictive models. We develop and train several models, including linear regressions and gradient boosting trees, leveraging quantile-based input from the underlying supply-demand model. Our models can predict allocative efficiency with reasonable accuracy from the earliest bids and asks, and these predictions improve with additional realized price data. The performance of the prediction techniques varies by target and market type. Our framework holds significant potential for application to real-world market data, offering valuable insights into market efficiency and performance, even prior to any trade realizations.
U.S. dollar stablecoins are increasingly used as payment and settlement instruments beyond cryptocurrency markets. With the enactment of the GENIUS Act in 2025, the United States established the first comprehensive federal framework governing their issuance, backing, and supervision. This paper evaluates the financial, technological, and regulatory risks that may arise as GENIUS-compliant stablecoins scale into mainstream use. We show that maintaining par-value redemption may depend not only on backing-asset quality, but also on the functioning of Treasury and repo markets, the balance-sheet capacity of broker-dealers, and the operational reliability of blockchain-based transaction rails. Even conservatively backed stablecoins can face stress from redemption surges, market-intermediation bottlenecks, or technological disruptions. We argue that durable stability will likely require an integrated approach spanning financial-market infrastructure, prudential regulation, and software governance. While grounded in U.S.\ law, the analysis identifies principles that are relevant for regulators in other jurisdictions developing stablecoin regimes.
As the cornerstone of modern power systems, the Unit Commitment Problem (UC) is critical for ensuring operational security and economic efficiency in the ongoing global energy transition. However, existing UC studies typically propose specialized algorithms for specific variants and operational requirements, tightly coupling the algorithms to their target models and limiting their applicability to other variants. To address this issue, this paper proposes a method that uses SAT-based reduction to decouple the algorithm from the problem, which allows a single algorithm to solve multiple UC variants. By uniformly reducing all UC variants to SAT instances solvable by standard SAT solvers, this method makes the solving algorithm independent of the original UC variant, thus granting it broad applicability across diverse variants. Experimental results show that our method achieves better solution quality than specialized algorithms and demonstrates stronger generalizability. This work offers a fast and flexible framework for addressing newly emerging UC formulations in evolving power systems.
Representing turbulent flow fields in a compact yet physically faithful form remains a central challenge in computational fluid dynamics. We propose a continuous parametric representation based on localized Gaussian primitives, in which the velocity field is modeled as a superposition of kernels with learnable positions, amplitudes, and scales. This formulation yields a compact, grid-independent encoding while enabling evaluation of derived quantities such as vorticity and enstrophy. The approach is assessed on three-dimensional Taylor-Green vortex fields spanning stages from smooth flow to fully developed turbulence. We quantify the compression-accuracy trade-off using both primary variables and derivative-sensitive diagnostics. The baseline isotropic formulation achieves high velocity accuracy at compression ratios exceeding 1e3-1e4, but exhibits substantial enstrophy degradation due to loss of small-scale structure. To address this limitation, we investigate structure-aware extensions including adaptive placement, multi-resolution kernels, and anisotropic Gaussians. The anisotropic formulation provides the most consistent improvement, better aligning with elongated vortical structures and recovering intermediate- and high-wavenumber content, while other strategies yield modest gains. A compact-support Beta basis improves enstrophy in some cases but introduces localized artifacts. Overall, the results indicate that the main limitation of baseline Gaussian representations lies in geometric expressiveness rather than parameter count. The proposed framework provides a compact, interpretable, and continuous representation of turbulent flows, and establishes a foundation for structure-aware and physics-informed flow compression.
Cashback reward programs now serve as central instruments in the competitive landscape of cards, digital wallets, and payment platforms. Despite their financial significance, the business logic governing these programs is seldom treated as a security critical surface. In this paper, we study a class of reward abuse attacks that arise from flaws in how reward systems accrue, redeem, and adjust incentives when underlying transactions are reversed through refunds. Using controlled, small scale experiments on six issuer accounts we legitimately hold, we document a spectrum of real world behaviors in production systems. At one extreme, a debit based cashback program (Issuer A) never adjusts rewards when refunded transactions post, enabling a deterministic double dip cashback reward abuse attack. A credit card program (Issuer B) exhibits an analogous reward integrity violation through a statement cycle timing gap that allows reward redemption before the merchant return window closes. At an intermediate tier, a credit card issuer (Issuer F) creates negative reward entries on refunds at statement close but makes rewards redeemable immediately upon settlement, creating a timing asymmetry that allows users to extract reward value before clawback occurs. At the robust end, three credit card issuers (C, D, and E) implement indefinite negative balance enforcement with proportional clawback. We formalize reward engines as state machines, introduce two integrity invariants (Reward Integrity and Refund Reward Consistency), develop a taxonomy of vulnerability classes mapped to CWE and OWASP, and present defensive pseudo algorithms with a semi formal correctness argument that close the identified loopholes. The primary vulnerability (Issuer A) was reported through a private bug bounty program and has been acknowledged by the vendor; good faith disclosure efforts for Issuer B are detailed in Section 8.