Mathematics
- [1] arXiv:2405.07995 [pdf, ps, other]
-
Title: On estimation of Hankel determinants for certain class of starlike functionsComments: arXiv admin note: substantial text overlap with arXiv:2210.01435Subjects: Complex Variables (math.CV)
In the present study, we consider two subclasses starlike and convex functions, denoted by $\mathcal{S}_{\mathcal{B}}^{*}$ and $\mathcal{C}_{\mathcal{B}}$ respectively, associated with a bean-shaped domain. Further, we estimate certain sharp initial coefficients, as well as second, third and fourth-order Hankel determinants for functions belonging to the class $\mathcal{S}_{\mathcal{B}}^{*}$. Additionally, we compute sharp second and third-order Hankel determinants for functions belonging to the $\mathcal{C}_{\mathcal{B}}$ class.
- [2] arXiv:2405.07996 [pdf, ps, other]
-
Title: A Subspace Minimization Barzilai-Borwein Method for Multiobjective Optimization ProblemsComments: arXiv admin note: text overlap with arXiv:2309.06929Subjects: Optimization and Control (math.OC)
Nonlinear conjugate gradient methods have recently garnered significant attention within the multiobjective optimization community. These methods aim to maintain consistency in conjugate parameters with their single-objective optimization counterparts. However, the preservation of the attractive conjugate property of search directions remains uncertain, even for quadratic cases, in multiobjective conjugate gradient methods. This loss of interpretability of the last search direction significantly limits the applicability of these methods. To shed light on the role of the last search direction, we introduce a novel approach called the subspace minimization Barzilai-Borwein method for multiobjective optimization problems (SMBBMO). In SMBBMO, each search direction is derived by optimizing a preconditioned Barzilai-Borwein subproblem within a two-dimensional subspace generated by the last search direction and the current Barzilai-Borwein descent direction. Furthermore, to ensure the global convergence of SMBBMO, we employ a modified Cholesky factorization on a transformed scale matrix, capturing the local curvature information of the problem within the two-dimensional subspace. Under mild assumptions, we establish both global and $Q$-linear convergence of the proposed method. Finally, comparative numerical experiments confirm the efficacy of SMBBMO, even when tackling large-scale and ill-conditioned problems.
- [3] arXiv:2405.07997 [pdf, ps, html, other]
-
Title: New criteria for starlikeness in the unit discSubjects: Complex Variables (math.CV)
It is well-known that the condition ${\operatorname{Re}} \left[1+\frac{zf''(z)}{f'(z)}\right]>0$, $z\in{\mathbb D}$, implies that $f$ is starlike function (i.e. convexity implies starlikeness). If the previous condition is not satisfied for every $z\in {\mathbb D}$, then it is possible to get new criteria for starlikeness by using $\left|\arg\left[\alpha+\frac{zf''(z)}{f'(z)}\right]\right|$, $z\in{\mathbb D}$, where $\alpha>1.$
- [4] arXiv:2405.07999 [pdf, ps, html, other]
-
Title: Remarks on b-enriched nonexpansive mappingsSubjects: Functional Analysis (math.FA)
In this note, we analyzed the concept of enriched nonexpansive which was proposed in "Approximating fixed points of enriched nonexpansive mappings by Krasnoselskij iteration in Hilbert spaces" (Carpathian J. Math., 35(2019), No. 3, 293-304.) Through our analysis, we conclude that the idea of enriched nonexpansive needs reconsideration, as it coincides with well known concept of nonexpansive. Our findings provide an insights into the existing literature and highlight the need for further investigations and clarifications in the existing literature on a metric-fixed point theory.
- [5] arXiv:2405.08000 [pdf, ps, html, other]
-
Title: A characterization of the existence of zeros for operators with Lipschitzian derivative and closed rangeSubjects: Functional Analysis (math.FA)
Let $H$ be a real Hilbert space and $\Phi:H\to H$ be a $C^1$ operator with Lipschitzian derivative and closed range. We prove that $\Phi^{-1}(0)\neq \emptyset$ if and only if, for each $\epsilon>0$, there exist a convex set $X\subset H$ and a convex function $\psi:X\to {\bf R}$ such that $\sup_{x\in X}(\|x\|^2+\psi(x))-\inf_{x\in X}\|x\|^2+\psi(x))<\epsilon$ and $0\in \overline{conv}(\Phi(X))$.
- [6] arXiv:2405.08001 [pdf, ps, html, other]
-
Title: Preconditioned Nonlinear Conjugate Gradient Method for Real-time Interior-point HyperelasticitySubjects: Optimization and Control (math.OC); Graphics (cs.GR)
The linear conjugate gradient method is widely used in physical simulation, particularly for solving large-scale linear systems derived from Newton's method. The nonlinear conjugate gradient method generalizes the conjugate gradient method to nonlinear optimization, which is extensively utilized in solving practical large-scale unconstrained optimization problems. However, it is rarely discussed in physical simulation due to the requirement of multiple vector-vector dot products. Fortunately, with the advancement of GPU-parallel acceleration techniques, it is no longer a bottleneck. In this paper, we propose a Jacobi preconditioned nonlinear conjugate gradient method for elastic deformation using interior-point methods. Our method is straightforward, GPU-parallelizable, and exhibits fast convergence and robustness against large time steps. The employment of the barrier function in interior-point methods necessitates continuous collision detection per iteration to obtain a penetration-free step size, which is computationally expensive and challenging to parallelize on GPUs. To address this issue, we introduce a line search strategy that deduces an appropriate step size in a single pass, eliminating the need for additional collision detection. Furthermore, we simplify and accelerate the computations of Jacobi preconditioning and Hessian-vector product for hyperelasticity and barrier function. Our method can accurately simulate objects comprising over 100,000 tetrahedra in complex self-collision scenarios at real-time speeds.
- [7] arXiv:2405.08002 [pdf, ps, html, other]
-
Title: Toeplitz operators on the proper images of bounded symmetric domainsComments: 29 pagesSubjects: Complex Variables (math.CV); Functional Analysis (math.FA)
Let $\Omega$ be a bounded symmetric domain in $\mathbb C^n$ and $f :\Omega \to \Omega^\prime$ be a proper holomorphic mapping factored by (automorphisms) a finite complex reflection group $G.$ We define an appropriate notion of the Hardy space $H^2(\Omega^\prime)$ on $\Omega^\prime$ which can be realized as a closed subspace of an $L^2$-space on the Šilov boundary of $\Omega^\prime$. We study various algebraic properties of Toeplitz operators (such as the finite zero product property, commutative and semi-commutative property etc.) on $H^2(\Omega^\prime)$. We prove a Brown-Halmos type characterization for Toeplitz operators on $H^2(\Omega^\prime),$ where $\Omega^\prime$ is an image of the open unit polydisc in $\mathbb C^n$ under a proper holomorphic mapping factored by an irreducible finite complex reflection group.
- [8] arXiv:2405.08003 [pdf, ps, html, other]
-
Title: Continuous Krishna-Parthasarathy Entropic Uncertainty PrincipleComments: 7 pages, 0 FiguresJournal-ref: Special issue of Infinite Dimensional Analysis, Quantum Probability and Related Topics in honour of Prof. K. R. Parthasarathy, 18 March 2024Subjects: Functional Analysis (math.FA); Information Theory (cs.IT); Operator Algebras (math.OA); Quantum Algebra (math.QA)
In 2002, Krishna and Parthasarathy [\textit{Sankhyā Ser. A}] derived discrete quantum version of Maassen-Uffink [\textit{Phys. Rev. Lett., 1988}] entropic uncertainty principle. In this paper, using the notion of continuous operator-valued frames, we derive an entropic uncertainty principle for arbitrary family of operators indexed by measure spaces having finite measure. We give an application to the special case of compact groups.
- [9] arXiv:2405.08004 [pdf, ps, html, other]
-
Title: A class of explicit solutions for the Fermat problem for tetrahedraComments: 14 pages, 2 figuresSubjects: General Mathematics (math.GM)
We present a class of explicit solutions for the problem of minimization of the function $f(x,y,z)=\sum_{i=1}^{4}\sqrt{(x-x_{i})^2+(y-y_{i})^2+(z-z_{i})^2},$ which gives the location of the unique stationary (Fermat-Torricelli) point for four non-collinear and non-coplanar points $A_{i}=(x_{i},y_{i},z_{i}),$ determining tetrahedra, which are derived by a proper class of isosceles tetrahedra having four equal edges and two equal opposite edges. This class of explicit solutions contains Mehlhos and Glastier's explicit solutions (theoretical constructions) obtained in \cite{Mehlhos:00} and \cite{Glastier:93}, respectively.
- [10] arXiv:2405.08005 [pdf, ps, html, other]
-
Title: Graphon Mean Field Games with A Representative Player: Analysis and Learning AlgorithmSubjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI)
We propose a discrete-time graphon game formulation on continuous state and action spaces using a representative player to study stochastic games with heterogeneous interaction among agents. This formulation admits both philosophical and mathematical advantages, compared to a widely adopted formulation using a continuum of players. We prove the existence and uniqueness of the graphon equilibrium with mild assumptions, and show that this equilibrium can be used to construct an approximate solution for finite player game on networks, which is challenging to analyze and solve due to curse of dimensionality. An online oracle-free learning algorithm is developed to solve the equilibrium numerically, and sample complexity analysis is provided for its convergence.
- [11] arXiv:2405.08009 [pdf, ps, html, other]
-
Title: Approximating the common fixed point of enriched interpolative matkowski type mapping in Banach spaceComments: 17 pagesSubjects: Optimization and Control (math.OC)
In the Normed space theory, the existence of fixed points is one of the main tools in improving efficiency of iterative algorithms in optimization, numerical analysis and various mathematical applications. This study introduces and investigates a recent concept termed "enriched interpolative Matkowski-type mapping". Building upon the well-established foundation of Matkowski-type contractions. This extension incorporates an interpolative enrichment mechanism, yielding a refined framework for analyzing contraction mappings. The proposed concept is motivated by the desire to enhance the convergence behavior and applicability of contraction mapping principles in various mathematical and scientific domains.
- [12] arXiv:2405.08012 [pdf, ps, other]
-
Title: Zero-Sum Games for piecewise deterministic Markov decision processes with risk-sensitive finite-horizon cost criterionComments: NASubjects: Optimization and Control (math.OC); Probability (math.PR)
This paper investigates the two-person zero-sum stochastic games for piece-wise deterministic Markov decision processes with risk-sensitive finite-horizon cost criterion on a general state space. Here, the transition and cost/reward rates are allowed to be un-unbounded from below and above. Under some mild conditions, we show the existence of the value of the game and an optimal randomized Markov saddle-point equilibrium in the class of all admissible feedback strategies. By studying the corresponding risk-sensitive finite-horizon optimal differential equations out of a class of possibly unbounded functions, to which the extended Feynman-Kac formula is also justified to hold, we obtain our required results.
- [13] arXiv:2405.08028 [pdf, ps, html, other]
-
Title: Forbidden subdivision in integral treesComments: 5 pagesSubjects: Combinatorics (math.CO)
We show that if all the eigenvalues of a tree are integers, then it does not contain a subdivided edge with 7 vertices.
- [14] arXiv:2405.08047 [pdf, ps, html, other]
-
Title: Autonomous Sparse Mean-CVaR Portfolio OptimizationComments: ICML 2024Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG); Portfolio Management (q-fin.PM)
The $\ell_0$-constrained mean-CVaR model poses a significant challenge due to its NP-hard nature, typically tackled through combinatorial methods characterized by high computational demands. From a markedly different perspective, we propose an innovative autonomous sparse mean-CVaR portfolio model, capable of approximating the original $\ell_0$-constrained mean-CVaR model with arbitrary accuracy. The core idea is to convert the $\ell_0$ constraint into an indicator function and subsequently handle it through a tailed approximation. We then propose a proximal alternating linearized minimization algorithm, coupled with a nested fixed-point proximity algorithm (both convergent), to iteratively solve the model. Autonomy in sparsity refers to retaining a significant portion of assets within the selected asset pool during adjustments in pool size. Consequently, our framework offers a theoretically guaranteed approximation of the $\ell_0$-constrained mean-CVaR model, improving computational efficiency while providing a robust asset selection scheme.
- [15] arXiv:2405.08075 [pdf, ps, html, other]
-
Title: Identification of non-isomorphic 2-groups with dihedral central quotient and isomorphic modular group algebrasComments: 26 pages, 7 tables, 1 figureSubjects: Rings and Algebras (math.RA); Group Theory (math.GR)
The question whether non-isomorphic finite $p$-groups can have isomorphic modular group algebras was recently answered in the negative by García-Lucas, Margolis and del Río [J. Reine Angew. Math. 783 (2022), pp. 269-274]. We embed these negative solutions in the class of two-generated finite $2$-groups with dihedral central quotient, and solve the original question for all groups within this class. As a result, we discover new negative solutions and simple algebra isomorphisms. At the same time, the positive solutions for most of the groups in this class give some insights what makes the negative solutions special.
- [16] arXiv:2405.08103 [pdf, ps, html, other]
-
Title: Positive Knots and Ribbon ConcordanceComments: 11 pages, 1 figure, comments welcomeSubjects: Geometric Topology (math.GT)
Ribbon concordances between knots generalize the notion of ribbon knots. Agol, building on work of Gordon, proved ribbon concordance gives a partial order on knots in $S^3$. In previous work, the author and Greene conjectured that positive knots are minimal in this ordering. In this note we prove this conjecture for a large class of positive knots, and show that a positive knot cannot be expressed as a non-trivial band sum -- both results extend earlier theorems of Greene and the author for special alternating knots. In a related direction, we prove that if positive knots $K$ and $K'$ are concordant and $|\sigma(K)| \geq 2g(K) - 2$, then $K$ and $K'$ have isomorphic rational Alexander modules. This strengthens a result of Stoimenow, and gives evidence toward a conjecture that any concordance class contains at most one positive knot.
- [17] arXiv:2405.08105 [pdf, ps, html, other]
-
Title: The Hattori-Stallings rank, the Euler-Poincar\'e characteristic and zeta functions of totally disconnected locally compact groupsComments: 42 pagesSubjects: Group Theory (math.GR)
For a unimodular totally disconnected locally compact group $G$ we introduce and study an analogue of the Hattori-Stallings rank $\tilde{\rho}(P)\in\mathbf{h}_G$ for a finitely generated projective rational discrete left $\mathbb Q[G]$-module $P$. Here $\mathbf{h}_G$ denotes the $\mathbb Q$-vector space of left invariant Haar measures of $G$. Indeed, an analogue of Kaplansky's theorem holds in this context (cf. Theorem A). As in the discrete case, using this rank function it is possible to define a rational discrete Euler-Poincaré characteristic $\tilde{\chi}_G$ whenever $G$ is a unimodular totally disconnected locally compact group of type $\mathrm{FP}_\infty$ of finite rational discrete cohomological dimension. E.g., when $G$ is a discrete group of type $\mathrm{FP}$, then $\tilde{\chi}_G$ coincides with the ''classical'' Euler-Poincaré characteristic times the counting measure $\mu_{\{1\}}$. For a profinite group $\mathcal{O}$, $\tilde{\chi}_{\mathcal{O}}$ equals the probability Haar measure $\mu_{\mathcal{O}}$ on $\mathcal{O}$. Many more examples are calculated explicitly (cf. Example 1.7 and Section 5). In the last section, for a totally disconnected locally compact group $G$ satisfying an additional finiteness condition, we introduce and study a formal Dirichlet series $\zeta_{_{G,\mathcal{O}}}(s)$ for any compact open subgroup $\mathcal{O}$. In several cases it happens that $\zeta_{_{G,\mathcal{O}}}(s)$ defines a meromorphic function $\tilde{\zeta}_{_{G,\mathcal{O}}}\colon \mathbb{C} \to\bar{\mathbb C}$ of the complex plane satisfying miraculously the identity $\tilde{\chi}_G=\tilde{\zeta}_{_{G,\mathcal{O}}}(-1)^{-1}\cdot\mu_{\mathcal{O}}$. Here $\mu_{\mathcal{O}}$ denotes the Haar measure of $G$ satisfying $\mu_{\mathcal{O}}(\mathcal{O})=1$.
- [18] arXiv:2405.08108 [pdf, ps, html, other]
-
Title: Toric varieties admitting an action of a unipotent group with a finite number of orbitsSubjects: Algebraic Geometry (math.AG)
We describe complete simplicial toric varieties on which a unipotent group acts with a finite number of orbits. We also provide a complete list of such varieties in the case where the dimension is equal to 2.
- [19] arXiv:2405.08112 [pdf, ps, html, other]
-
Title: On the affine permutation group of certain decreasing Cartesian codesSubjects: Combinatorics (math.CO)
A decreasing Cartesian code is defined by evaluating a monomial set closed under divisibility on a Cartesian set. Some well-known examples are the Reed-Solomon, Reed-Muller, and (some) toric codes. The affine permutations consist of the permutations of the code that depend on an affine transformation. In this work, we study the affine permutations of some decreasing Cartesian codes, including the case when the Cartesian set has copies of multiplicative or additive subgroups.
- [20] arXiv:2405.08124 [pdf, ps, html, other]
-
Title: Faithfully flat ring maps are not descendableComments: Preprint, all comments are welcomeSubjects: Commutative Algebra (math.AC); Category Theory (math.CT)
We describe a general procedure of constructing non-trivial cup-products from non-trivial compositions in the derived category of a commutative ring. Using this, we show that there exist faithfully flat commutative boolean ring maps that are not descendable.
- [21] arXiv:2405.08126 [pdf, ps, other]
-
Title: Orthogonal Howe duality and dynamical (split) symmetric pairsComments: 53 pages. Comments welcomeSubjects: Representation Theory (math.RT); Quantum Algebra (math.QA)
Inspired by Etingof--Varchenko's dynamical fusion, dynamical $R$-matrix, and dynamical Weyl group for Lie algebras, we introduce, for split symmetric pairs, versions of dynamical fusion, dynamical $K$-matrix, and dynamical Weyl group. We then turn to the study of $(\mathfrak{so}_{2n},O_m)$-duality and prove that the standard Knizhnik-Zamolodchikov and dynamical operators (both differential and difference) on the $\mathfrak{so}_{2n}$-side are exchanged with the symmetric pair analogs, for $O_m\subset GL_m$, on the $O_m$-side.
- [22] arXiv:2405.08129 [pdf, ps, html, other]
-
Title: Wavelets for $L^2(B(0,1))$ using Zernike polynomialsSubjects: Functional Analysis (math.FA)
A set of orthogonal polynomials on the unit disk $B(0,1)$ known as Zernike polynomials are commonly used in the analysis and evaluation of optical systems. Here Zernike polynomials are used to construct wavelets for polynomial subspaces of $L^2(B(0,1)).$ This naturally leads to a multiresolution analysis of $L^2(B(0,1)).$ Previously, other authors have dealt with the one dimensional case, and used orthogonal polynomials of a single variable to construct time localized bases for polynomial subspaces of an $L^2$-space with arbitrary weight. Due to the nature of Zernike polynomials, the wavelet construction given here is well-suited for the analysis of two-dimensional signals defined on circular domains. This is shown by some experimental results done on corneal data.
- [23] arXiv:2405.08133 [pdf, ps, html, other]
-
Title: Asymptotics of bivariate algebraico-logarithmic generating functionsComments: To appear in the 2024 FPSAC proceedings in "Séminaire Lotharingien de Combinatoire"Subjects: Combinatorics (math.CO)
We derive asymptotic formulae for the coefficients of bivariate generating functions with algebraic and logarithmic factors. Logarithms appear when encoding cycles of combinatorial objects, and also implicitly when objects can be broken into indecomposable parts. Asymptotics are quickly computable and can verify combinatorial properties of sequences and assist in randomly generating objects. While multiple approaches for algebraic asymptotics have recently emerged, we find that the contour manipulation approach can be extended to these D-finite generating functions.
- [24] arXiv:2405.08140 [pdf, ps, html, other]
-
Title: Entropy numbers of Reproducing Hilbert Space of zonal positive definite kernels on compact two-point homogeneous spacesComments: arXiv admin note: text overlap with arXiv:2304.14103Subjects: Functional Analysis (math.FA)
We present estimates for the covering numbers of the unit ball of Reproducing Kernel Hilbert Spaces (RKHSs) of functions on $M^d$ a d-dimensional compact two-point homogeneous space. The RKHS is generated by a continuous zonal/isotropic positive definite kernel. We employ the representation in terms of the Schoenberg/Fourier series expansion for continuous isotropic positive definite kernels, given in terms of a family of orthogonal polynomials on $M^d$. The bounds we present carry accurate information about the asymptotic constants depending on the dimension of the manifold and the decay or growth rate of the coefficients of the kernel. The results we present extend the estimates previously known for continuous isotropic positive definite kernels on the d-dimensional unit sphere. We present the weak asymptotic equivalence for the order of the growth of covering numbers associated to kernels on $M^d$ with a convergent geometric sequence of coefficients. We apply our estimates in order to present a bound for the covering numbers of the spherical Gaussian kernel, and to present bounds for formal examples on $M^d$.
- [25] arXiv:2405.08162 [pdf, ps, html, other]
-
Title: Generalized planar Tur\'an numbers related to short cyclesSubjects: Combinatorics (math.CO)
Given two graphs $H$ and $F$, the generalized planar Turán number $\mathrm{ex}_\mathcal{P}(n,H,F)$ is the maximum number of copies of $H$ that an $n$-vertex $F$-free planar graph can have. We investigate this function when $H$ and $F$ are short cycles. Namely, for large $n$, we find the exact value of $\mathrm{ex}_\mathcal{P}(n, C_l,C_3)$, where $C_l$ is a cycle of length $l$, for $4\leq l\leq 6$, and determine the extremal graphs in each case. Also, considering the converse of these problems, we determine sharp upper bounds for $\mathrm{ex}_\mathcal{P}(n,C_3,C_l)$, for $4\leq l\leq 6$.
- [26] arXiv:2405.08165 [pdf, ps, html, other]
-
Title: Automorphisms of Fano threefolds of rank 2 and degree 28Subjects: Algebraic Geometry (math.AG)
We describe the automorphism groups of smooth Fano threefolds of rank 2 and degree 28 in the cases where they are finite.
- [27] arXiv:2405.08186 [pdf, ps, html, other]
-
Title: Metric lines in Engel-type groups and the nilpotent group $N_{6,3,1}$Subjects: Differential Geometry (math.DG); Optimization and Control (math.OC)
Given a sub-Riemannian manifold, which geodesics are "metric lines" (i.e. globally minimizing geodesics)? This article takes the first steps in answering this question for "arbitrary rank" and "non-integrable" Carnot groups. We classify the metric lines of the Engel-type groups $Eng(n)$ (Theorem B) and give a partial classification for the group of four-by-four nilpotent triangular matrices $N_{6,3,1}$ (Theorem C). The sub-Riamannian structure of the former group is defined on a non-integrable distribution of rank $n+1$ and the geodesic flow of the latter group is not algebraically integrable.
- [28] arXiv:2405.08194 [pdf, ps, html, other]
-
Title: Distributionally Robust Degree Optimization for BATS CodesComments: 8 pages, accepted by 2024 IEEE International Symposium on Information TheorySubjects: Information Theory (cs.IT)
Batched sparse (BATS) code is a network coding solution for multi-hop wireless networks with packet loss. Achieving a close-to-optimal rate relies on an optimal degree distribution. Technical challenges arise from the sensitivity of this distribution to the often empirically obtained rank distribution at the destination node. Specifically, if the empirical distribution overestimates the channel, BATS codes experience a significant rate degradation, leading to unstable rates across different runs and hence unpredictable transmission costs. Confronting this unresolved obstacle, we introduce a formulation for distributionally robust optimization in degree optimization. Deploying the resulting degree distribution resolves the instability of empirical rank distributions, ensuring a close-to-optimal rate, and unleashing the potential of applying BATS codes in real-world scenarios.
- [29] arXiv:2405.08201 [pdf, ps, other]
-
Title: Numerical approximation of the stochastic heat equation with a distributional reaction termSubjects: Probability (math.PR); Numerical Analysis (math.NA)
We study the numerical approximation of the stochastic heat equation with a distributional reaction term. Under a condition on the Besov regularity of the reaction term, it was proven recently that a strong solution exists and is unique in the pathwise sense, in a class of Hölder continuous processes. For a suitable choice of sequence $(b^k)_{k\in \mathbb{N}}$ approximating $b$, we prove that the error between the solution $u$ of the SPDE with reaction term $b$ and its tamed Euler finite-difference scheme with mollified drift $b^k$, converges to $0$ in $L^m(\Omega)$ with a rate that depends on the Besov regularity of $b$. In particular, one can consider two interesting cases: first, even when $b$ is only a (finite) measure, a rate of convergence is obtained. On the other hand, when $b$ is a bounded measurable function, the (almost) optimal rate of convergence $(\frac{1}{2}-\varepsilon)$-in space and $(\frac{1}{4}-\varepsilon)$-in time is achieved. Stochastic sewing techniques are used in the proofs, in particular to deduce new regularising properties of the discrete Ornstein-Uhlenbeck process.
- [30] arXiv:2405.08202 [pdf, ps, html, other]
-
Title: The mean field stubborn voter modelComments: 23 pagesSubjects: Probability (math.PR)
We analyse the effect of a fat-tailed waiting time distribution in the voter model on the complete graph with $N$ vertices. Our main result is the existence of a limiting infinite voter model on the slowest updating sites. We further derive explicitly the consensus probabilities in the limit model. To obtain these results, we study properties of the coalescing system of random walks that forms the dual of the limit voter model and prove, among other auxiliary statements, that the limit models comes down from infinity.
- [31] arXiv:2405.08208 [pdf, ps, html, other]
-
Title: Error bounds for a uniform asymptotic approximation of the zeros of the Bessel function $J_{\nu}(x)$Subjects: Classical Analysis and ODEs (math.CA)
A recent asymptotic expansion for the positive zeros $x=j_{\nu,m}$ ($m=1,2,3,\ldots$) of the Bessel function of the first kind $J_{\nu}(x)$ is studied, where the order $\nu$ is positive. Unlike previous well-known expansions in the literature, this is uniformly valid for one or both $m$ and $\nu$ unbounded, namely $m=1,2,3,\ldots$ and $1 \leq \nu < \infty$. Explicit and simple lower and upper error bounds are derived for the difference between $j_{\nu,m}$ and the first three terms of the expansion. The bounds are sharp in the sense they are close to the value of the fourth term of the expansion (i.e. the first neglected term).
- [32] arXiv:2405.08211 [pdf, ps, html, other]
-
Title: Simple Homogeneous Structures and Indiscernible Sequence InvariantsComments: 48 pagesSubjects: Logic (math.LO)
We introduce some properties describing dependence in indiscernible sequences: $F_{ind}$ and its dual $F_{Mb}$, the definable Morley property, and $n$-resolvability. Applying these properties, we establish the following results:
We show that the degree of nonminimality introduced by Freitag and Moosa, which is closely related to $F_{ind}$ (equal in $\mathrm{DCF}_{0}$), may take on any positive integer value in an $\omega$-stable theory, answering a question of Freitag, Jaoui, and Moosa.
Proving a conjecture of Koponen, we show that every simple theory with quantifier elimination in a finite relational language has finite rank and is one-based. The arguments closely rely on finding types $q$ with $F_{Mb}(q) = \infty$, and on $n$-resolvability.
We prove some variants of the simple Kim-forking conjecture, a generalization of the stable forking conjecture to $\mathrm{NSOP}_{1}$ theories. We show a global analogue of the simple Kim-forking conjecture with infinitely many variables holds in every $\mathrm{NSOP}_{1}$ theory, and show that Kim-forking with a realization of a type $p$ with $\mathrm{F}_{Mb}(p) < \infty$ satisfies a finite-variable version of this result. We then show, in a low $\mathrm{NSOP}_{1}$ theory or when $p$ is isolated, if $p \in S(C)$ has the definable Morley property for Kim-independence, Kim-forking with realizations of $p$ gives a nontrivial instance of the simple Kim-forking conjecture itself. In particular, when $F_{Mb}(p) < \infty$ and $|S^{F_{Mb}(p) + 1}(C)| < \infty$, Kim-forking with realizations of $p$ gives us a nontrivial instance of the simple Kim-forking conjecture.
We show that the quantity $F_{Mb}$, motivated in simple and $\mathrm{NSOP}_{1}$ theories by the above results, is in fact nontrivial even in stable theories. - [33] arXiv:2405.08212 [pdf, ps, html, other]
-
Title: Macroscopic Fluctuation Theory for Ginzburg-Landau dynamics with long range interactionsSubjects: Mathematical Physics (math-ph); Statistical Mechanics (cond-mat.stat-mech); Probability (math.PR)
Focusing on a famous class of interacting diffusion processes called Ginzburg-Landau (GL) dynamics, we extend the Macroscopic Fluctuations Theory (MFT) to these systems in the case where the interactions are long-range, and consequently, the macroscopic effective equations are described by non-linear fractional diffusion equations.
- [34] arXiv:2405.08215 [pdf, ps, other]
-
Title: Circles in diffractionComments: 23 pagesSubjects: Classical Analysis and ODEs (math.CA); Mathematical Physics (math-ph)
Given a Fourier transformable measure in two dimensions, we find a formula for the intensity of its Fourier transform along circles. In particular, we obtain a formula for the diffraction measure along a circle in terms of the autocorrelation measure. We look at some applications of this formula.
- [35] arXiv:2405.08225 [pdf, ps, other]
-
Title: Linear Operator Approximate Message Passing (OpAMP)Comments: 31 pages, 5 figuresSubjects: Statistics Theory (math.ST); Information Theory (cs.IT); Probability (math.PR)
This paper introduces a framework for approximate message passing (AMP) in dynamic settings where the data at each iteration is passed through a linear operator. This framework is motivated in part by applications in large-scale, distributed computing where only a subset of the data is available at each iteration. An autoregressive memory term is used to mitigate information loss across iterations and a specialized algorithm, called projection AMP, is designed for the case where each linear operator is an orthogonal projection. Precise theoretical guarantees are provided for a class of Gaussian matrices and non-separable denoising functions. Specifically, it is shown that the iterates can be well-approximated in the high-dimensional limit by a Gaussian process whose second-order statistics are defined recursively via state evolution. These results are applied to the problem of estimating a rank-one spike corrupted by additive Gaussian noise using partial row updates, and the theory is validated by numerical simulations.
- [36] arXiv:2405.08234 [pdf, ps, html, other]
-
Title: Nakajima's quiver varieties and triangular bases of bipartite cluster algebrasComments: arXiv admin note: text overlap with arXiv:2208.12307Subjects: Representation Theory (math.RT); Algebraic Geometry (math.AG)
Berenstein and Zelevinsky introduced quantum cluster algebras \cite{BZ1} and the triangular bases \cite{BZ2}. The support conjecture proposed in \cite{LLRZ}, which asserts that the support of each triangular basis element for a rank-2 cluster algebra is bounded by an explicitly described region, was established in \cite{L} for skew-symmetric rank-2 cluster algebras. In this paper we extend this result by proving a bound on the support of each triangular basis element for bipartite cluster algebras.
- [37] arXiv:2405.08236 [pdf, ps, html, other]
-
Title: Existence of attracting invariant 2-curves in fibred quadratic dynamicsSubjects: Dynamical Systems (math.DS)
We present a construction of new invariant sets for fibred polynomial dynamics with base an irrational rotation over the unit circle, called multi-curves. Furthermore, the local dynamical theory for attracting invariant curves is extended to these objects.
- [38] arXiv:2405.08239 [pdf, ps, html, other]
-
Title: Hypergraphs accumulateComments: 6 pagesSubjects: Combinatorics (math.CO)
We show that for every integer $k\geq3$, the set of Turán densities of $k$-uniform hypergraphs has an accumulation point in $[0,1)$. In particular, $1/2$ is an accumulation point for the set of Turán densities of $3$-uniform hypergraphs.
- [39] arXiv:2405.08256 [pdf, ps, html, other]
-
Title: The cohomology of the classifying space of $PU(4)$Comments: 25 pagesSubjects: Algebraic Topology (math.AT)
Let $BPU(n)$ be the classifying space of the projective unitary group $PU(n)$. We determine the integral cohomology ring of $BPU(4)$, and the Steenrod algebra structure of its mod $2$ cohomology.
- [40] arXiv:2405.08257 [pdf, ps, other]
-
Title: Global existence and multiplicity of solutions for logarithmic Schr\"{o}dinger equations on graphsSubjects: Analysis of PDEs (math.AP); Functional Analysis (math.FA)
We consider the following logarithmic Schrödinger equation
$$
-\Delta u+h(x)u=u\log u^{2}
$$ on a locally finite graph $G=(V,E)$, where $\Delta$ is a discrete Laplacian operator on the graph, $h$ is the potential function. Different from the classical methods in Euclidean space, we obtain the existence of global solutions to the equation by using the variational method from local to global, which is inspired by the works of Lin and Yang in \cite{LinYang}. In addition, when the potential function $h$ is sign-changing, we prove that the equation admits infinitely many solutions with high energy by using the symmetric mountain pass theorem. We extend the classical results in Euclidean space to discrete graphs. - [41] arXiv:2405.08258 [pdf, ps, other]
-
Title: On special properties of solutions to Camassa-Holm equation and related modelsSubjects: Analysis of PDEs (math.AP)
We study unique continuation properties of solutions to the b-family of equations. This includes the Camassa-Holm and the Degasperi-Procesi models. We prove that for both, the initial value problem and the periodic boundary value problem, the unique continuation results found in \cite{LiPo} are optimal. More precisely, the result established there for the constant $c_0=0$ fails for any constant $c_0\neq 0$.
- [42] arXiv:2405.08265 [pdf, ps, other]
-
Title: On the subadditivity of generalized Kodaira dimensionsComments: 30 pages, all comments are welcome!Subjects: Complex Variables (math.CV); Algebraic Geometry (math.AG)
The goals of this paper are of two aspects. Firstly, we introduce the notion of generalized numerical Kodaira dimension with multiplier ideal sheaf and establish the subadditivity inequalities in terms of this notion, which can be used to give an analytic proof of O. Fujino's result on the subadditivity of the log Kodaira dimensions. Secondly, motivated by Zhou-Zhu's subadditivity of generalized Kodaira dimensions, we adopt another definition of generalized Kodaira dimension with multiplier ideal sheaf and show they are equal by using Okounkov bodies. As one application, we show that the superadditivity part in Zhou-Zhu's setting also holds true. As another application, we give an alternative proof of Zhou-Zhu's subadditivity formula, in the case when the singular metric $h_L$ has analytic singularities, by using generalized Iitaka fibrations.
- [43] arXiv:2405.08266 [pdf, ps, other]
-
Title: Double orbits of weakly almost periodic functionsSubjects: Functional Analysis (math.FA)
A locally compact group G is called a WS-group if the double orbits of the weakly almost periodic functions on G are relatively weakly compact. It is known that Moore-groups are WS-groups. We will show that if a discrete FC-group is a WS-group then its center is of finite index in G. We will study noncompact locally compact groups with the property that if the double orbits of bounded continuous functions on G are relatively weakly compact then they are relatively norm compact. Examples of such groups include the motion group M(n) and the special linear group SL(n,R).
- [44] arXiv:2405.08269 [pdf, ps, other]
-
Title: On saturation of the discrepancy principle for nonlinear Tikhonov regularization in Hilbert spacesSubjects: Numerical Analysis (math.NA); Optimization and Control (math.OC)
In this paper we revisit the discrepancy principle for Tikhonov regularization of nonlinear ill-posed problems in Hilbert spaces and provide some new and improved saturation results under less restrictive conditions, comparing with the existing results in the literature.
- [45] arXiv:2405.08275 [pdf, ps, other]
-
Title: Power of $\ell_1$-Norm Regularized Kaczmarz Algorithms for High-Order Tensor RecoveryComments: arXiv admin note: text overlap with arXiv:2311.00783Subjects: Optimization and Control (math.OC); Computer Vision and Pattern Recognition (cs.CV); Numerical Analysis (math.NA)
Tensors serve as a crucial tool in the representation and analysis of complex, multi-dimensional data. As data volumes continue to expand, there is an increasing demand for developing optimization algorithms that can directly operate on tensors to deliver fast and effective computations. Many problems in real-world applications can be formulated as the task of recovering high-order tensors characterized by sparse and/or low-rank structures. In this work, we propose novel Kaczmarz algorithms with a power of the $\ell_1$-norm regularization for reconstructing high-order tensors by exploiting sparsity and/or low-rankness of tensor data. In addition, we develop both a block and an accelerated variant, along with a thorough convergence analysis of these algorithms. A variety of numerical experiments on both synthetic and real-world datasets demonstrate the effectiveness and significant potential of the proposed methods in image and video processing tasks, such as image sequence destriping and video deconvolution.
- [46] arXiv:2405.08280 [pdf, ps, other]
-
Title: Parallel-in-Time Iterative Methods for Pricing American OptionsComments: 20 pages, 7 figures, 3 tablesSubjects: Numerical Analysis (math.NA)
For pricing American options, %after suitable discretization in space and time, a sequence of discrete linear complementarity problems (LCPs) or equivalently Hamilton-Jacobi-Bellman (HJB) equations need to be solved in a sequential time-stepping manner. In each time step, the policy iteration or its penalty variant is often applied due to their fast convergence rates. In this paper, we aim to solve for all time steps simultaneously, by applying the policy iteration to an ``all-at-once form" of the HJB equations, where two different parallel-in-time preconditioners are proposed to accelerate the solution of the linear systems within the policy iteration. Our proposed methods are generally applicable for such all-at-once forms of the HJB equation, arising from option pricing problems with optimal stopping and nontrivial underlying asset models. Numerical examples are presented to show the feasibility and robust convergence behavior of the proposed methodology.
- [47] arXiv:2405.08281 [pdf, ps, other]
-
Title: Mahler's problem and Turyn polynomialsComments: 29 pages, 5 figuresSubjects: Number Theory (math.NT); Classical Analysis and ODEs (math.CA); Complex Variables (math.CV); Probability (math.PR)
Mahler's problem asks for the largest possible value of the Mahler measure, normalized by the $L_2$ norm, of a polynomial with $\pm1$ coefficients and large degree. We establish a new record value in this problem exceeding $0.95$ by analyzing certain Turyn polynomials, which are defined by cyclically shifting the coefficients of a Fekete polynomial by a prescribed amount. It was recently established that the distribution of values over the unit circle of Fekete polynomials of large degree is effectively modeled by a particular random point process. We extend this analysis to the Turyn polynomials, and determine expressions for the asymptotic normalized Mahler measure of these polynomials, as well as for their normalized $L_q$ norms. We also describe a number of calculations on the corresponding random processes, which indicate that the Turyn polynomials where the shift is approximately $1/4$ of the length have Mahler measure exceeding $95\%$ of their $L_2$ norm. Further, we show that these asymptotic values are not disturbed by a small change to make polynomials having entirely $\pm1$ coefficients, which establishes the result on Mahler's problem.
- [48] arXiv:2405.08291 [pdf, ps, html, other]
-
Title: Lie Rota--Baxter operators on the Sweedler algebra $H_4$Comments: 30 pagesSubjects: Group Theory (math.GR); Rings and Algebras (math.RA)
If $A$ is an associative algebra, then we can define the adjoint Lie algebra $A^{(-)}$ and Jordan algebra $A^{(+)}$. It is easy to see that any associative Rota--Baxter operator on $A$ induces a Lie and Jordan Rota--Baxter operator on $A^{(-)}$ and $A^{(+)}$ respectively. Are there Lie (Jordan) Rota--Baxter operators, which are not associative Rota--Baxter operators?
In the present article we are studying these questions for the Sweedler algebra $H_4$, that is a 4-dimension non-commutative Hopf algebra. More precisely, we describe the Rota--Baxter operators on Lie algebra on the adjoint Lie algebra $H_4^{(-)}$. - [49] arXiv:2405.08296 [pdf, ps, html, other]
-
Title: Area-Preserving Anisotropic Mean Curvature Flow in Two DimensionsComments: 30 pages, 3 figuresSubjects: Analysis of PDEs (math.AP)
We study the motion of sets by anisotropic curvature under a volume constraint in the plane. We establish the exponential convergence of the area-preserving anisotropic flat flow to a disjoint union of Wulff shapes of equal area, the critical point of the anisotropic perimeter functional. This is an anisotropic analogue of the results in the isotropic case studied in \cite{julin2022}. The novelty of our approach is in using the Cahn-Hoffman map to parametrize boundary components as small perturbations of the Wulff shape. In addition, we show that certain reflection comparison symmetries are preserved by the flat flow, which lets us obtain uniform bounds on the distance between the convergent profile and the initial data.
- [50] arXiv:2405.08301 [pdf, ps, other]
-
Title: Coded Downlink Massive Random Access and a Finite de Finetti TheoremComments: 14 Pages, submitted to IEEE Transactions on Information TheorySubjects: Information Theory (cs.IT)
This paper considers a massive connectivity setting in which a base-station (BS) aims to communicate sources $(X_1,\cdots,X_k)$ to a randomly activated subset of $k$ users, among a large pool of $n$ users, via a common downlink message. Although the identities of the $k$ active users are assumed to be known at the BS, each active user only knows whether itself is active and does not know the identities of the other active users. A naive coding strategy is to transmit the sources alongside the identities of the users for which the source information is intended, which would require $H(X_1,\cdots,X_k) + k\log(n)$ bits, because the cost of specifying the identity of a user is $\log(n)$ bits. For large $n$, this overhead can be significant. This paper shows that it is possible to develop coding techniques that eliminate the dependency of the overhead on $n$, if the source distribution follows certain symmetry. Specifically, if the source distribution is independent and identically distributed (i.i.d.) then the overhead can be reduced to at most $O(\log(k))$ bits, and in case of uniform i.i.d. sources, the overhead can be further reduced to $O(1)$ bits. For sources that follow a more general exchangeable distribution, the overhead is at most $O(k)$ bits, and in case of finite-alphabet exchangeable sources, the overhead can be further reduced to $O(\log(k))$ bits. The downlink massive random access problem is closely connected to the study of finite exchangeable sequences. The proposed coding strategy allows bounds on the relative entropy distance between finite exchangeable distributions and i.i.d. mixture distributions to be developed, and gives a new relative entropy version of the finite de Finetti theorem which is scaling optimal.
- [51] arXiv:2405.08306 [pdf, ps, other]
-
Title: Flight Path Optimization with Optimal Control MethodSubjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
This paper is based on a crucial issue in the aviation world: how to optimize the trajectory and controls given to the aircraft in order to optimize flight time and fuel consumption. This study aims to provide elements of a response to this problem and to define, under certain simplifying assumptions, an optimal response, using Constrained Finite Time Optimal Control(CFTOC). The first step is to define the dynamic model of the aircraft in accordance with the controllable inputs and wind disturbances. Then we will identify a precise objective in terms of optimization and implement an optimization program to solve it under the circumstances of simulated real flight situation. Finally, the optimization result is validated and discussed by different scenarios.
- [52] arXiv:2405.08325 [pdf, ps, other]
-
Title: Centers of Universal Enveloping AlgebrasSubjects: Representation Theory (math.RT); Rings and Algebras (math.RA)
The universal enveloping algebra $U(\mathfrak{g} )$ of a current (super)algebra or loop (super)algebra $\mathfrak{g} $ is considered over an algebraically closed field $\mathbb{K} $ with characteristic $p\ge 0$. This paper focuses on the structure of the center $Z(\mathfrak{g} )$ of $U(\mathfrak{g} )$. In the case of zero characteristic, $Z(\mathfrak{g} )$ is generated by the centers of $\mathfrak{g} $. In the case of prime characteristic, $Z(\mathfrak{g} )$ is generated by the centers of $\mathfrak{g} $ and the $p$-centers of $U(\mathfrak{g} )$. We also study the structure of $Z(\mathfrak{g} )$ in the semisimple Lie (super)algebra.
- [53] arXiv:2405.08332 [pdf, ps, html, other]
-
Title: Parameter estimation and long-range dependence of the fractional binomial processComments: 18 pages, 4 figuresSubjects: Statistics Theory (math.ST)
In 1990, Jakeman (see \cite{jakeman1990statistics}) defined the binomial process as a special case of the classical birth-death process, where the probability of birth is proportional to the difference between a fixed number and the number of individuals present. Later, a fractional generalization of the binomial process was studied by Cahoy and Polito (2012) (see \cite{cahoy2012fractional}) and called it as fractional binomial process (FBP). In this paper, we study second-order properties of the FBP and the long-range behavior of the FBP and its noise process. We also estimate the parameters of the FBP using the method of moments procedure. Finally, we present the simulated sample paths and its algorithm for the FBP.
- [54] arXiv:2405.08338 [pdf, ps, other]
-
Title: Homogeneous CR-manifold in $\mathbb{C}^4$Comments: in Russian languageSubjects: Complex Variables (math.CV)
In this paper we study holomorphically homogeneous model submanifolds CR-type (1, 3) complex space $\mathbb C^4$. One finds moduli space of five-dimensional model surfaces Bloom-Graham type ((2, 1), (3, 1), (4, 1)). It is shown that there exists unique model surface of this type with property of holomorphical homogeneous, which is equivalent to tube surface $\mathcal C$ with affin homogeneous base. One describes and classifies with respect to model surfaces the orbits relative to the group of holomorhical automorphisms of $\mathcal C$
- [55] arXiv:2405.08341 [pdf, ps, html, other]
-
Title: On approximation to a real number by algebraic numbers of bounded degreeComments: 21 pagesSubjects: Number Theory (math.NT)
In his seminal 1961 paper, Wirsing studied how well a given transcendental real number $\xi$ can be approximated by algebraic numbers $\alpha$ of degree at most $n$ for a given positive integer $n$, in terms of the so-called naive height $H(\alpha)$ of $\alpha$. He showed that the infimum $\omega^*_n(\xi)$ of all $\omega$ for which infinitely many such $\alpha$ have $|\xi-\alpha| \le H(\alpha)^{-\omega-1}$ is at least $(n+1)/2$. He also asked if we could even have $\omega^*_n(\xi) \ge n$ as it is generally expected. Since then, all improvements on Wirsing's lower bound were of the form $n/2+\mathcal{O}(1)$ until Badziahin and Schleischitz showed in 2021 that $\omega^*_n(\xi) \ge an$ for each $n\ge 4$, with $a=1/\sqrt{3}\simeq 0.577$. In this paper, we use a different approach partly inspired by parametric geometry of numbers and show that $\omega^*_n(\xi) \ge an$ for each $n\ge 2$, with $a=1/(2-\log 2)\simeq 0.765$.
- [56] arXiv:2405.08346 [pdf, ps, other]
-
Title: An infinite dimensional balanced embedding problem III: Asymptotics near infinityComments: 39pagesSubjects: Complex Variables (math.CV); Differential Geometry (math.DG)
We continue our study on the logarithmic balanced model metric initiated in our previous work. By a non-trivial refinement of the set of tools developed in our previous work, we are able to confirm partially a conjecture we made in our previous work on the asymptotic behavior of the balanced metric near infinity.
- [57] arXiv:2405.08347 [pdf, ps, other]
-
Title: Tree walks and the spectrum of random graphsComments: 26 pages, long version of a paper presented at the 35th International Conference on Probabilistic, Combinatorial and Asymptotic Methods for the Analysis of Algorithms (AofA 2024)Subjects: Combinatorics (math.CO); Spectral Theory (math.SP)
It is a classic result in spectral theory that the limit distribution of the spectral measure of random graphs G(n, p) converges to the semicircle law in case np tends to infinity with n. The spectral measure for random graphs G(n, c/n) however is less understood. In this work, we combine and extend two combinatorial approaches by Bauer and Golinelli (2001) and Enriquez and Menard (2016) and approximate the moments of the spectral measure by counting walks that span trees.
- [58] arXiv:2405.08352 [pdf, ps, other]
-
Title: Sibson's $\alpha$-Mutual Information and its Variational RepresentationsSubjects: Information Theory (cs.IT); Probability (math.PR)
Information measures can be constructed from Rényi divergences much like mutual information from Kullback-Leibler divergence. One such information measure is known as Sibson's $\alpha$-mutual information and has received renewed attention recently in several contexts: concentration of measure under dependence, statistical learning, hypothesis testing, and estimation theory. In this paper, we survey and extend the state of the art. In particular, we introduce variational representations for Sibson's $\alpha$-mutual information and employ them in each of the contexts just described to derive novel results. Namely, we produce generalized Transportation-Cost inequalities and Fano-type inequalities. We also present an overview of known applications, spanning from learning theory and Bayesian risk to universal prediction.
- [59] arXiv:2405.08358 [pdf, ps, other]
-
Title: Carleson measures on locally finite treesSubjects: Functional Analysis (math.FA); Classical Analysis and ODEs (math.CA)
We provide a characterization of Carleson measures on locally finite trees. This characterization establishes the connection between Carleson measures and the boundedness of a suitable Poisson integral between $L^p$-spaces. Additionally, when the tree has bounded degree, we investigate the relationship between Carleson measures and BMO functions defined on the boundary of the tree.
- [60] arXiv:2405.08360 [pdf, ps, other]
-
Title: A Local discontinuous Galerkin method for the Benajamin-Ono equationComments: arXiv admin note: text overlap with arXiv:2404.18069Subjects: Numerical Analysis (math.NA)
The main purpose of this paper is to design a local discontinuous Galerkin (LDG) method for the Benjamin-Ono equation. We analyze the stability and error estimates for the semi-discrete LDG scheme. We prove that the scheme is $L^2$-stable and it converges at a rate $\mathcal{O}(h^{k+1/2})$ for general nonlinear flux. Furthermore, we develop a fully discrete LDG scheme using the four-stage fourth order Runge-Kutta method and ensure the devised scheme is strongly stable in case of linear flux using two-step and three-step stability approach under an appropriate time step constraint. Numerical examples are provided to validate the efficiency and accuracy of the method.
- [61] arXiv:2405.08361 [pdf, ps, other]
-
Title: A Representability Theorem for Stacks in Derived Geometry ContextsComments: 89 pages, comments welcomeSubjects: Algebraic Geometry (math.AG); Algebraic Topology (math.AT); Category Theory (math.CT)
The representability theorem for stacks, due to Artin in the underived setting and Lurie in the derived setting, gives conditions under which a stack is representable by an $n$-geometric stack. In recent work of Ben-Bassat, Kelly, and Kremnizer, a new theory of derived analytic geometry has been proposed as geometry relative to the $(\infty,1)$-category of simplicial commutative Ind-Banach $R$-modules, for $R$ a Banach ring. In this paper, we prove a representability theorem which holds in a very general context, which we call a representability context, encompassing both the derived algebraic geometry context of Toën and Vezzosi and these new derived analytic geometry contexts. The representability theorem gives natural and easily verifiable conditions for checking that derived stacks in these contexts are $n$-geometric, such as having an $n$-geometric truncation, being nilcomplete, and having an obstruction theory. Future work will explore representability of certain moduli stacks arising in derived analytic geometry, for example moduli stacks of Galois representations.
- [62] arXiv:2405.08364 [pdf, ps, other]
-
Title: Is addition definable from multiplication and successor?Friedrich Wehrung (UNICAEN)Subjects: Rings and Algebras (math.RA)
A map $f\colon R\to S$ between (associative, unital, but not necessarily commutative) rings is a \emph{brachymorphism} if $f(x+1)=f(x)+1$ and $f(xy)=f(x)f(y)$ whenever $x,y\in R$.We tackle the problem whether every brachymorphism is additive (i.e., $f(x+y)=f(x)+f(y)$), showing that in many contexts, including the following, the answer is positive: -- $R$ is finite (or, more generally, $R$ is left or right Artinian); -- $R$ is any ring of $2\times2$ matrices over a commutative ring; -- $R$ is Engelian; -- every element of $R$ is a sum of $\pi$-regular and central elements (this applies to $\pi$-regular rings, C*-algebras, and power series rings); -- $R$ is the full matrix ring of order greater than~$1$ over any ring; -- $f$ is the power function $x\mapsto x^n$ over any ring; -- $f$ is the determinant function over any ring $R$ of $n\times n$ matrices, with $n\geq3$, over a commutative ring, such that if $n>3$ then $R$ contains $n$ scalar matrices with invertible differences.We leave open the problem whether every brachymorphism is additive.
- [63] arXiv:2405.08365 [pdf, ps, other]
-
Title: A Riemannian Proximal Newton-CG MethodSubjects: Optimization and Control (math.OC)
Recently, a Riemannian proximal Newton method has been developed for optimizing problems in the form of $\min_{x\in\mathcal{M}} f(x) + \mu \|x\|_1$, where $\mathcal{M}$ is a compact embedded submanifold and $f(x)$ is smooth. Although this method converges superlinearly locally, global convergence is not guaranteed. The existing remedy relies on a hybrid approach: running a Riemannian proximal gradient method until the iterate is sufficiently accurate and switching to the Riemannian proximal Newton method. This existing approach is sensitive to the switching parameter. This paper proposes a Riemannian proximal Newton-CG method that merges the truncated conjugate gradient method with the Riemannian proximal Newton method. The global convergence and local superlinear convergence are proven. Numerical experiments show that the proposed method outperforms other state-of-the-art methods.
- [64] arXiv:2405.08369 [pdf, ps, other]
-
Title: A generic approach to homogenization of a diffusion driven by growing incompressible driftBrice Franke (LMBA), Shuenn-Jyi Sheu (NCU)Subjects: Probability (math.PR)
We study how the resolvent-family of a diffusion behaves, as thedrift grows to infinity. The limit turns out to be a selfadjoint pseudo-resolvent.After reduction of the underlying Hilbert-space, this pseudo-resolvent becomesa resolvent to a strongly continuous semi-group of contractions. We prove thatthis semi-group is associated to some Hunt-process on some suitable state-space which is constructed from equivalence classes of the drifts trajectories.Finally, we show a distributional limit theorem for the accelerated diffusiontoward the associated Hunt process.
- [65] arXiv:2405.08371 [pdf, ps, other]
-
Title: Homogeneous spaces of semidirect products and finite Gelfand pairsSubjects: Representation Theory (math.RT); Combinatorics (math.CO); Functional Analysis (math.FA); Group Theory (math.GR)
Let $K\leq H$ be two finite groups and let $C\leq A$ be two finite abelian groups, with $H$ acting on $A$ as a group of isomorphisms admitting $C$ as a $K$-invariant subgroup. We study the homogeneous space $X\coloneqq\left(H\ltimes A\right)/\left(K\ltimes C\right)$ and determine the decomposition of the permutation representation of $H\ltimes A$ acting on $X$. We then characterize when this is multiplicity-free, that is, when $\left(H\ltimes A,K\ltimes C\right)$ is a Gelfand pair. If this is the case, we explicitly calculate the corresponding spherical functions. From our general construction and related analysis, we recover Dunkl's results on the $q$-analog of the nonbinary Johnson scheme.
- [66] arXiv:2405.08374 [pdf, ps, other]
-
Title: On Long Range Ising Models with Random Boundary ConditionsComments: 53 pagesSubjects: Mathematical Physics (math-ph)
We consider polynomial long-range Ising models in one dimension, with ferromagnetic pair interactions decaying with power $2-\alpha$ (for $0 \leq \alpha < 1$), and prepared with randomly chosen boundary conditions. We show that at low temperatures in the thermodynamic limit the finite-volume Gibbs measures do not converge, but have a distributional limit, the so-called metastate. We find that there is a distinction between the values of $\alpha$ less than or larger than $\frac{1}{2}$. For moderate, or intermediate, decay $\alpha < \frac{1}{2}$, the metastate is very dispersed and supported on the set of all Gibbs measures, both extremal and non-extremal, whereas for slow decays $\alpha > \frac{1}{2}$ the metastate is still dispersed, but has its support just on the set of the two extremal Gibbs measures, the plus measure and the minus measure.
The former, moderate decays case, appears to be new and is due to the occurrence of almost sure boundedness of the random variable which is the sum of all interaction (free) energies between random and ordered half-lines, when the decay is fast enough, but still slow enough to get a phase transition ($\alpha>0$); while the latter, slow decays case, is more reminiscent of and similar to the behaviour of higher-dimensional nearest-neighbour Ising models with diverging boundary (free) energies.
We leave the threshold case $\alpha=\frac{1}{2}$ for further studies. - [67] arXiv:2405.08375 [pdf, ps, other]
-
Title: OliVier: an Oil and Vinegar based cryptosystemSubjects: Commutative Algebra (math.AC)
In this paper, we present OliVier a new Public Key Exchange cryptosystem that is based on a multivariate quadratic polynomial system: Oil & Vinegar polynomials together with fully quadratic ones. We describe its designing process, usage, complexity
- [68] arXiv:2405.08378 [pdf, ps, other]
-
Title: Weak well-posedness and weak discretization error for stable-driven SDEs with Lebesgue driftSubjects: Probability (math.PR)
We are interested in the discretization of stable driven SDEs with additive noise for $\alpha$ $\in$ (1, 2) and Lq -- Lp drift under the Serrin type condition $\alpha$/q + d/p < $\alpha$ -- 1. We show weak existence and uniqueness as well as heat kernel estimates for the SDE and obtain a convergence rate of order (1/$\alpha$)*($\alpha$ -- 1 -- $\alpha$/q - d/p) for the difference of the densities for the Euler scheme approximation involving suitably cutoffed and time randomized drifts.
- [69] arXiv:2405.08379 [pdf, ps, html, other]
-
Title: Detecting and Handling Reflection Symmetries in Mixed-Integer (Nonlinear) ProgrammingSubjects: Optimization and Control (math.OC)
Symmetries in mixed-integer (nonlinear) programs (MINLP), if not handled appropriately, are known to negatively impact the performance of (spatial) branch-and-bound algorithms. Usually one thus tries to remove symmetries from the problem formulation or is relying on a solver that automatically detects and handles symmetries. While modelers of a problem can handle various kinds of symmetries, automatic symmetry detection and handling is mostly restricted to permutation symmetries. This article therefore develops techniques such that also black-box solvers can automatically detect and handle a broader class of symmetries.
Inspired from geometric packing problems such as the kissing number problem, we focus on reflection symmetries of MINLPs. We develop a generic and easily applicable framework that allows to automatically detect reflection symmetries for MINLPs. To handle this broader class of symmetries, we discuss generalizations of state-of-the-art methods for permutation symmetries, and develop dedicated symmetry handling methods for special reflection symmetry groups. Our symmetry detection framework has been implemented in the open-source solver SCIP and we provide a comprehensive discussion of the implementation. The article concludes with a detailed numerical evaluation of our symmetry handling methods when solving MINLPs. - [70] arXiv:2405.08381 [pdf, ps, other]
-
Title: On Instability Properties of the Fractional Calder\'{o}n ProblemComments: 37 pages, 1 figureSubjects: Analysis of PDEs (math.AP)
We prove exponential instability properties for the fractional Calderón problem and the conductivity formulation of the fractional Calderón problem in the regime of fractional powers $s\in (0,1)$. We particularly focus on two settings: First, we discuss instability properties in general domain geometries with scaling critical $L^{\frac{n}{2s}}$ potentials and constant background metrics. Secondly, we investigate instability properties in general geometries with $L^{\frac{n}{2s}}$ potentials and low regularity, variable coefficient, possibly anisotropic background metrics. In both settings we make use of the methods introduced in \cite{KRS21} and we deduce strong compression estimates for the forward problem. In the first setting this is based on analytic smoothing estimates for a suitable comparison operator while in the second setting involving low regularity metrics this is based on an iterated compression gain. We thus generalize the results from \cite{RS18} to generic geometries and variable coefficients and further also discuss the setting of fractional conductivity equations. In particular, this proves that the logarithmic stability estimates for the fractional Calderón problem from \cite{RS20} are optimal.
- [71] arXiv:2405.08383 [pdf, ps, other]
-
Title: Faithful Artin induction and the Chebotarev density theoremComments: 50 pagesSubjects: Number Theory (math.NT); Group Theory (math.GR)
Given a finite group G, we prove that the vector space spanned by the faithful irreducible characters of G is generated by the monomial characters in the vector space. As a consequence, we show that in any family of G-extensions of a fixed number field F, almost all are subject to a strong effective version of the Chebotarev density theorem. We use this version of the Chebotarev density theorem to deduce several consequences for class groups in families of number fields.
- [72] arXiv:2405.08384 [pdf, ps, other]
-
Title: Group Dispersal Modelling revisitedSubjects: Analysis of PDEs (math.AP); Probability (math.PR); Quantitative Methods (q-bio.QM)
In this paper we revisit the notion of grouped dispersal that have been introduced by Soubeyrand and co-authors \cite{soubeyrand2011patchy} to model the simultaneous (and hence dependent) dispersal of several propagules from a single source in a homogeneous environment. We built a time continuous measure valued process that takes into account the main feature of a grouped dispersal and derive its infinitesimal generator. To cope with the mutligeneration aspect associated to the demography we introduce two types of propagules in the description of the population which is one of the main innovations here. We also provide a rigorous description of the process and its generator. We derive as well, some large population asymptotics of the process unveilling the degenerate ultra parabolic system of PDE satisfied by the density of population. Finally, we also show that such a PDE system has a non-trivial solution which is unique in a certain functional space.
- [73] arXiv:2405.08389 [pdf, ps, other]
-
Title: The Grushin problem for Bismut's hypoelliptic LaplacianSubjects: Analysis of PDEs (math.AP)
We study in this article the combined asymptoticanalysis of Bismut's hypoelliptic Laplacian, in the high friction b $\rightarrow$ 0+ and possibly low temperature h $\rightarrow$ 0+ regimes.
- [74] arXiv:2405.08390 [pdf, ps, other]
-
Title: Weak solutions to the steady incompressible Euler equations with source termsSubjects: Analysis of PDEs (math.AP)
In this paper, we prove the non-uniqueness of stationary solutions to steady incompressible Euler equations with source terms. Based on the convex integration scheme developed by De Lellis and Székelyhidi, the Euler system is reformulated as a differential inclusion. The key point is to construct the corresponding plane-wave solutions via high frequency perturbations. Then we use iteration and Baire category argument to conclude that there exist a large amount of weak solutions with given energy profile.
- [75] arXiv:2405.08393 [pdf, ps, other]
-
Title: Gaussian measure on the dual of $\mathrm{U}(N)$, random partitions, and topological expansion of the partition functionThibaut Lemoine (CRIStAL), Mylène Maïda (LPP)Subjects: Mathematical Physics (math-ph); Probability (math.PR); Representation Theory (math.RT)
We study a Gaussian measure with parameter $q\in(0,1)$ on the dual of the unitary group of size $N$: we prove that a random highest weight under this measure is the coupling of two independent $q$-uniform random partitions $\alpha,\beta$ and a random highest weight of $\mathrm{U}(1)$. We prove deviation inequalities for the $q$-uniform measure, and use them to show that the coupling of random partitions under the Gaussian measure vanishes in the limit $N\to\infty$. We also prove that the partition function of this measure admits an asymptotic expansion in powers of $1/N$, and that this expansion is topological, in the sense that its coefficients are related to the enumeration of ramified coverings of elliptic curves. It provides a rigorous proof of the gauge/string duality for the Yang-Mills theory on a 2D torus with gauge group $\mathrm{U}(N),$ advocated by Gross and Taylor \cite{GT,GT2}.
- [76] arXiv:2405.08394 [pdf, ps, html, other]
-
Title: Weak solutions to the steady compressible Euler equations with source termsSubjects: Analysis of PDEs (math.AP)
In this paper, we showed that for some given suitable density and pressure, there exist infinitely many compactly supported solutions with prescribed energy profile. The proof is mainly based on the convex integration scheme. We construct suitable subsolutions and localized plane-wave solutions to the reformulated system, and weak solutions are obtained by iterating these subsolutions.
- [77] arXiv:2405.08396 [pdf, ps, other]
-
Title: Coupled-Band ESSFM for Low-Complexity DBPComments: The paper has been submitted for publication to ECOC 2024Subjects: Information Theory (cs.IT); Signal Processing (eess.SP)
We propose a novel digital backpropagation (DBP) technique that combines perturbation theory, subband processing, and splitting ratio optimization. We obtain 0.23 dB, 0.47 dB, or 0.91 dB gains w.r.t. dispersion compensation with only 74, 161, or 681 real multiplications/2D-symbol, improving significantly on existing DBP techniques.
- [78] arXiv:2405.08404 [pdf, ps, other]
-
Title: Genetic contribution of an advantaged mutant in the biparental Moran model -- finite selectionCamille Coron (INRAE), Yves Le JanSubjects: Probability (math.PR); Populations and Evolution (q-bio.PE)
We consider a population of N individuals, whose dynamics through time is represented by a biparental Moran model with two types: an advantaged type and a disadvantaged type. The advantage is due to a mutation, transmitted in a Mendelian way from parent to child that reduces the death probability of individuals carrying it. We assume that initially this mutation is carried by a proportion a of individuals in the population. Once the mutation is fixed, a gene is sampled uniformly in the population, at a locus independent of the locus under selection. We then give the probability that this gene initially comes from an advantaged individual, i.e. the genetic contribution of these individuals, as a function of a and when the population size is large.
- [79] arXiv:2405.08405 [pdf, ps, html, other]
-
Title: A constraint-based approach to function interpolation, with application to performance estimation for weakly convex optimisationSubjects: Optimization and Control (math.OC)
We propose a novel approach to obtain interpolation constraints for a wide range of function classes, i.e. necessary and sufficient constraints that a set of points, functions values and (sub)gradients must satisfy to ensure the existence of a global function of the class considered, consistent with this set. The derivation of such constraints is crucial for instance in the performance analysis of optimization methods, since obtaining a priori tight performance guarantees requires using a tight description of function classes of interest. Our method allows setting aside all analytic properties of the function class to work only at an algebraic level, and to easily obtain counterexamples when a condition characterizing a function class cannot serve as an interpolation constraint. As an illustration, we provide interpolation constraints for a class of non convex non smooth functions: weakly convex functions with bounded subgradients, and rely on these new interpolation constraints to outperform state of the art bounds on the performance of the subgradient method on this class.
- [80] arXiv:2405.08407 [pdf, ps, other]
-
Title: Global existence of small data weak solutions to the semilinear wave equations with time-dependent scale-invariant dampingComments: 32 pages, 2 figuresSubjects: Analysis of PDEs (math.AP)
In this paper, we are concerned with the global existence of small data weak solutions to the $n-$dimensional semilinear wave equation $\partial_t^2u-\Delta u+\frac{\mu}{t}\partial_tu=|u|^p$ with time-dependent scale-invariant damping, where $n\geq 2$, $t\geq 1$, $\mu\in(0,1)\cup(1,2]$ and $p>1$. This equation can be changed into the semilinear generalized Tricomi equation $\partial_t^2u-t^m\Delta u=t^{\alpha(m)}|u|^p$, where $m=m(\mu)>0$ and $\alpha(m)\in\Bbb R$ are two suitable constants. At first, for the more general semilinear Tricomi equation $\partial_t^2v-t^m\Delta v=t^{\alpha}|v|^p$ with any fixed constant $m>0$ and arbitrary parameter $\alpha\in\Bbb R$, we shall show that in the case of $\alpha\leq -2$, $n\geq 3$ and $p>1$, the small data weak solution $v$ exists globally; in the case of $\alpha>-2$, through determining the conformal exponent $p_{conf}(n,m,\alpha)>1$, the global small data weak solution $v$ exists when some extra restrictions of $p\geq p_{conf}(n,m,\alpha)$ are given. Returning to the original equation $\partial_t^2u-\Delta u+\frac{\mu}{t}\partial_tu=|u|^p$, the corresponding global existence results on the small data solution $u$ can be obtained.
- [81] arXiv:2405.08410 [pdf, ps, other]
-
Title: Classification of closed conformally flat Lorentzian manifolds with unipotent holonomyComments: 34 pages, 3 figuresSubjects: Differential Geometry (math.DG); Geometric Topology (math.GT)
We classify closed, conformally flat Lorentzian manifolds of dimension $n \geq 3$ with unipotent holonomy in PO(2,n). They are all Kleinian and fall into four different geometric types according to the intersection of the image of the developing map with a holonomy-invariant isotropic flag. They are homeomorphic to $S^{n-1} \times S^1$ or a nilmanifold of degree at most three, up to a finite cover. We classify those admitting an essential conformal flow; these fall into two geometric types, both homeomorphic to $S^{n-1} \times S^1$ up to finite cover.
- [82] arXiv:2405.08412 [pdf, ps, other]
-
Title: Compact bilinear operators and paraproducts revisitedComments: 14 pagesSubjects: Functional Analysis (math.FA); Classical Analysis and ODEs (math.CA)
We present a new proof of the compactness of bilinear paraproducts with CMO symbols. By drawing an analogy to compact linear operators, we first explore further properties of compact bilinear operators on Banach spaces and present examples. We then prove compactness of bilinear paraproducts with CMO symbols by combining one of the properties of compact bilinear operators thus obtained with vanishing Carleson measure estimates and interpolation of bilinear compactness.
- [83] arXiv:2405.08415 [pdf, ps, other]
-
Title: On the transcendentality condition for Gaussian Gabor framesComments: 7 pages. Comments welcomeSubjects: Functional Analysis (math.FA); Classical Analysis and ODEs (math.CA); Complex Variables (math.CV)
We give a criterion for higher-dimensional Gaussian Gabor frames, which is a reformulation of one of the main results in a previous article by the first and last authors in more explicit terms. We also show that this density criterion for Gaussian Gabor frames is generic in a certain sense.
- [84] arXiv:2405.08416 [pdf, ps, other]
-
Title: Compact $T(1)$ theorem \`a la SteinComments: 22 pagesSubjects: Functional Analysis (math.FA); Classical Analysis and ODEs (math.CA)
We prove a compact $T(1)$ theorem, involving quantitative estimates, analogous to the quantitative classical $T(1)$ theorem due to Stein. We also discuss the $C_c^\infty$-to-$CMO$ mapping properties of non-compact Calderón-Zygmund operators.
- [85] arXiv:2405.08422 [pdf, ps, other]
-
Title: Hereditary undecidability of fragments of some elementary theoriesComments: in Russian languageSubjects: Logic (math.LO)
It is well known that whenever a class of structures $\mathcal{K}_1$ is interpretable in a class of structures $\mathcal{K}_2$, then the hereditary undecidability of (a fragment of) the theory of $\mathcal{K}_1$ implies the hereditary undecidability of (a suitable fragment of) the theory of $\mathcal{K}_2$. In the present paper, we construct a $\Sigma_1$-interpretation of the class of all finite bipartite graphs in the class of all pairs of equivalence relations on the same finite domain; from this we obtain the hereditary undecidability of the $\Sigma_2$-theory of the second class. Next, we construct a $\Sigma_1$-interpretation of the class of all pairs of equivalence relations on the same finite domain in the class of all pairs consisting of a linear ordering and an equivalence relation on the same finite domain; this gives us the hereditary undecidability of the $\Sigma_2$-theory of the second class. The corresponding results are, in a sense, optimal, since the $\Pi_2$-theories of the classes under consideration are decidable.
Keywords: undecidability, elementary theories, prefix fragments - [86] arXiv:2405.08425 [pdf, ps, other]
-
Title: Rogers-Ramanujan identities in Statistical MechanicsComments: 19 pages, 3 figures. Chapter 11 of 32 in the book: CAMPBELL, G.B. Vector Partitions, Visible Points, and Ramanujan Functions, CRC Press, Taylor and Francis Group, Boca Raton, London, New York, A Chapman & Hall Book, ISBN: 978-1-032-00366-5 (hbk), ISBN: 978-1-032-00432-7 (pbk), ISBN: 978-1-003-17415-8 (ebk), to appear, June 2024Subjects: Mathematical Physics (math-ph); Combinatorics (math.CO); History and Overview (math.HO); Number Theory (math.NT)
We describe the story of the Rogers-Ramanujan identities; being known for 85 years and having about 130 pure mathematics proofs, suddenly entering physics when Rodney Baxter solved the Hard Hexagon Model in Statistical Mechanics in 1980. We next cover the accompanying proofs by George E Andrews of other related Baxter identities arisen of Rogers-Ramanujan type, leading into a new flourishing partnership of Physics and Mathematics. Our narrative goes into the subsequent 44 years, explaining the progress in physics and mathematical analysis. Finally we show some related crossovers with regard to the Elliptic q-gamma function and some Vector Partition generating functional equations; the latter of which may be new. The present paper is essentially chapter 11 of a 32 chapter book to appear in June 2024.
- [87] arXiv:2405.08426 [pdf, ps, other]
-
Title: On the universal and generalized orbifold Euler characteristicsSubjects: Algebraic Geometry (math.AG); Algebraic Topology (math.AT); Group Theory (math.GR)
We discuss the universal orbifold Euler characteristic and generalized orbifold Euler characteristics corresponding to finitely generated groups $A$ (the $A$-Euler characteristics). We show that the collection of all $A$-Euler characteristics for $A$ of the form $A'\times Z$ ($Z$ is the group of integers) with finite $A'$ determine the universal orbifold Euler characteristic.
- [88] arXiv:2405.08430 [pdf, ps, other]
-
Title: Conformal product structures on compact K\"ahler manifoldsComments: 16 pagesSubjects: Differential Geometry (math.DG)
A conformal product structure on a Riemannian manifold is a Weyl connection with reducible holonomy. We give the geometric description of all compact Kähler manifolds admitting conformal product structures
- [89] arXiv:2405.08432 [pdf, ps, html, other]
-
Title: Hochster's type formulaeSubjects: Commutative Algebra (math.AC)
We give an elementary proof and generalize some Hochsters's type formulae on local cohomology and Ext's of squarefree modules
- [90] arXiv:2405.08433 [pdf, ps, other]
-
Title: On finite groups in which the twisted conjugacy classes of the unit element are subgroupsSubjects: Group Theory (math.GR)
We consider groups $G$ such that the set $[G,\varphi]=\{g^{-1}g^{\varphi}|g\in G\}$ is a subgroup for every automorphism $\varphi$ of $G$, and we prove that there exists such a group $G$ that is finite and nilpotent of class $n$ for every $n\in\mathbb N$. Then there exists an infinite nonnilpotent group with the above property and the conjecture 18.14 of $[5]$ is false.
- [91] arXiv:2405.08442 [pdf, ps, other]
-
Title: Algorithmic aspects of left-orderings of solvable Baumslag--Solitar groups via its dynamical realizationSubjects: Logic (math.LO); Group Theory (math.GR)
We answer a question of Calderoni and Clay by showing that the conjugation equivalence relation of left orderings of the Baumslag-Solitar groups $\mathrm{BS}(1,n)$ is hyperfinite for any $n$. Our proof relies on a classification of $\mathrm{BS}(1,n)$'s left-orderings via its one-dimensional dynamical realizations. We furthermore use the effectiveness of the dynamical realizations of $\mathrm{BS}(1,n)$ to study algorithmic properties of the left-orderings on $\mathrm{BS}(1,n)$.
- [92] arXiv:2405.08444 [pdf, ps, other]
-
Title: Multi-dimensional piecewise contractions are asymptotically periodicComments: 23 pagesSubjects: Dynamical Systems (math.DS)
Piecewise contractions (PCs) are piecewise smooth maps that decrease distance between pair of points in the same domain of continuity. The dynamics of a variety of systems is described by PCs. During the last decade, a lot of effort has been devoted to proving that in parametrized families of one-dimensional PCs, the $\omega$-limit set of a typical PC consists of finitely many periodic orbits while there exist atypical PCs with Cantor $\omega$-limit sets. In this article, we extend these results to the multi-dimensional case. More precisely, we provide criteria to show that an arbitrary family $\{f_{\mu}\}_{\mu\in U}$ of locally bi-Lipschitz piecewise contractions $f_\mu:X\to X$ defined on a compact metric space $X$ is asymptotically periodic for Lebesgue almost every parameter $\mu$ running over an open subset $U$ of the $M$-dimensional Euclidean space $\mathbb{R}^M$. As a corollary of our results, we prove that piecewise affine contractions of $\mathbb{R}^d$ defined in generic polyhedral partitions are asymptotically periodic.
- [93] arXiv:2405.08446 [pdf, ps, other]
-
Title: A mean curvature type flow with capillary boundary in a horoball in hyperbolic spaceComments: 18 pages, 1 figureSubjects: Differential Geometry (math.DG)
In this paper, we study a mean curvature type flow with capillary boundary in a horoball in hyperbolic space. Our flow preserves the volume of the bounded domain enclosed by the hypersurface and monotonically decreases the energy functional. We show that it has the long time existence and converges to a truncated umbilical hypersurface in hyperbolic space. As an application, we solve an isoperimetric type problem for hypersurfaces with capillary boundary in a horoball.
- [94] arXiv:2405.08450 [pdf, ps, other]
-
Title: Effective Front-Descent Algorithms with Convergence GuaranteesSubjects: Optimization and Control (math.OC)
In this manuscript, we address continuous unconstrained optimization problems and we discuss descent type methods for the reconstruction of the Pareto set. Specifically, we analyze the class of Front Descent methods, which generalizes the Front Steepest Descent algorithm allowing the employment of suitable, effective search directions (e.g., Newton, Quasi-Newton, Barzilai-Borwein). We provide a deep characterization of the behavior and the mechanisms of the algorithmic framework, and we prove that, under reasonable assumptions, standard convergence results and some complexity bounds hold for the generalized approach. Moreover, we prove that popular search directions can indeed be soundly used within the framework. Then, we provide a completely novel type of convergence results, concerning the sequence of sets produced by the procedure. In particular, iterate sets are shown to asymptotically approach stationarity for all of their points; additionally, in finite precision settings, the sets are shown to only be enriched through exploration steps in later iterations, and suitable stopping conditions can be devised. Finally, the results from a large experimental benchmark show that the proposed class of approaches far outperforms state-of-the-art methodologies.
- [95] arXiv:2405.08461 [pdf, ps, other]
-
Title: Velocity-vorticity geometric constraints for the energy conservation of 3D ideal incompressible fluidsComments: 17 pagesSubjects: Analysis of PDEs (math.AP)
In this paper we consider the 3D Euler equations and we first prove a criterion for energy conservation for weak solutions with velocity satisfying additional assumptions in fractional Sobolev spaces with respect to the space variables, balanced by proper integrability with respect to time. Next, we apply the criterion to study the energy conservation of solution of the Beltrami type, carefully applying properties of products in (fractional and possibly negative) Sobolev spaces and employing a suitable bootstrap argument.
- [96] arXiv:2405.08471 [pdf, ps, other]
-
Title: Varieties of MV-monoids and positive MV-algebrasSubjects: Rings and Algebras (math.RA); Logic (math.LO)
In this paper we investigate MV-monoids and their subquasivarieties. MV-monoids are algebras $\langle A,\vee,\wedge, \oplus,\odot, 0,1\rangle$ where $\langle A, \vee, \wedge, 0, 1\rangle$ is a bounded distributive lattice, $\langle A, \oplus, 0 \rangle$ and $\langle A, \odot, 1\rangle$ are commutative monoids, and some further connecting axioms are satisfied. Every MV-algebra in the signature $\{\oplus,\neg,0\}$ is term equivalent to an algebra that has an MV-monoid as a reduct, by defining, as standard, $1:= \neg 0$, $x \odot y := \neg(\neg x \oplus\neg y)$, $x \vee y := (x \odot \neg y) \oplus y$ and $x \wedge y := \neg(\neg x \vee \neg y)$. Particular examples of MV-monoids are positive MV-algebras, i.e. the $\{\vee, \wedge, \oplus, \odot, 0, 1\}$-subreducts of MV-algebras. Positive MV-algebras form a peculiar quasivariety in the sense that, albeit having a logical motivation (being the quasivariety of subreducts of MV-algebras), it is not the equivalent quasivariety semantics of any logic. In this paper, we study the lattice of subvarieties of MV-monoids and describe the lattice of subvarieties of positive MV-algebras. We characterize the finite subdirectly irreducible positive MV-algebras. Furthermore, we axiomatize all varieties of positive MV-algebras.
- [97] arXiv:2405.08475 [pdf, ps, html, other]
-
Title: Representing Information on DNA using Patterns Induced by Enzymatic LabelingComments: Accepted to The IEEE International Symposium on Information Theory (ISIT) 2024Subjects: Information Theory (cs.IT)
Enzymatic DNA labeling is a powerful tool with applications in biochemistry, molecular biology, biotechnology, medical science, and genomic research. This paper contributes to the evolving field of DNA-based data storage by presenting a formal framework for modeling DNA labeling in strings, specifically tailored for data storage purposes. Our approach involves a known DNA molecule as a template for labeling, employing patterns induced by a set of designed labels to represent information. One hypothetical implementation can use CRISPR-Cas9 and gRNA reagents for labeling. Various aspects of the general labeling channel, including fixed-length labels, are explored, and upper bounds on the maximal size of the corresponding codes are given. The study includes the development of an efficient encoder-decoder pair that is proven optimal in terms of maximum code size under specific conditions.
- [98] arXiv:2405.08485 [pdf, ps, other]
-
Title: Doubly relaxed forward-Douglas--Rachford splitting for the sum of two nonconvex and a DC functionSubjects: Optimization and Control (math.OC)
In this paper, we consider a class of structured nonconvex nonsmooth optimization problems whose objective function is the sum of three nonconvex functions, one of which is expressed in a difference-of-convex (DC) form. This problem class covers several important structures in the literature including the sum of three functions and the general DC program. We propose a splitting algorithm and prove the subsequential convergence to a stationary point of the problem. The full sequential convergence, along with convergence rates for both the iterates and objective function values, is then established without requiring differentiability of the concave part. Our analysis not only extends but also unifies and improves recent convergence analyses in nonconvex settings. We benchmark our proposed algorithm with notable algorithms in the literature to show its competitiveness on both synthetic data and real power system load data.
- [99] arXiv:2405.08488 [pdf, ps, other]
-
Title: Metastable hierarchy in abstract low-temperature lattice models: an application to Kawasaki dynamics for Ising lattice gas with macrscopic number of particlesComments: 54 pages, 18 figures and 5 tablesSubjects: Probability (math.PR)
This article is divided into two parts. In the first part, we study the hierarchical phenomenon of metastability in low-temperature lattice models in the most general setting. Given an abstract dynamical system governed by a Hamiltonian function, we prove that there exists a hierarchical decomposition of the collection of stable plateaux in the system into multiple $\mathfrak{m}$ levels, such that at each level there exist tunneling metastable transitions between the stable plateaux, which can be characterized by convergence to a simple Markov chain as the inverse temperature $\beta$ tends to infinity. In the second part, as an application, we characterize the $3$-level metastable hierarchy in Kawasaki dynamics for Ising lattice gas with macroscopic number of particles. We prove that the ground states in this model are those in which the particles line up and form a one-dimensional strip, and identify the full structure relevant to the tunneling transitions between these ground states. In particular, the results differ from the previous work [5] in that the particles in the ground states are likely to form a strip rather than a square droplet. The main tool is the resolvent approach to metastability, recently developed in [24]. Along with the analysis, we present a theorem on the sharp asymptotics of the exit distribution from cycles, which to the author's knowledge is not known in the community and therefore may be of independent interest.
- [100] arXiv:2405.08490 [pdf, ps, other]
-
Title: The ring of differential operators on a monomial curve is a Hopf algebroidSubjects: Quantum Algebra (math.QA); Commutative Algebra (math.AC); Rings and Algebras (math.RA)
This article considers cuspidal curves whose coordinate rings are numerical semigroup algebras. Their rings of differential operators are shown to be cocommutative and conilpotent left Hopf algebroids. If the semigroups are symmetric so that the curves are Gorenstein, they are full Hopf algebroids (admit an antipode).
- [101] arXiv:2405.08501 [pdf, ps, other]
-
Title: Similarity of Matrices over Complete Discrete Valuation RingSubjects: Number Theory (math.NT)
In this paper, we utilize linear algebra to investigate quadratic integral extensions over a complete discrete valuation ring and to classify all of their ideals. With this classification, we establish that the similarity of $2\times2$ matrices with coefficients in a certain complete discrete valuation ring can be ascertained in a larger ring of algebraic integers.
- [102] arXiv:2405.08506 [pdf, ps, html, other]
-
Title: From linear programming to colliding particlesComments: 20 pages, 13 figuresSubjects: Combinatorics (math.CO)
Although simplices are trivial from a linear optimization standpoint, the simplex algorithm can exhibit quite complex behavior. In this paper we study the behavior of max-slope pivot rules on (products of) simplices and describe the associated pivot rule polytopes. For simplices, the pivot rule polytopes are combinatorially isomorphic to associahedra. To prove this correspondence, we interpret max-slope pivot rules in terms of the combinatorics of colliding particles on a line. For prisms over simplices, we recover Stasheff's multiplihedra. For products of two simplices we get new realizations of constrainahedra, that capture the combinatorics of certain particle systems in the plane.
- [103] arXiv:2405.08509 [pdf, ps, html, other]
-
Title: Markoff-Fibonacci m-triplesSubjects: Number Theory (math.NT)
We classify all solution triples with Fibonacci components to the equation $a^2+b^2+c^2=3abc+m,$ for positive $m$. We show that for $m=2$ they are precisely $(1,F(b),F(b+2))$, with even $b$; for $m=21$, there exist exactly two Fibonacci solutions $(1,2,8)$ and $(2,2,13)$ and for any other $m$ there exists at most one Fibonacci solution, which, in case it exists, is always minimal (i.e. it is a root of a Markoff tree). Moreover, we show that there is an infinite number of values of $m$ admitting exactly one such solution.
- [104] arXiv:2405.08513 [pdf, ps, other]
-
Title: Subspace method based on neural networks for solving the partial differential equation in weak formComments: arXiv admin note: substantial text overlap with arXiv:2404.08223Subjects: Numerical Analysis (math.NA)
We present a subspace method based on neural networks for solving the partial differential equation in weak form with high accuracy. The basic idea of our method is to use some functions based on neural networks as base functions to span a subspace, then find an approximate solution in this subspace. Training base functions and finding an approximate solution can be separated, that is different methods can be used to train these base functions, and different methods can also be used to find an approximate solution. In this paper, we find an approximate solution of the partial differential equation in the weak form. Our method can achieve high accuracy with low cost of training. Numerical examples show that the cost of training these base functions is low, and only one hundred to two thousand epochs are needed for most tests. The error of our method can fall below the level of $10^{-7}$ for some tests. The proposed method has the better performance in terms of the accuracy and computational cost.
- [105] arXiv:2405.08518 [pdf, ps, other]
-
Title: Cryptography-Based Privacy-Preserving Method for Distributed Optimization over Time-Varying Directed Graphs with Enhanced EfficiencySubjects: Optimization and Control (math.OC)
In this paper, we study the privacy-preserving distributed optimization problem, aiming to prevent attackers from stealing the private information of agents. For this purpose, we propose a novel privacy-preserving algorithm based on the Advanced Encryption Standard (AES), which is both secure and computationally efficient. By appropriately constructing the underlying weight matrices, our algorithm can be applied to time-varying directed networks. We show that the proposed algorithm can protect an agent's privacy if the agent has at least one legitimate neighbor at the initial iteration. Under the assumption that the objective function is strongly convex and Lipschitz smooth, we rigorously prove that the proposed algorithm has a linear convergence rate. Finally, the effectiveness of the proposed algorithm is demonstrated by numerical simulations of the canonical sensor fusion problem.
- [106] arXiv:2405.08520 [pdf, ps, other]
-
Title: Empowering Programmable Wireless Environments with Optical Anchor-based PositioningDimitrios Tyrovolas, Dimitrios Bozanis, Sotiris A. Tegos, Vasilis K. Papanikolaou, Panagiotis D. Diamantoulakis, Christos K. Liaskos, Robert Schober, George K. KaragiannidisSubjects: Information Theory (cs.IT)
The evolution toward sixth-generation (6G) wireless networks has introduced programmable wireless environments (PWEs) and reconfigurable intelligent surfaces (RISs) as transformative elements for achieving near-deterministic wireless communications. However, the enhanced capabilities of RISs within PWEs, especially as we move toward more complex electromagnetic functions by increasing the number of reflecting elements, underscore the need for high-precision user localization, since inaccurate localization could lead to erroneous configuration of RISs, which would then compromise the effectiveness of PWEs. In this direction, this paper investigates the integration of RISs and optical anchors within PWEs, emphasizing the crucial role of ultra-precise localization in unlocking advanced electromagnetic functionalities. Specifically, we present an in-depth analysis of various localization techniques, both RISbased and RIS-independent, while introducing the concept of empowering PWEs with optical anchors for enhanced localization precision. Our findings highlight that accurate localization is essential to fully exploit the capabilities of RISs, paving the way for future applications. Through this exploration, we contribute to the advancement of PWEs in line with the ambitious goals of the 6G standards and improve the quality of service in next generation wireless networks.
- [107] arXiv:2405.08524 [pdf, ps, other]
-
Title: The Asymptotic Properties of the Extreme Eigenvectors of High-dimensional Generalized Spiked Covariance ModelSubjects: Statistics Theory (math.ST)
In this paper, we investigate the asymptotic behaviors of the extreme eigenvectors in a general spiked covariance matrix, where the dimension and sample size increase proportionally. We eliminate the restrictive assumption of the block diagonal structure in the population covariance matrix. Moreover, there is no requirement for the spiked eigenvalues and the 4th moment to be bounded. Specifically, we apply random matrix theory to derive the convergence and limiting distributions of certain projections of the extreme eigenvectors in a large sample covariance matrix within a generalized spiked population model. Furthermore, our techniques are robust and effective, even when spiked eigenvalues differ significantly in magnitude from nonspiked ones. Finally, we propose a powerful statistic for hypothesis testing for the eigenspaces of covariance matrices.
- [108] arXiv:2405.08532 [pdf, ps, other]
-
Title: A dynamical view of Tijdeman's solution of the chairman assignment problemValérie Berthé (IRIF (UMR\_8243)), Olivier Carton (IRIF (UMR\_8243)), Nicolas Chevallier (IRIMAS), Wolfgang Steiner (IRIF (UMR\_8243)), Reem YassawiSubjects: Dynamical Systems (math.DS)
In 1980, R. Tijdeman provided an on-line algorithm that generates sequences over a finite alphabet with minimal discrepancy, that is, such that the occurrence of each letter optimally tracks its frequency. In this article, we define discrete dynamical systems generating these sequences. The dynamical systems are defined as exchanges of polytopal pieces, yielding cut and project schemes, and they code tilings of the line whose sets of vertices form model sets. We prove that these sequences of low discrepancy are natural codings of toral translations with respect to polytopal atoms, and that they generate a minimal and uniquely ergodic subshift with purely discrete spectrum. Finally, we show that the factor complexity of these sequences is of polynomial growth order $n^{d-1}$, where $d$ is the cardinality of the alphabet.
- [109] arXiv:2405.08537 [pdf, ps, other]
-
Title: GS-PINN: Greedy Sampling for Parameter Estimation in Partial Differential EquationsSubjects: Dynamical Systems (math.DS)
Partial differential equation parameter estimation is a mathematical and computational process used to estimate the unknown parameters in a partial differential equation model from observational data. This paper employs a greedy sampling approach based on the Discrete Empirical Interpolation Method to identify the most informative samples in a dataset associated with a partial differential equation to estimate its parameters. Greedy samples are used to train a physics-informed neural network architecture which maps the nonlinear relation between spatio-temporal data and the measured values. To prove the impact of greedy samples on the training of the physics-informed neural network for parameter estimation of a partial differential equation, their performance is compared with random samples taken from the given dataset. Our simulation results show that for all considered partial differential equations, greedy samples outperform random samples, i.e., we can estimate parameters with a significantly lower number of samples while simultaneously reducing the relative estimation error. A Python package is also prepared to support different phases of the proposed algorithm, including data prepossessing, greedy sampling, neural network training, and comparison.
- [110] arXiv:2405.08544 [pdf, ps, other]
-
Title: On the characterization of Riemannian warped product Einstein metricsSubjects: Differential Geometry (math.DG)
We present a series of results, including local characterizations of $(\lambda,m+n)$-Einstein metrics in the context of warped product Einstein spaces. Using these local properties, we restate already known global characterizations of $(\lambda,m+n)$-Einstein manifolds from He, Petersen and Wylie.
- [111] arXiv:2405.08549 [pdf, ps, html, other]
-
Title: A Well-Balanced Method for an Unstaggered Central Scheme, the one-space Dimensional CaseSubjects: Numerical Analysis (math.NA)
In this paper, we propose a new MUSCL scheme by combining the ideas of the Kurganov and Tadmor scheme and the so-called Deviation method which results in a well-balanced finite volume method for the hyperbolic balance laws, by evolving the difference between the exact solution and a given stationary solution. After that, we derive a semi-discrete scheme from this new scheme and it can be shown to be essentially TVD when applied to a scalar conservation law. In the end, we apply and validate the developed methods by numerical experiments and solve classical problems featuring Euler equations with gravitational source term.
- [112] arXiv:2405.08552 [pdf, ps, html, other]
-
Title: A conjecture of Zhi-Wei Sun on matrices concerning multiplicative subgroups of finite fieldsSubjects: Number Theory (math.NT)
Motivated by the recent work of Zhi-Wei Sun on determinants involving the Legendre symbol, in this paper, we study some matrices concerning subgroups of finite fields.
For example, let $q\equiv 3\pmod 4$ be an odd prime power and let $\phi$ be the unique quadratic multiplicative character of the finite field $\mathbb{F}_q$. If set $\{s_1,\cdots,s_{(q-1)/2}\}=\{x^2:\ x\in\mathbb{F}_q\setminus\{0\}\}$, then we prove that
$$\det\left[t+\phi(s_i+s_j)+\phi(s_i-s_j)\right]_{1\le i,j\le (q-1)/2}=\left(\frac{q-1}{2}t-1\right)q^{\frac{q-3}{4}}.$$
This confirms a conjecture of Zhi-Wei Sun. - [113] arXiv:2405.08558 [pdf, ps, other]
-
Title: PTPI-DL-ROMs: pre-trained physics-informed deep learning-based reduced order models for nonlinear parametrized PDEsComments: 38 pagesSubjects: Numerical Analysis (math.NA); Machine Learning (cs.LG)
The coupling of Proper Orthogonal Decomposition (POD) and deep learning-based ROMs (DL-ROMs) has proved to be a successful strategy to construct non-intrusive, highly accurate, surrogates for the real time solution of parametric nonlinear time-dependent PDEs. Inexpensive to evaluate, POD-DL-ROMs are also relatively fast to train, thanks to their limited complexity. However, POD-DL-ROMs account for the physical laws governing the problem at hand only through the training data, that are usually obtained through a full order model (FOM) relying on a high-fidelity discretization of the underlying equations. Moreover, the accuracy of POD-DL-ROMs strongly depends on the amount of available data. In this paper, we consider a major extension of POD-DL-ROMs by enforcing the fulfillment of the governing physical laws in the training process -- that is, by making them physics-informed -- to compensate for possible scarce and/or unavailable data and improve the overall reliability. To do that, we first complement POD-DL-ROMs with a trunk net architecture, endowing them with the ability to compute the problem's solution at every point in the spatial domain, and ultimately enabling a seamless computation of the physics-based loss by means of the strong continuous formulation. Then, we introduce an efficient training strategy that limits the notorious computational burden entailed by a physics-informed training phase. In particular, we take advantage of the few available data to develop a low-cost pre-training procedure; then, we fine-tune the architecture in order to further improve the prediction reliability. Accuracy and efficiency of the resulting pre-trained physics-informed DL-ROMs (PTPI-DL-ROMs) are then assessed on a set of test cases ranging from non-affinely parametrized advection-diffusion-reaction equations, to nonlinear problems like the Navier-Stokes equations for fluid flows.
- [114] arXiv:2405.08559 [pdf, ps, other]
-
Title: A bridge connecting convex analysis and complex analysis and $L^2$-estimate of $d$ and $\bar\partial$Comments: arXiv admin note: text overlap with arXiv:2403.19152Subjects: Complex Variables (math.CV)
We propose a way to connect complex analysis and convex analysis. As applications, we derive some results about $L^2$-estimate for $d$-equation and prove some curvature positivity related to convex analysis from well known $L^2$-estimate for $\bar\partial$-equation or the results we prove in complex analysis.
- [115] arXiv:2405.08561 [pdf, ps, other]
-
Title: Minimax and maximin problems for sums of translates on the real axisSubjects: Classical Analysis and ODEs (math.CA)
Sums of translates generalize logarithms of weighted algebraic polynomials. The paper presents the solution to the minimax and maximin problems on the real axis for sums of translates. We prove that there is a unique function that is extremal in both problems. The key in our proof is a reduction to the problem on a segment. For this, we work out an analogue of the Mhaskar-Rakhmanov-Saff theorem, too.
- [116] arXiv:2405.08565 [pdf, ps, other]
-
Title: Space-time stochastic Galerkin boundary elements for acoustic scattering problemsComments: 30 pages, 14 figures, to appear in International Journal for Numerical Methods in EngineeringSubjects: Numerical Analysis (math.NA); Computational Engineering, Finance, and Science (cs.CE)
Acoustic emission or scattering problems naturally involve uncertainties about the sound sources or boundary conditions. This article initiates the study of time domain boundary elements for such stochastic boundary problems for the acoustic wave equation. We present a space-time stochastic Galerkin boundary element method which is applied to sound-hard, sound-soft and absorbing scatterers. Uncertainties in both the sources and the boundary conditions are considered using a polynomial chaos expansion. The numerical experiments illustrate the performance and convergence of the proposed method in model problems and present an application to a problem from traffic noise.
- [117] arXiv:2405.08566 [pdf, ps, other]
-
Title: Space-time boundary elements for frictional contact in elastodynamicsComments: 34 pages, 14 figures, to appear in Computer Methods in Applied Mechanics and EngineeringSubjects: Numerical Analysis (math.NA); Computational Engineering, Finance, and Science (cs.CE)
This article studies a boundary element method for dynamic frictional contact between linearly elastic bodies. We formulate these problems as a variational inequality on the boundary, involving the elastodynamic Poincaré-Steklov operator. The variational inequality is solved in a mixed formulation using boundary elements in space and time. In the model problem of unilateral Tresca friction contact with a rigid obstacle we obtain an a priori estimate for the resulting Galerkin approximations. Numerical experiments in two space dimensions demonstrate the stability, energy conservation and convergence of the proposed method for contact problems involving concrete and steel in the linearly elastic regime. They address both unilateral and two-sided dynamic contact with Tresca or Coulomb friction.
- [118] arXiv:2405.08575 [pdf, ps, other]
-
Title: Complexity of codes for Ramsey positive setsSubjects: Logic (math.LO)
Sabok showed that the set of codes for $G_\delta$ Ramsey positive subsets of $[\omega]^\omega$ is $\mathbf{\Sigma}^1_2$-complete. We extend this result by providing sufficient conditions for the set of codes for $G_\delta$ Ramsey positive subsets of an arbitrary topological Ramsey space to be $\mathbf{\Sigma}^1_2$-complete.
- [119] arXiv:2405.08583 [pdf, ps, other]
-
Title: On the $\sigma$-balancing property of multivariate generalized quasi-arithmetic meansSubjects: Classical Analysis and ODEs (math.CA)
The aim of this paper is to characterize the so-called $\sigma$-balancing property in the class of generalized quasi-arithmetic means. In general, the question is whether those elements of a given family of means that possess this property are quasi-arithmetic.
The first result in the latter direction is due to G. Aumann who showed that a balanced complex mean is necessariliy quasi-arithmetic provided that it is analytic. Then Aumann characterized quasi-arithmetic means among Cauchy means in terms of the balancing property. These results date back to the 1930s. In 2015, Lucio R. Berrone, generalizing balancedness, concluded that a mean having that more general property is quasi-arithmetic if it is symmetric, strict and continuously differentiable. A common feature of these results is that they assume a certain order of differentiability of the mean whether or not it is a natural condition.
In 2020, the balancing property was characterized in the family of generalized quasi-arithmetic means of two variables under only natural conditions, namely continuity and strict monotonicity of their generating functions. Here we extend the corresponding result for multivariate generalized quasi-arithmetic means by relaxing the conditions on the generating functions and considering the more general $\sigma$-balancing property. - [120] arXiv:2405.08584 [pdf, ps, other]
-
Title: When Do Low-Rate Concatenated Codes Approach The Gilbert-Varshamov Bound?Subjects: Information Theory (cs.IT); Computational Complexity (cs.CC)
The Gilbert--Varshamov (GV) bound is a classical existential result in coding theory. It implies that a random linear binary code of rate $\epsilon^2$ has relative distance at least $\frac{1}{2} - O(\epsilon)$ with high probability. However, it is a major challenge to construct explicit codes with similar parameters.
One hope to derandomize the Gilbert--Varshamov construction is with code concatenation: We begin with a (hopefully explicit) outer code ${C}_\mathrm{out}$ over a large alphabet, and concatenate that with a small binary random linear code ${C}_\mathrm{in}$. It is known that when we use \emph{independent} small codes for each coordinate, then the result lies on the GV bound with high probability, but this still uses a lot of randomness. In this paper, we consider the question of whether code concatenation with a single random linear inner code ${C}_\mathrm{in}$ can lie on the GV bound; and if so what conditions on ${C}_\mathrm{out}$ are sufficient for this.
We show that first, there do exist linear outer codes ${C}_\mathrm{out}$ that are "good" for concatenation in this sense (in fact, most linear codes codes are good). We also provide two sufficient conditions for ${C}_\mathrm{out}$, so that if ${C}_\mathrm{out}$ satisfies these, ${C}_\mathrm{out}\circ {C}_\mathrm{in}$ will likely lie on the GV bound. We hope that these conditions may inspire future work towards constructing explicit codes ${C}_\mathrm{out}$. - [121] arXiv:2405.08590 [pdf, ps, other]
-
Title: Accelerated Alternating Direction Method of Multipliers Gradient Tracking for Distributed OptimizationComments: This paper has been accepted for publication at IEEE Control Systems LettersSubjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
This paper presents a novel accelerated distributed algorithm for unconstrained consensus optimization over static undirected networks. The proposed algorithm combines the benefits of acceleration from momentum, the robustness of the alternating direction method of multipliers, and the computational efficiency of gradient tracking to surpass existing state-of-the-art methods in convergence speed, while preserving their computational and communication cost. First, we prove that, by applying momentum on the average dynamic consensus protocol over the estimates and gradient, we can study the algorithm as an interconnection of two singularly perturbed systems: the outer system connects the consensus variables and the optimization variables, and the inner system connects the estimates of the optimum and the auxiliary optimization variables. Next, we prove that, by adding momentum to the auxiliary dynamics, our algorithm always achieves faster convergence than the achievable linear convergence rate for the non-accelerated alternating direction method of multipliers gradient tracking algorithm case. Through simulations, we numerically show that our accelerated algorithm surpasses the existing accelerated and non-accelerated distributed consensus first-order optimization protocols in convergence speed.
- [122] arXiv:2405.08592 [pdf, ps, other]
-
Title: Horocycle flows on abelian covers of surfaces of negative curvatureComments: 40 pagesSubjects: Dynamical Systems (math.DS)
We consider the unit speed parametrization of the horocycle flow on infinite Abelian covers of compact surfaces of negative curvature. We prove an asymptotic result for the ergodic integrals of sufficiently regular functions. In the case of constant curvature, where the unit speed and the uniformly contracting parametrizations of horocycles coincide, we recover a result by Ledrappier and Sarig. Our method, which does not use symbolic dynamics, is based on a general Fourier decomposition for Abelian covers and on the study of spectral theory of weighted (and twisted) transfer operators for the geodesic flow acting on appropriate anisotropic Banach spaces.
- [123] arXiv:2405.08598 [pdf, ps, other]
-
Title: Spectral approximation of convolution operators of Fredholm typeSubjects: Numerical Analysis (math.NA)
We have developed a method for constructing spectral approximations for convolution operators of Fredholm type. The algorithm we propose is numerically stable and takes advantage of the recurrence relations satisfied by the entries of such a matrix approximation. When used for computing the Fredholm convolution of two given functions, such approximations produce the convolution more rapidly than the state-of-the-art methods. The proposed approximation also leads to a spectral method for solving the Fredholm convolution integral equations and enables the computation of eigenvalues and pseudospectra of Fredholm convolution operators, which is otherwise intractable with existing techniques.
- [124] arXiv:2405.08600 [pdf, ps, other]
-
Title: Stabilization and Optimal Control of Interconnected SDE - Scalar PDE SystemComments: 8 pages, 1 figure. Submitted to L-CSS journal jointly with CDC conferenceSubjects: Optimization and Control (math.OC)
In this paper, we design a controller for an interconnected system consisting of a linear Stochastic Differential Equation (SDE) actuated through a linear hyperbolic Partial Differential Equation (PDE). Our approach aims to minimize the variance of the state of the SDE component. We leverage a backstepping technique to transform the original PDE into an uncoupled stochastic PDE. As such, we reformulate our initial problem as the control of a delayed SDE with a non-deterministic drift. Under standard controllability assumptions, we design a controller steering the mean of the states to zero while keeping its covariance bounded. As final step, we address the optimal control of the delayed SDE employing Artstein's transformation and Linear Quadratic stochastic control techniques.
- [125] arXiv:2405.08605 [pdf, ps, other]
-
Title: On gradient estimates of the heat semigroups on step-two Carnot groupsSubjects: Analysis of PDEs (math.AP)
In this work, we give a sufficient condition for a step-two Carnot group to satisfy the quasi Bakry-Émery curvature condition. As an application, we establish the gradient estimate for the heat semigroup on the free step-two Carnot group with three generators $N_{3,2}$. Moreover, high order gradient estimates and the Riemannian counterparts are also deduced under an extra condition.
- [126] arXiv:2405.08608 [pdf, ps, other]
-
Title: On the Paley RIP and Paley graph extractorComments: 10 pages, comments are welcomeSubjects: Combinatorics (math.CO); Discrete Mathematics (cs.DM); Information Theory (cs.IT); Number Theory (math.NT)
Constructing explicit RIP matrices is an open problem in compressed sensing theory. In particular, it is quite challenging to construct explicit RIP matrices that break the square-root bottleneck. On the other hand, providing explicit $2$-source extractors is a fundamental problem in theoretical computer science, cryptography and combinatorics. Nowadays, there are only a few known constructions for explicit $2$-source extractors (with negligible errors) that break the half barrier for min-entropy.
In this paper, we establish a new connection between RIP matrices breaking the square-root bottleneck and $2$-source extractors breaking the half barrier for min-entropy. Here we focus on an RIP matrix (called the Paley ETF) and a $2$-source extractor (called the Paley graph extractor), where both are defined from quadratic residues over the finite field of odd prime order $p\equiv 1 \pmod{4}$. As a main result, we prove that if the Paley ETF breaks the square-root bottleneck, then the Paley graph extractor breaks the half barrier for min-entropy as well. Since it is widely believed that the Paley ETF breaks the square-root bottleneck, our result accordingly provides a new affirmative intuition on the conjecture for the Paley graph extractor by Benny Chor and Oded Goldreich. - [127] arXiv:2405.08613 [pdf, ps, other]
-
Title: GN-SINDy: Greedy Sampling Neural Network in Sparse Identification of Nonlinear Partial Differential EquationsSubjects: Dynamical Systems (math.DS); Machine Learning (cs.LG)
The sparse identification of nonlinear dynamical systems (SINDy) is a data-driven technique employed for uncovering and representing the fundamental dynamics of intricate systems based on observational data. However, a primary obstacle in the discovery of models for nonlinear partial differential equations (PDEs) lies in addressing the challenges posed by the curse of dimensionality and large datasets. Consequently, the strategic selection of the most informative samples within a given dataset plays a crucial role in reducing computational costs and enhancing the effectiveness of SINDy-based algorithms. To this aim, we employ a greedy sampling approach to the snapshot matrix of a PDE to obtain its valuable samples, which are suitable to train a deep neural network (DNN) in a SINDy framework. SINDy based algorithms often consist of a data collection unit, constructing a dictionary of basis functions, computing the time derivative, and solving a sparse identification problem which ends to regularised least squares minimization. In this paper, we extend the results of a SINDy based deep learning model discovery (DeePyMoD) approach by integrating greedy sampling technique in its data collection unit and new sparsity promoting algorithms in the least squares minimization unit. In this regard we introduce the greedy sampling neural network in sparse identification of nonlinear partial differential equations (GN-SINDy) which blends a greedy sampling method, the DNN, and the SINDy algorithm. In the implementation phase, to show the effectiveness of GN-SINDy, we compare its results with DeePyMoD by using a Python package that is prepared for this purpose on numerous PDE discovery
- [128] arXiv:2405.08615 [pdf, ps, other]
-
Title: Drazin and g-Drazin invertibility of combinations of three Banach algebra elementsSubjects: Functional Analysis (math.FA)
Consider a complex unital Banach algebra $\mathcal{A}.$ For $x_1,x_2,x_3\in\mathcal{A},$ in this paper, we establish that under certain assumptions on $x_1,x_2,x_3$, Drazin (resp. g-Drazin) invertibility of any three elements among $x_1,x_2,x_3$ and $x_1+x_2+x_3\text{ }(\text{or }x_1x_2+x_1x_3+x_2x_3)$ ensure the Drazin (resp. g-Drazin) invertibility of the remaining one. As a consequence for two idempotents $p,q\in\mathcal{A},$ this result indicates the equivalence between Drazin (resp. g-Drazin) invertibility of $$\lambda_1p+\gamma_1q-\lambda_1pq+\lambda_2\left(pqp-(pq)^2\right)+\cdots+\lambda_m\left((pq)^{m-1}p-(pq)^m\right)$$ and $$\lambda_1-\lambda_1pq+\lambda_2\left(pqp-(pq)^2\right)+\cdots+\lambda_m\left((pq)^{m-1}p-(pq)^m\right),$$ where $\gamma_1,\lambda_i\in\mathbb{C}$ for $i=1,2,\cdots,m,$ with $\lambda_1\gamma_1\neq0.$ Furthermore, for $x_1,x_2$, we establish that the Drazin (resp. g-Drazin) invertibility of any two elements among $x_1,x_2$ and $x_1+x_2$ indicates the Drazin (resp. g-Drazin) invertibility of the remaining one, provided that $x_1x_2=\alpha(x_1+x_2)$ for some $\alpha\in\mathbb{C}$. Additionally, if it exists, we furnish a new formula to represent the Drazin (resp. g-Drazin) inverse of any element among $x_1,x_2$ and $x_1+x_2$, by using the other two elements and their Drazin (resp. g-Drazin) inverse.
- [129] arXiv:2405.08618 [pdf, ps, html, other]
-
Title: The one-dimensional Coulomb Hamiltonian: Properties of its Birman-Schwinger operatorComments: 17 pages, no figures, no tablesSubjects: Mathematical Physics (math-ph); Functional Analysis (math.FA); Quantum Physics (quant-ph)
We study the Birman-Schwinger operator for a self-adjoint realisation of the one-dimensional Hamiltonian with the Coulomb potential. We study both the case in which this Hamiltonian is defined on the whole real line and when it is only defined on the positive semiaxis. In both cases, the Birman-Schwinger operator is Hilbert-Schmidt, even though it is not trace class. Then, we have considered some approximations to the Hamiltonian depending on a positive parameter, under given conditions, and proved the convergence of the Birman-Schwinger operators of these approximations to the original Hamiltonian as the parameter goes to zero. Further comments and results have been included.
- [130] arXiv:2405.08620 [pdf, ps, html, other]
-
Title: Ruijsenaars duality for B, C, D Toda chainsComments: 28 pagesSubjects: Mathematical Physics (math-ph); Exactly Solvable and Integrable Systems (nlin.SI)
We use the Hamiltonian reduction method to construct the Ruijsenaars dual systems to generalized Toda chains associated with the classical Lie algebras of types $B, C, D$. The dual systems turn out to be the $B, C$ and $D$ analogues of the rational Goldfish model, which is, as in the type $A$ case, the strong coupling limit of rational Ruijsenaars systems. We explain how both types of systems emerge in the reduction of the cotangent bundle of a Lie group and provide the formulae for dual Hamiltonians. We compute explicitly the higher Hamiltonians of Goldfish models using the Cauchy--Binet theorem.
- [131] arXiv:2405.08622 [pdf, ps, other]
-
Title: On sections of complex line bundles over surfaces minimizing a Ginzburg-Landau energySubjects: Analysis of PDEs (math.AP)
In this work we extend some of the results of Ignat and Jerrard for Ginzburg-Landau vortices of tangent vector fields on two-dimensional Riemannian manifolds to the setting of complex hermitian line bundles. In particular, we elucidate the locations of vortices for the cases of Q-tensors and their higher-rank analogs on a sphere.
- [132] arXiv:2405.08623 [pdf, ps, other]
-
Title: Accuracy of the Graphon Mean Field Approximation for Interacting Particle SystemsComments: preprintSubjects: Probability (math.PR)
We consider a system of $N$ particles whose interactions are characterized by a (weighted) graph $G^N$. Each particle is a node of the graph with an internal state. The state changes according to Markovian dynamics that depend on the states and connection to other particles. We study the limiting properties, focusing on the dense graph regime, where the number of neighbors of a given node grows with $N$. We show that when $G^N$ converges to a graphon $G$, the behavior of the system converges to a deterministic limit, the graphon mean field approximation. We obtain convergence rates depending on the system size $N$ and cut-norm distance between $G^N$ and $G$. We apply the results for two subcases: When $G^N$ is a discretization of the graph $G$ with individually weighted edges; when $G^N$ is a random graph obtained through edge sampling from the graphon $G$. In the case of weighted interactions, we obtain a bound of order $O(1/N)$. In the random graph case, the error is of order $O(\sqrt{\log(N)/N})$ with high probability. We illustrate the applicability of our results and the numerical efficiency of the approximation through two examples: a graph-based load-balancing model and a heterogeneous bike-sharing system.
- [133] arXiv:2405.08625 [pdf, ps, html, other]
-
Title: Optimal Almost-Balanced SequencesComments: Accepted to The IEEE International Symposium on Information Theory (ISIT) 2024Subjects: Information Theory (cs.IT)
This paper presents a novel approach to address the constrained coding challenge of generating almost-balanced sequences. While strictly balanced sequences have been well studied in the past, the problem of designing efficient algorithms with small redundancy, preferably constant or even a single bit, for almost balanced sequences has remained unsolved. A sequence is $\varepsilon(n)$-almost balanced if its Hamming weight is between $0.5n\pm \varepsilon(n)$. It is known that for any algorithm with a constant number of bits, $\varepsilon(n)$ has to be in the order of $\Theta(\sqrt{n})$, with $O(n)$ average time complexity. However, prior solutions with a single redundancy bit required $\varepsilon(n)$ to be a linear shift from $n/2$. Employing an iterative method and arithmetic coding, our emphasis lies in constructing almost balanced codes with a single redundancy bit. Notably, our method surpasses previous approaches by achieving the optimal balanced order of $\Theta(\sqrt{n})$. Additionally, we extend our method to the non-binary case considering $q$-ary almost polarity-balanced sequences for even $q$, and almost symbol-balanced for $q=4$. Our work marks the first asymptotically optimal solutions for almost-balanced sequences, for both, binary and non-binary alphabet.
- [134] arXiv:2405.08634 [pdf, ps, other]
-
Title: Stabilization of Integral Delay Equations by solving Fredholm equationsJean Auriol (L2S)Comments: IEEE Control Systems Letters, In pressSubjects: Dynamical Systems (math.DS)
In this paper, we design a stabilizing state-feedback control law for a system represented by a general class of integral delay equations subject to a pointwise and distributed input delay. The proposed controller is defined in terms of integrals of the state and input history over a fixed-length time window. We show that the closed-loop stability is guaranteed, provided the controller integral kernels are solutions to a set of Fredholm equations. The existence of solutions is guaranteed under an appropriate spectral controllability assumption, resulting in an implementable stabilizing control law. The proposed methodology appears simpler and more general compared to existing results in the literature. In particular, under additional regularity assumptions, the proposed approach can be expanded to address the degenerate case where only a distributed control term is present.
- [135] arXiv:2405.08635 [pdf, ps, other]
-
Title: Approaches to iterative algorithms for solving nonlinear equations with an application in tomographic absorption spectroscopyComments: Accepted for publication in the journal: Communications in Optimization TheorySubjects: Optimization and Control (math.OC); Numerical Analysis (math.NA); Medical Physics (physics.med-ph)
In this paper we propose an approach for solving systems of nonlinear equations without computing function derivatives. Motivated by the application area of tomographic absorption spectroscopy, which is a highly-nonlinear problem with variables coupling, we consider a situation where straightforward translation to a fixed point problem is not possible because the operators that represent the relevant systems of nonlinear equations are not self-mappings, i.e., they operate between spaces of different dimensions. To overcome this difficulty we suggest an "alternating common fixed points algorithm" that acts alternatingly on the different vector variables. This approach translates the original problem to a common fixed point problem for which iterative algorithms are abound and exhibits a viable alternative to translation to an optimization problem, which usually requires derivatives information. However, to apply any of these iterative algorithms requires to ascertain the conditions that appear in their convergence theorems. To circumvent the need to verify conditions for convergence, we propose and motivate a derivative-free algorithm that better suits the tomographic absorption spectroscopy problem at hand and is even further improved by applying to it the superiorization approach. This is presented along with experimental results that demonstrate our approach.
- [136] arXiv:2405.08639 [pdf, ps, other]
-
Title: Upwards homogeneity in iterated symmetric extensionsComments: 16 pages, 1 figureSubjects: Logic (math.LO)
It is sometimes desirable in choiceless constructions of set theory that one iteratively extends some ground model without adding new sets of ordinals after the first extension. Pushing this further, one may wish to have models $V \subseteq M \subseteq N$ of $\mathsf{ZF}$ such that $N$ contains no subsets of $V$ that do not already appear in $M$. We isolate, in the case that $M$ and $N$ are symmetric extensions (particular inner models of a generic extension of $V$), the exact conditions that cause this behaviour and show how it can broadly be applied to many known constructions. We call this behaviour upwards homogeneity.
- [137] arXiv:2405.08640 [pdf, ps, other]
-
Title: A sparsity test for multivariate Hawkes processesSubjects: Statistics Theory (math.ST)
Multivariate Hawkes processes (MHP) are a class of point processes in which events at different coordinates interact through mutual excitation. The weighted adjacency matrix of the MHP encodes the strength of the relations, and shares its support with the causal graph of interactions of the process. We consider the problem of testing for causal relationships across the dimensions of a marked MHP. The null hypothesis is that a joint group of adjacency coefficients are null, corresponding to the absence of interactions. The alternative is that they are positive, and the associated interactions do exist. To this end, we introduce a novel estimation procedure in the context of a large sample of independent event sequences. We construct the associated likelihood ratio test and derive the asymptotic distribution of the test statistic as a mixture of chi squared laws. We offer two applications on financial datasets to illustrate the performance of our method. In the first one, our test reveals a deviation from a static equilibrium in bidders' strategies on retail online auctions. In the second one, we uncover some factors at play in the dynamics of German intraday power prices.
- [138] arXiv:2405.08641 [pdf, ps, other]
-
Title: Asymptotic directions in the moduli space of curvesSubjects: Algebraic Geometry (math.AG)
In this paper we study asymptotic directions in the tangent bundle of the moduli space ${\mathcal M}_g$ of curves of genus $g$, namely those tangent directions that are annihilated by the second fundamental form of the Torelli map. We give examples of asymptotic directions for any $g \geq 4$. We prove that if the rank $d$ of a tangent direction $\zeta \in H^1(T_C)$ (with respect to the infinitesimal deformation map) is less than the Clifford index of the curve $C$, then $\zeta$ is not asymptotic. If the rank of $\zeta$ is equal to the Clifford index of the curve, we give sufficient conditions ensuring that the infinitesimal deformation $\zeta$ is not asymptotic. Then we determine all asymptotic directions of rank 1 and we give an almost complete description of asymptotic directions of rank 2.
- [139] arXiv:2405.08646 [pdf, ps, other]
-
Title: Partial order on involutive permutations and double Schubert cellsComments: 8 pages, 4 figuresSubjects: Combinatorics (math.CO)
As shown by A. Melnikov, the orbits of a Borel subgroup acting by conjugation on upper-triangular matrices with square zero are indexed by involutions in the symmetric group. The inclusion relation among the orbit closures defines a partial order on involutions. We observe that the same order on involutive permutations also arises while describing the inclusion order on B-orbit closures in the direct product of two Grassmannians. We establish a geometric relation between these two settings.
- [140] arXiv:2405.08650 [pdf, ps, other]
-
Title: An explicit economical additive basisComments: 5 pagesSubjects: Combinatorics (math.CO)
We present an explicit subset $A\subseteq \mathbb{N} = \{0,1,\ldots\}$ such that $A + A = \mathbb{N}$ and for all $\varepsilon > 0$, \[\lim_{N\to \infty}\frac{\big|\big\{(n_1,n_2): n_1 + n_2 = N, (n_1,n_2)\in A^2\big\}\big|}{N^{\varepsilon}} = 0.\] This answers a question of Erdős.
- [141] arXiv:2405.08652 [pdf, ps, other]
-
Title: Non-local parabolic equations with singular (Morrey) time-inhomogeneous driftSubjects: Analysis of PDEs (math.AP); Probability (math.PR)
We obtain Sobolev regularity estimates for solutions of non-local parabolic equations with locally unbounded drift satisfying some minimal assumptions. These results yield Krylov bound for the corresponding Feller stable process as well as some a priori regularity estimates on solutions of McKean-Vlasov equations. A key element of our arguments is a parabolic operator norm inequality that we prove using some ideas of Adams and Krylov.
- [142] arXiv:2405.08653 [pdf, ps, other]
-
Title: The Connectedness Homomorphism between Discrete Morse ComplexesComments: 18 pagesSubjects: Combinatorics (math.CO); Algebraic Topology (math.AT)
Given two discrete Morse functions on a simplicial complex, we introduce the {\em connectedness homomorphism} between the corresponding discrete Morse complexes. This concept leads to a novel framework for studying the connectedness in discrete Morse theory at the chain complex level. In particular, we apply it to describe a discrete analogy to `cusp-degeneration' of Morse complexes. A precise comparison between smooth case and our discrete cases is also given.
- [143] arXiv:2405.08662 [pdf, ps, other]
-
Title: Representation theory of skew bracesComments: 36 pagesSubjects: Representation Theory (math.RT); Group Theory (math.GR); Quantum Algebra (math.QA)
According to Letourmy and Vendramin, a representation of a skew brace is a pair of representations on the same vector space, one for the additive group and the other for the multiplicative group, that satisfies a certain compatibility condition. Following their definition, we shall develop the theory of representations of skew braces. We show that the analogs of Maschke's theorem and Clifford's theorem hold for skew braces. We also study irreducible representations in prime characteristic.
- [144] arXiv:2405.08664 [pdf, ps, other]
-
Title: Near critical asymptotics in the Frozen Erd\H{o}s-R\'enyiComments: 32 pages, 3 figuresSubjects: Probability (math.PR)
We consider a variant of the classical Erdős-Rényi random graph, where components with surplus are slowed down to prevent the apparition of complex components. The sizes of the components of this process undergo a similar phase transition to that of the classical model, and in the critical window the scaling limit of the sizes of the components is a "frozen" version of Aldous' multiplicative coalescent [2]. The aim of this article is to describe the long time asymptotics in the critical window for the total number of vertices which belong to a component with surplus.
- [145] arXiv:2405.08671 [pdf, ps, other]
-
Title: Sequentially Cohen-Macaulay binomial edge idealsSubjects: Commutative Algebra (math.AC); Combinatorics (math.CO)
We prove that cycles, wheels and block graphs have sequentially Cohen-Macaulay binomial edge ideals. Moreover, we provide a construction of new families of sequentially Cohen-Macaulay graphs by cones.
- [146] arXiv:2405.08678 [pdf, ps, other]
-
Title: Wiener's Tauberian theorem in classical and quantum harmonic analysisComments: 36 pages, comments welcomeSubjects: Functional Analysis (math.FA)
We investigate Wiener's Tauberian theorem from the perspective of limit functions, which results in several new versions of the Tauberian theorem. Based on this, we formulate and prove analogous Tauberian theorems for operators in the sense of quantum harmonic analysis. Using these results, we characterize the class of slowly oscillating operators and show that this class is strictly larger than the class of compact operators. Finally, we discuss uniform versions of Wiener's Tauberian theorem and its operator analogue and provide an application of this in operator theory.
- [147] arXiv:2405.08682 [pdf, ps, other]
-
Title: Norms of spherical averaging operators for some geometric group actionsComments: 20 pagesSubjects: Group Theory (math.GR); Functional Analysis (math.FA)
We obtain asymptotic estimates for the $\ell^p$-operator norm of spherical averaging operators associated to certain geometric group actions. The motivating example is the case of Gromov hyperbolic groups, for which we obtain asymptotically sharp estimates. We deduce asymptotic lower bounds for the combinatorial expansion of spheres.
- [148] arXiv:2405.08690 [pdf, ps, other]
-
Title: Double-activation neural network for solving parabolic equations with time delayComments: 21 pages,11 figuresSubjects: Numerical Analysis (math.NA)
This paper presents the double-activation neural network (DANN), a novel network architecture designed for solving parabolic equations with time delay. In DANN, each neuron is equipped with two activation functions to augment the network's nonlinear expressive capacity. Additionally, a new parameter is introduced for the construction of the quadratic terms in one of two activation functions, which further enhances the network's ability to capture complex nonlinear relationships. To address the issue of low fitting accuracy caused by the discontinuity of solution's derivative, a piecewise fitting approach is proposed by dividing the global solving domain into several subdomains. The convergence of the loss function is proven. Numerical results are presented to demonstrate the superior accuracy and faster convergence of DANN compared to the traditional physics-informed neural network (PINN).
- [149] arXiv:2405.08692 [pdf, ps, other]
-
Title: Quaternionic Cartan coverings and applicationsSubjects: Complex Variables (math.CV)
We present the topological foundation for solvability of Multiplicative Cousin problems formulated on an axially symmetric domain $\Omega \subset \mathbb H.$ In particular, we provide a geometric construction of quaternionic Cartan coverings, which are generalizations of (complex) Cartan coverings as presented in Section 4 of [FP]. Because of the requirements of symmetry inherent to the domains of definition of quaternionic regular functions, the existence of quaternionic Cartan coverings of $\Omega$ is not a consequence of existence of complex Cartan coverings, because for the latter there are no requirements for the symmetries with respect to the real axis. Due to the special role of the real axis, also the covering restricted to $\Omega \cap \mathbb R$ has to have additional properties. All these required properties were achieved by starting from a particular symmetric tiling of the symmetric set $\Omega \cap (\mathbb R + i\mathbb R)$. Finally we provide an application of these results to prove the vanishing of 'antisymmetric' cohomology groups of planar symmetric domains for $n \geq 2$.
- [150] arXiv:2405.08698 [pdf, ps, other]
-
Title: Byzantine-Resilient Secure Aggregation for Federated Learning Without Privacy CompromisesSubjects: Information Theory (cs.IT); Cryptography and Security (cs.CR); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG)
Federated learning (FL) shows great promise in large scale machine learning, but brings new risks in terms of privacy and security. We propose ByITFL, a novel scheme for FL that provides resilience against Byzantine users while keeping the users' data private from the federator and private from other users. The scheme builds on the preexisting non-private FLTrust scheme, which tolerates malicious users through trust scores (TS) that attenuate or amplify the users' gradients. The trust scores are based on the ReLU function, which we approximate by a polynomial. The distributed and privacy-preserving computation in ByITFL is designed using a combination of Lagrange coded computing, verifiable secret sharing and re-randomization steps. ByITFL is the first Byzantine resilient scheme for FL with full information-theoretic privacy.
- [151] arXiv:2405.08705 [pdf, ps, other]
-
Title: Pfaff's Method RevisitedComments: 16 pages. Comments are welcome!Subjects: Number Theory (math.NT); Classical Analysis and ODEs (math.CA)
In 1797, Pfaff gave a simple proof of a ${}_3F_2$ hypergeometric series which was much later reproved by Andrews in 1996. In the same paper, Andrews also proved other well-known hypergeometric identities using Pfaff's method. In this paper, we prove a number of terminating $q$-hypergeometric series-product identities using Pfaff's method thereby providing a detailed account of its wide applicability.
- [152] arXiv:2405.08708 [pdf, ps, html, other]
-
Title: Poisson approximation for cycles in the generalised random graphSubjects: Probability (math.PR)
The generalised random graph contains $n$ vertices with positive i.i.d. weights. The probability of adding an edge between two vertices is increasing in their weights. We require the weight distribution to have finite second moments and study the point process $\mathcal{C}_n$ on $\{3,4,\dots\}$, which counts how many cycles of the respective length are present in the graph. We establish convergence of $\mathcal{C}_n$ to a Poisson point process. Under the stronger assumption of the weights having finite fourth moments we provide the following results. When $\mathcal{C}_n$ is evaluated on a bounded set $A$, we provide a rate of convergence. If the graph is additionally subcritical, we extend this to unbounded sets $A$ at the cost of a slower rate of convergence. From this we deduce the limiting distribution of the length of the shortest and the longest cycle when the graph is subcritical, including rates of convergence. All mentioned results also apply to the Chung-Lu model and the Norros-Reittu model.
- [153] arXiv:2405.08709 [pdf, ps, other]
-
Title: Multi-Task Private Semantic CommunicationSubjects: Information Theory (cs.IT)
We study a multi-task private semantic communication problem, in which an encoder has access to an information source arbitrarily correlated with some latent private data. A user has $L$ tasks with priorities. The encoder designs a message to be revealed which is called the semantic of the information source. Due to the privacy constraints the semantic can not be disclosed directly and the encoder adds noise to produce disclosed data. The goal is to design the disclosed data that maximizes the weighted sum of the utilities achieved by the user while satisfying a privacy constraint on the private data. In this work, we first consider a single-task scenario and design the added noise utilizing various methods including the extended versions of the Functional Representation Lemma, Strong Functional Representation Lemma, and separation technique. We then study the multi-task scenario and derive a simple design of the source semantics. We show that in the multi-task scenario the main problem can be divided into multiple parallel single-task problems.
- [154] arXiv:2405.08716 [pdf, ps, other]
-
Title: Commuting Clifford actionsComments: Approx 10 pagesSubjects: Mathematical Physics (math-ph); High Energy Physics - Theory (hep-th); Quantum Algebra (math.QA)
It shown that if a vector space carries commuting actions of two Clifford algebras, then the quadratic monomials using generators from either Clifford algebra determine a spinor representation of an orthogonal Lie algebra.
Examples of this construction have applications to high energy physics, particularly to the standard model and unification. It is shown how to use Clifford data to construct spectral triples for the Pati-Salam model that admit an action of Spin(10). - [155] arXiv:2405.08721 [pdf, ps, other]
-
Title: A regularized eigenmatrix method for unstructured sparse recoveryComments: 11 pages, 5 figuresSubjects: Numerical Analysis (math.NA)
The recently developed data-driven eigenmatrix method shows very promising reconstruction accuracy in sparse recovery for a wide range of kernel functions and random sample locations. However, its current implementation can lead to numerical instability if the threshold tolerance is not appropriately chosen. To incorporate regularization techniques, we propose to regularize the eigenmatrix method by replacing the computation of an ill-conditioned pseudo-inverse by the solution of an ill-conditioned least square system, which can be efficiently treated by Tikhonov regularization. Extensive numerical examples confirmed the improved effectiveness of our proposed method, especially when the noise levels are relatively high.
- [156] arXiv:2405.08723 [pdf, ps, other]
-
Title: Decomposition numbers in the principal block and Sylow normalisersSubjects: Representation Theory (math.RT); Group Theory (math.GR)
If G is a finite group and p is a prime number, we investigate the relationship between the p-modular decomposition numbers of characters of height zero in the principal p-block of G and the p-local structure of G.
- [157] arXiv:2405.08725 [pdf, ps, other]
-
Title: Lower bounds for shifted moments of the Riemann zeta functionSubjects: Number Theory (math.NT)
In previous work, the author gave upper bounds for the shifted moments of the zeta function \[ M_{{\alpha},{\beta}}(T) = \int_T^{2T} \prod_{k = 1}^m |\zeta(\tfrac{1}{2} + i (t + \alpha_k))|^{2 \beta_k} dt \] introduced by Chandee, where ${\alpha} = {\alpha}(T) = (\alpha_1, \ldots, \alpha_m)$ and ${\beta} = (\beta_1 \ldots , \beta_m)$ satisfy $|\alpha_k| \leq T/2$ and $\beta_k\geq 0$. Assuming the Riemann hypothesis, we shall prove the corresponding lower bounds: \[ M_{{\alpha},{\beta}}(T) \gg_{\beta} T (\log T)^{\beta_1^2 + \cdots + \beta_m^2} \prod_{1\leq j < k \leq m} |\zeta(1 + i(\alpha_j - \alpha_k) + 1/ \log T )|^{2\beta_j \beta_k}. \]
- [158] arXiv:2405.08732 [pdf, ps, other]
-
Title: Multi-Server Multi-Function Distributed ComputationComments: Submitted to EntropySubjects: Information Theory (cs.IT)
The work here studies the communication cost for a multi-server multi-task distributed computation framework, and does so for a broad class of functions and data statistics. Considering the framework where a user seeks the computation of multiple complex (conceivably non-linear) tasks from a set of distributed servers, we establish communication cost upper bounds for a variety of data statistics, function classes and data placements across the servers. To do so, we proceed to apply, for the first time here, Körner's characteristic graph approach -- which is known to capture the structural properties of data and functions -- to the promising framework of multi-server multi-task distributed computing. Going beyond the general expressions, and in order to offer clearer insight, we also consider the well-known scenario of cyclic dataset placement and linearly separable functions over the binary field, in which case our approach exhibits considerable gains over the state of art. Similar gains are identified for the case of multi-linear functions.
- [159] arXiv:2405.08734 [pdf, ps, other]
-
Title: On the independence number of regular graphs of matrix ringsJournal-ref: Linear Algebra and its Applications 681 (2024), 89-96Subjects: Combinatorics (math.CO)
Consider a graph on the non-singular matrices over a finite field, in which two distinct non-singular matrices are joined by an edge whenever their sum is singular. We prove an upper bound for the independence number of this graph. As a consequence, we obtain a lower bound for its chromatic number that significantly improves a previous result of Tomon.
- [160] arXiv:2405.08736 [pdf, ps, other]
-
Title: Polytropic Dynamical Systems with Time SingularityComments: 18 pages, 7 figuresSubjects: Dynamical Systems (math.DS)
In this paper we consider a class of second order singular homogeneous differential equations called the Lane-Emden-type with time singularity in the drift coefficient. Lane-Emden equations are singular initial value problems that model phenomena in astrophysics such as stellar structure and are governed by polytropics with applications in isothermal gas spheres. A hybrid method is proposed to approximate the solution of this type of dynamic equations.
- [161] arXiv:2405.08747 [pdf, ps, other]
-
Title: Minimax optimal seriation in polynomial timeSubjects: Statistics Theory (math.ST)
We consider the statistical seriation problem, where the statistician seeks to recover a hidden ordering from a noisy observation of a permuted Robinson matrix. In this paper, we tightly characterize the minimax rate for this problem of matrix reordering when the Robinson matrix is bi-Lipschitz, and we also provide a polynomial time algorithm achieving this rate; thereby answering two open questions of [Giraud et al., 2021]. Our analysis further extends to broader classes of similarity matrices.
- [162] arXiv:2405.08753 [pdf, ps, other]
-
Title: Self-repellent Brownian Bridges in an Interacting Bose GasSubjects: Probability (math.PR)
We consider a model of $d$-dimensional interacting quantum Bose gas, expressed in terms of an ensemble of interacting Brownian bridges in a large box and undergoing the influence of all the interactions between the legs of each of the Brownian bridges. We study the thermodynamic limit of the system and give an explicit formula for the limiting free energy and a necessary and sufficient criterion for the occurrence of a condensation phase transition. For $d\geq 5$ and sufficiently small interaction, we prove that the condensate phase is not empty. The ideas of proof rely on the similarity of the interaction to that of the self-repellent random walk, and build on a lace expansion method conducive to treating {\it paths} undergoing mutual repellence within each bridge.
- [163] arXiv:2405.08757 [pdf, ps, other]
-
Title: Local well-posedness and regularity properties for an initial-boundary value problem associated to the fifth order Korteweg-de Vries equationSubjects: Analysis of PDEs (math.AP)
In this work we prove that the initial-boundary value problem (IBVP) for the fifth order Korteweg-de Vries equation \begin{align*} \left. \begin{array}{rlr} u_t+\partial_x^5 u+u\partial_x u&\hspace{-2mm}=0,&\quad x\in\mathbb R^+,\; t\in\mathbb R^+,\\ u(x,0)&\hspace{-2mm}=g(x),&\\ u(0,t)=h_1(t),\, \partial_x u(0,t)&\hspace{-2mm}=h_2(t),\,\partial_x^2 u(0,t)=h_3(t), \end{array} \right\} \end{align*} is locally well posed, when the data $g$, $h_1$, $h_2$, $h_3$ are taken in such a way that $g\in H^s(\mathbb R_x^+)$, and $h_{j+1}\in H^{\frac{s+2-j}5}(\mathbb R_t^+)$, $j=0,1,2$, $s\in [0,\frac{11}4)\setminus \{\frac12,\frac32,\frac52\}$, and satisfy the following compatibility conditions: \begin{align*} g(0)=h_1(0) \text{ if } \frac12<s<\frac32;\\ g(0)=h_1(0),\; g'(0)=h_2(0) \text{ if } \frac32<s<\frac52;\\ g(0)=h_1(0), \; g'(0)=h_2(0),\; g''(0)=h_3(0) \text{ if } \frac52<s<\frac{11}4. \end{align*} Besides, we prove that the nonlinear part of the solution is smoother than the initial datum $g$.
- [164] arXiv:2405.08763 [pdf, ps, other]
-
Title: Genus, Fiberedness, $\tau$ and $\epsilon$ of Satellite Knots with $n$-Twisted Generalized Mazur patternsComments: 44 pages, 56 figures. Comments Welcome!Subjects: Geometric Topology (math.GT)
We study a family of $(1,1)$-pattern knots that generalize the Mazur pattern, and compute the concordance invariants $\tau$ and $\epsilon$ of $n$-twisted satellites formed from these patterns. We show that none of the $n$-twisted patterns from this family act surjectively on the smooth or rational concordance group. We also determine when the $n$-twisted generalized Mazur patterns are fibered in the solid torus, compute their genus in $S^1 \times D^2$, and show that $n$-twisted satellites with generalized Mazur patterns and non-trivial companions are not Floer thin.
- [165] arXiv:2405.08764 [pdf, ps, other]
-
Title: A Generalized Curvilinear Coordinate system-based Patch Dynamics Scheme in Equation-free Multiscale ModellingComments: 21 pages, 8 figures, 4 tablesSubjects: Numerical Analysis (math.NA)
The patch dynamics scheme in equation-free multiscale modelling can efficiently predict the macroscopic behaviours by simulating the microscale problem in a fraction of the space-time domain. The patch dynamics schemes developed so far, are mainly on rectangular domains with uniform grids and uniform rectangular patches. In real-life problems where the geometry of the domain is not regular or simple, rectangular and uniform grids or patches may not be useful. To address this kind of complexity, the concept of a generalized curvilinear coordinate system is used. An explicit representation of a patch dynamics scheme on a generalized curvilinear coordinate system in a two-dimensional domain is proposed for evolution equations. It has been applied to unsteady convection-diffusion-reaction (CDR) problems. The robustness of the scheme on the generalized curvilinear coordinate system is assessed through numerical test cases. Firstly, a convection-dominated CDR equation is considered, featuring high gradient regions in some part of the domain, for which stretched grids with non-uniform patch sizes are employed. Secondly, a non-axisymmetric diffusion equation is examined in an annulus region, where the patches have non-rectangular shapes. The results obtained demonstrate excellent agreement with the analytical solution or existing numerical solutions.
- [166] arXiv:2405.08770 [pdf, ps, other]
-
Title: An optimization-based construction procedure for function space based summation-by-parts operators on arbitrary gridsComments: 18 pages, 8 FiguresSubjects: Numerical Analysis (math.NA); Optimization and Control (math.OC)
We introduce a novel construction procedure for one-dimensional summation-by-parts (SBP) operators. Existing construction procedures for FSBP operators of the form $D = P^{-1} Q$ proceed as follows: Given a boundary operator $B$, the norm matrix $P$ is first determined and then in a second step the complementary matrix $Q$ is calculated to finally get the FSBP operator $D$. In contrast, the approach proposed here determines the norm and complementary matrices, $P$ and $Q$, simultaneously by solving an optimization problem. The proposed construction procedure applies to classical SBP operators based on polynomial approximation and the broader class of function space SBP (FSBP) operators. According to our experiments, the presented approach yields a numerically stable construction procedure and FSBP operators with higher accuracy for diagonal norm difference operators at the boundaries than the traditional approach. Through numerical simulations, we highlight the advantages of our proposed technique.
- [167] arXiv:2405.08771 [pdf, ps, other]
-
Title: Multi-objective SINDy for parameterized model discovery from single transient trajectory dataSubjects: Dynamical Systems (math.DS)
The sparse identification of nonlinear dynamics (SINDy) has been established as an effective technique to produce interpretable models of dynamical systems from time-resolved state data via sparse regression. However, to model parameterized systems, SINDy requires data from transient trajectories for various parameter values over the range of interest, which are typically difficult to acquire experimentally. In this work, we extend SINDy to be able to leverage data on fixed points and/or limit cycles to reduce the number of transient trajectories needed for successful system identification. To achieve this, we incorporate the data on these attractors at various parameter values as constraints in the optimization problem. First, we show that enforcing these as hard constraints leads to an ill-conditioned regression problem due to the large number of constraints. Instead, we implement soft constraints by modifying the cost function to be minimized. This leads to the formulation of a multi-objective sparse regression problem where we simultaneously seek to minimize the error of the fit to the transients trajectories and to the data on attractors, while penalizing the number of terms in the model. Our extension, demonstrated on several numerical examples, is more robust to noisy measurements and requires substantially less training data than the original SINDy method to correctly identify a parameterized dynamical system.
- [168] arXiv:2405.08778 [pdf, ps, other]
-
Title: Quantum Integrable Systems arising from Separation of Variables on S3Subjects: Mathematical Physics (math-ph); Classical Analysis and ODEs (math.CA); Dynamical Systems (math.DS)
We study the family of quantum integrable systems that arise from separating the Schrödinger equation in all 6 separable orthogonal coordinates on the 3 sphere: ellipsoidal, prolate, oblate, Lamé, spherical and cylindrical. On the one hand each separating coordinate system gives rise to a quantum integrable system on S2 x S2, on the other hand it also leads to families of harmonic polynomials in R4. We show that separation in ellipsoidal coordinates yields a generalised Lamé equation - a Fuchsian ODE with 5 regular singular points. We seek polynomial solutions so that the eigenfunctions are analytic at all finite singularities. We classify eigenfunctions by their discrete symmetry and compute the joint spectrum for each symmetry class. The latter 5 separable coordinate systems are all degenerations of the ellipsoidal coordinates. We perform similar analyses on these systems and show how the ODEs degenerate in a fashion akin to their respective coordinates. For the prolate system we show that there exists a defect in the joint spectrum which prohibits a global assignment of quantum numbers: the system has quantum monodromy. This is a companion paper to our previous work where the respective classical systems were studied.
- [169] arXiv:2405.08781 [pdf, ps, html, other]
-
Title: To graph total coloring via efficient dominating setsComments: 6 pages, 2 figuresSubjects: Combinatorics (math.CO)
A totally efficient coloring of a regular graph with color set formed by one unit more than its degree is a total coloring in which each vertex color class is an efficient dominating set, that is a perfect code. We find that the 3-cube graph has a totally efficient coloring and conjecture that this is the only existing case of a totally effective color We also deal with a related problem exemplified by star transposition graphs and related graphs, leading to edge-girth colorings on their prism graphs.
- [170] arXiv:2405.08791 [pdf, ps, html, other]
-
Title: On the basin of attraction of a critical three-cycle of a model for the secant mapSubjects: Dynamical Systems (math.DS)
We consider the secant method $S_p$ applied to a real polynomial $p$ of degree $d+1$ as a discrete dynamical system on $\mathbb R^2$. If the polynomial $p$ has a local extremum at a point $\alpha$ then the discrete dynamical system generated by the iterates of the secant map exhibits a critical periodic orbit of period 3 or three-cycle at the point $(\alpha,\alpha)$. We propose a simple model map $T_{a,d}$ having a unique fixed point at the origin which encodes the dynamical behaviour of $S_p^3$ at the critical three-cycle. The main goal of the paper is to describe the geometry and topology of the basin of attraction of the origin of $T_{a,d}$ as well as its boundary. Our results concern global, rather than local, dynamical behaviour. They include that the boundary of the basin of attraction is the stable manifold of a fixed point or contains the stable manifold of a two-cycle, depending on the values of the parameters of $d$ (even or odd) and $a\in \mathbb R$ (positive or negative).
- [171] arXiv:2405.08795 [pdf, ps, other]
-
Title: The fundamental martingale with applications to Markov Random FieldsSubjects: Probability (math.PR)
We consider collections of SDEs indexed by a graph. Each SDE is driven by an additive Gaussian noise and each drift term interacts with all other SDEs within the graph neighbourhood. We derive the fundamental martingale for a class of Gaussian processes and use this to prove a Girsanov type theorem. Further, we use this to construct a clique factorisation to prove that the law of the interacting SDEs forms a 2-Markov Random Field.
- [172] arXiv:2405.08797 [pdf, ps, other]
-
Title: Two questions on Kneser coloringsSubjects: Combinatorics (math.CO); Discrete Mathematics (cs.DM)
In this paper, we investigate two questions on Kneser graphs $KG_{n,k}$. First, we prove that the union of $s$ non-trivial intersecting families in ${[n]\choose k}$ has size at most ${n\choose k}-{n-s\choose k}$ for all sufficiently large $n$ that satisfy $n>(2+\epsilon)k^2$ with $\epsilon>0$. We provide an example that shows that this result is essentially tight for the number of colors close to $\chi(KG_{n,k})=n-2k+2$. We also improve the result of Bulankina and Kupavskii on the choice chromatic number, showing that it is at least $\frac 1{16} n\log n$ for all $k<\sqrt n$ and $n$ sufficiently large.
- [173] arXiv:2405.08799 [pdf, ps, html, other]
-
Title: Uniqueness and $(\infty,2)$-Naturality of YonedaComments: 9 pages, comments are welcome!Subjects: Category Theory (math.CT); Algebraic Topology (math.AT)
We show that the Yoneda embedding extends to an $(\infty,2)$-natural transformation. Furthermore, as such, it is uniquely determined by its value at the trivial $\infty$-category. We also study the naturality of the Yoneda lemma in its arguments, showing that it is an isomorphism of $(\infty,2)$-natural transformations.
- [174] arXiv:2405.08803 [pdf, ps, html, other]
-
Title: A Mimicking Theorem for processes driven by fractional Brownian motionSubjects: Probability (math.PR)
In this paper, we prove a mimicking theorem for stochastic processes with an additive Gaussian noise along with some entropy and transport type estimates. As an application of these results, we prove sharp quantitative propagation of chaos result and derive a formula for the marginal dynamics of collections of locally interacting stochastic differential equations with additive Gaussian noise.
- [175] arXiv:2405.08805 [pdf, ps, other]
-
Title: Special potentials for relativistic Laplacians I: Fractional Rollnik-classSubjects: Functional Analysis (math.FA); Mathematical Physics (math-ph)
We propose a counterpart of the classical Rollnik-class of potentials for fractional and massive relativistic Laplacians, and describe this space in terms of appropriate Riesz potentials. These definitions rely on precise resolvent estimates. We show that Coulomb-type potentials are elements of fractional Rollnik-class up to but not including the critical singularity of the Hardy potential. For the operators with fractional exponent $\alpha = 1$ there exists no fractional Rollnik potential, however, in low dimensions we make sense of these classes as limiting cases by using $\Gamma$-convergence. In a second part of the paper we derive detailed results on the self-adjointness and spectral properties of relativistic Schrödinger operators obtained under perturbations by fractional Rollnik potentials. We also define an extended fractional Rollnik-class which is the maximal space for the Hilbert-Schmidt property of the related Birman-Schwinger operators.
- [176] arXiv:2405.08806 [pdf, ps, html, other]
-
Title: Bounds on the Distribution of a Sum of Two Random Variables: Revisiting a problem of Kolmogorov with application to Individual Treatment EffectsSubjects: Statistics Theory (math.ST); Econometrics (econ.EM); Probability (math.PR)
We revisit the following problem, proposed by Kolmogorov: given prescribed marginal distributions $F$ and $G$ for random variables $X,Y$ respectively, characterize the set of compatible distribution functions for the sum $Z=X+Y$. Bounds on the distribution function for $Z$ were given by Markarov (1982), and Frank et al. (1987), the latter using copula theory. However, though they obtain the same bounds, they make different assertions concerning their sharpness. In addition, their solutions leave some open problems in the case when the given marginal distribution functions are discontinuous. These issues have led to some confusion and erroneous statements in subsequent literature, which we correct.
Kolmogorov's problem is closely related to inferring possible distributions for individual treatment effects $Y_1 - Y_0$ given the marginal distributions of $Y_1$ and $Y_0$; the latter being identified from a randomized experiment. We use our new insights to sharpen and correct results due to Fan and Park (2010) concerning individual treatment effects, and to fill some other logical gaps. - [177] arXiv:2405.08811 [pdf, ps, html, other]
-
Title: Slow-growing counterexamples to the strong Eremenko ConjectureComments: 26 pages, 2 figuresSubjects: Dynamical Systems (math.DS); Complex Variables (math.CV)
Let $f\colon\mathbb{C} \to\mathbb{C}$ be a transcendental entire function. In 1989, Eremenko asked the following question concerning the set $I(f)$ of points that tend to infinity under iteration: can every point of $I(f)$ be joined to $\infty$ by a curve in $I(f)$? This is known as the strong Eremenko conjecture and was disproved in 2011 by Rottenfußer, Rückert, Rempe and Schleicher. The function has relatively small infinite order: it can be chosen such that $\log \log \,\lvert f(z)\rvert = (\log \lvert z\rvert)^{1+o(1)}$ as $f(z)\to \infty$. Moreover, $f$ belongs to the \emph{Eremenko--Lyubich class $\mathcal{B}$}. Rottenfußer et al also show that the strong Eremenko conjecture does hold for any $f\in\mathcal{B}$ of finite order. We consider how slow a counterexample $f\in\mathcal{B}$ can grow. Suppose that $\Theta\colon [t_0,\infty)\to [0,\infty)$ satisfies $\Theta(t) \to 0$ and \[ (\log t)^{-\beta \Theta(\log t)}/\Theta(t) \to 0 \quad\text{ as $t\to \infty$} \] for some $0<\beta<1$, along with a certain regularity assumption. Then there exists a counterexample $f\in\mathcal{B}$ as above such that \[ \log \log \vert f(z)\vert = O ( (\log \vert z \vert)^{1 + \Theta( \log \vert z \vert )}) \] as $\vert f(z)\vert \to\infty$. The hypotheses are satisfied, in particular, for $\Theta(t) = 1/(\log \log t)^{\alpha}$, for any $\alpha>0$.
- [178] arXiv:2405.08812 [pdf, ps, other]
-
Title: Chaotic dynamics at the boundary of a basin of attraction via non-transversal intersections for a non-global smooth diffeomorphismSubjects: Dynamical Systems (math.DS)
In this paper we give analytic proofs of the existence of transversal homoclinic points for a family of non-globally smooth diffeomorphisms having the origin as a fixed point which come out as a truncated map governing the local dynamics near a critical period three cycle associated to the Secant map. Using Moser's version of Birkhoff-Smale's Theorem, we prove that the boundary of the basin of attraction of the origin contains a Cantor-like invariant subset such that the restricted dynamics to it is conjugate to the full shift of $N$-symbols for any integer $N\ge 2$ or infinity.
New submissions for Wednesday, 15 May 2024 (showing 178 of 178 entries )
- [179] arXiv:2405.08058 (cross-list from hep-th) [pdf, ps, html, other]
-
Title: The Fusion Categorical DiagonalComments: 25 pages, 4 appendicesSubjects: High Energy Physics - Theory (hep-th); Strongly Correlated Electrons (cond-mat.str-el); Quantum Algebra (math.QA)
We define a Frobenius algebra over fusion categories of the form Rep$(G)\boxtimes$Rep$(G)$ which generalizes the diagonal subgroup of $G\times G$. This allows us to extend field theoretical constructions which depend on the existence of a diagonal subgroup to non-invertible symmetries. We give explicit calculations for theories with Rep$(S_3)\boxtimes$Rep$(S_3)$ symmetry, applying the results to gauging topological quantum field theories which carry this non-invertible symmetry. Along the way, we also discuss how Morita equivalence is implemented for algebras in symmetry categories.
- [180] arXiv:2405.08067 (cross-list from hep-th) [pdf, ps, other]
-
Title: Local Zeta Functions of Multiparameter Calabi-Yau Threefolds from the Picard-Fuchs EquationsComments: The associated Mathematica package can be downloaded from this https URLSubjects: High Energy Physics - Theory (hep-th); Number Theory (math.NT)
The deformation approach of arXiv:2104.07816 for computing zeta functions of one-parameter Calabi-Yau threefolds is generalised to cover also multiparameter manifolds. Consideration of the multiparameter case requires the development of an improved formalism. This allows us, among other things, to make progress on some issues left open in previous work, such as the treatment of apparent and conifold singularities and changes of coordinates. We also discuss the efficient numerical computation of the zeta functions. As examples, we compute the zeta functions of the two-parameter mirror octic, a non-symmetric split of the quintic threefold also with two parameters, and the $S_5$ symmetric five-parameter Hulek-Verrill manifolds. These examples allow us to exhibit the several new types of geometries for which our methods make practical computations possible. They also act as consistency checks, as our results reproduce and extend those of arXiv:hep-th/0409202 and arXiv:math/0304169. To make the methods developed here more approachable, a Mathematica package "CY3Zeta" for computing the zeta functions of Calabi-Yau threefolds, which is attached to this paper, is presented.
- [181] arXiv:2405.08083 (cross-list from hep-th) [pdf, ps, html, other]
-
Title: 5d 2-Chern-Simons theory and 3d integrable field theoriesComments: 28 pagesSubjects: High Energy Physics - Theory (hep-th); Mathematical Physics (math-ph)
The $4$-dimensional semi-holomorphic Chern-Simons theory of Costello and Yamazaki provides a gauge-theoretic origin for the Lax connection of $2$-dimensional integrable field theories. The purpose of this paper is to extend this framework to the setting of $3$-dimensional integrable field theories by considering a $5$-dimensional semi-holomorphic higher Chern-Simons theory for a higher connection $(A,B)$ on $\mathbb{R}^3 \times \mathbb{C}P^1$. The input data for this theory are the choice of a meromorphic $1$-form $\omega$ on $\mathbb{C}P^1$ and a strict Lie $2$-group with cyclic structure on its underlying Lie $2$-algebra. Integrable field theories on $\mathbb{R}^3$ are constructed by imposing suitable boundary conditions on the connection $(A,B)$ at the $3$-dimensional defects located at the poles of $\omega$ and choosing certain admissible meromorphic solutions of the bulk equations of motion. The latter provides a natural notion of higher Lax connection for $3$-dimensional integrable field theories, including a $2$-form component $B$ which can be integrated over Cauchy surfaces to produce conserved charges. As a first application of this approach, we show how to construct a generalization of Ward's $(2+1)$-dimensional integrable chiral model from a suitable choice of data in the $5$-dimensional theory.
- [182] arXiv:2405.08095 (cross-list from quant-ph) [pdf, ps, html, other]
-
Title: Defining subsystems in Hilbert spaces with non-Euclidean metricComments: 9 pages, 1 figure, comments are welcome!Subjects: Quantum Physics (quant-ph); Mathematical Physics (math-ph)
It is well established that pseudo-Hermitian quantum mechanics is a trivial extension of regular quantum mechanics. The modified inner-product space defined through the so-called metric operator, turns out to be the most natural way to represent certain phenomena such as those involving balanced gain and loss. However, for composite systems undergoing pseudo-Hermitian evolution, defining the subsystems is generally considered feasible only when the metric operator is chosen to have a tensor product. In this work, we use arguments from algebraic quantum mechanics to show that the subsystems can be well-defined in every metric space -- irrespective of whether or not the metric is of tensor product form. This is done by identifying subsystems with a decomposition of the underlying algebra into sub-algebras. In fact, we show that different decompositions of the underlying $C^*-$algebra correspond to choosing different equivalence classes of metric operators. We show how each of the subsystems, defined this way, can be tomographically constructed and that these subsystems satisfy the no-signaling principle. Therefore, we put all the choices of the metric on an equal footing.
- [183] arXiv:2405.08097 (cross-list from cs.LG) [pdf, ps, html, other]
-
Title: Learning functions on symmetric matrices and point clouds via lightweight invariant featuresComments: 28 pages, 2 figures, 2 tablesSubjects: Machine Learning (cs.LG); Commutative Algebra (math.AC)
In this work, we present a mathematical formulation for machine learning of (1) functions on symmetric matrices that are invariant with respect to the action of permutations by conjugation, and (2) functions on point clouds that are invariant with respect to rotations, reflections, and permutations of the points. To achieve this, we construct $O(n^2)$ invariant features derived from generators for the field of rational functions on $n\times n$ symmetric matrices that are invariant under joint permutations of rows and columns. We show that these invariant features can separate all distinct orbits of symmetric matrices except for a measure zero set; such features can be used to universally approximate invariant functions on almost all weighted graphs. For point clouds in a fixed dimension, we prove that the number of invariant features can be reduced, generically without losing expressivity, to $O(n)$, where $n$ is the number of points. We combine these invariant features with DeepSets to learn functions on symmetric matrices and point clouds with varying sizes. We empirically demonstrate the feasibility of our approach on molecule property regression and point cloud distance prediction.
- [184] arXiv:2405.08135 (cross-list from cs.DC) [pdf, ps, html, other]
-
Title: An Optimal Multilevel Quorum System for Probabilistic ConsensusSubjects: Distributed, Parallel, and Cluster Computing (cs.DC); Discrete Mathematics (cs.DM); Probability (math.PR)
We present the notion of a multilevel, slashable quorum system, where an application can obtain gradual levels of assurance that a certain value is bound to be decided (or "finalized") in a global consensus procedure, unless a large number of Byzantine processes are exposed to slashing (that is, penalty on staked assets). Our construction is a highly parameterized generalization of quorum systems based on finite projective spaces, with asymptotic high availability and optimal slashing properties. In particular, we show that any quorum system whose ground elements are disjoint subsets of nodes (e.g. "commmittees" in committee-based consensus protocols) has asymptotic high availability under very reasonable conditions, a general proof with significance of its own. Under similarly relaxed conditions, we show that our construction has asymptotically optimal slashing properties with respect to message complexity and process load; this illustrates a fundamental trade off between message complexity, load, and slashing. Our multilevel construction allows nodes to decide how many "levels" of finalization assurance they wish to obtain, noting that this functionality, if applied to a proof-of-stake blockchain, can be seen either as (i) a form of an early, slashing-based, probabilistic block finalization; or (ii) a service for reorg tolerance.
- [185] arXiv:2405.08152 (cross-list from quant-ph) [pdf, ps, html, other]
-
Title: From Entanglement to Universality: A Multiparticle Spacetime Algebra Approach to Quantum Computational Gates RevisitedComments: 25 pages, 2 figures, 3 tablesSubjects: Quantum Physics (quant-ph); Mathematical Physics (math-ph)
Alternative mathematical explorations in quantum computing can be of great scientific interest, especially if they come with penetrating physical insights. In this paper, we present a critical revisitation of our geometric (Clifford) algebras (GAs) application in quantum computing as originally presented in [C. Cafaro and S. Mancini, Adv. Appl. Clifford Algebras 21, 493 (2011)]. Our focus is on testing the usefulness of geometric algebras (GAs) techniques in two applications to quantum computing. First, making use of the geometric algebra of a relativistic configuration space (a.k.a., multiparticle spacetime algebra or MSTA), we offer an explicit algebraic characterization of one- and two-qubit quantum states together with a MSTA description of one- and two-qubit quantum computational gates. In this first application, we devote special attention to the concept of entanglement, focusing on entangled quantum states and two-qubit entangling quantum gates. Second, exploiting the previously mentioned MSTA characterization together with the GA depiction of the Lie algebras SO(3;R) and SU(2;C) depending on the rotor group formalism, we focus our attention to the concept of universality in quantum computing by reevaluating Boykin's proof on the identification of a suitable set of universal quantum gates. At the end of our mathematical exploration, we arrive at two main conclusions. Firstly, the MSTA perspective leads to a powerful conceptual unification between quantum states and quantum operators. More specifically, the complex qubit space and the complex space of unitary operators acting on them merge in a single multivectorial real space. Secondly, the GA viewpoint on rotations based on the rotor group carries both conceptual and computational upper hands compared to conventional vectorial and matricial methods.
- [186] arXiv:2405.08170 (cross-list from gr-qc) [pdf, ps, html, other]
-
Title: Twistor theory of the Chen--Teo gravitational instantonComments: Dedicated to Nick Woodhouse on the occasion of his 75th birthdaySubjects: General Relativity and Quantum Cosmology (gr-qc); Differential Geometry (math.DG)
Toric Ricci--flat metrics in dimension four correspond to certain holomorphic vector bundles over a twistor space. We construct these bundles explicitly, by exhibiting and characterising their patching matrices, for the five--parameter family of Riemannian ALF metrics constructed by Chen and Teo. The Chen--Teo family contains a two--parameter family of asymptotically flat gravitational instantons. The patching matrices for these instantons take a simple rational form.
- [187] arXiv:2405.08178 (cross-list from gr-qc) [pdf, ps, html, other]
-
Title: A Theoretical Framework for Self-Gravitating k-Form Boson Stars with Internal SymmetriesComments: 58 pages including appendix, both authors are first authorsSubjects: General Relativity and Quantum Cosmology (gr-qc); Mathematical Physics (math-ph)
Current boson star models are largely restricted to global symmetries and lower spin fields. In this work, we generalize these systems of self-gravitating bosonic fields to allow for arbitrary totally antisymmetric tensor fields and arbitrary internal gauge symmetries. We construct a generalized formalism for Yang-Mills-like theories, which allows for arbitrary k-form fields, instead of just vector fields. The k-form fields have gauge symmetries described by semisimple, compact Lie groups. We further derive equations of motion for the k-form fields and connection coefficients of the Lie group. Extensions and applications are also discussed. We present a novel way to fix the group connection using a spacetime connection. As an example, we derive explicitly the connection coefficients for SU(2) in a spherically symmetric spacetime using rectangular vielbeins. The combination of methods presented leads to a powerful, adaptable and practical framework. As a proof of concept, we derive ordinary differential equations for a 0-form field with a SU(2) symmetry. Our framework can be used to model self-gravitating (multi) particle states with internal symmetries, such as pion condensates or dark matter. It is also suited as a tool to approach open problems in modified gravity and string theory.
- [188] arXiv:2405.08223 (cross-list from cs.CL) [pdf, ps, html, other]
-
Title: An information-theoretic model of shallow and deep language comprehensionComments: 6 pages; accepted to COGSCI 2024Subjects: Computation and Language (cs.CL); Information Theory (cs.IT)
A large body of work in psycholinguistics has focused on the idea that online language comprehension can be shallow or `good enough': given constraints on time or available computation, comprehenders may form interpretations of their input that are plausible but inaccurate. However, this idea has not yet been linked with formal theories of computation under resource constraints. Here we use information theory to formulate a model of language comprehension as an optimal trade-off between accuracy and processing depth, formalized as bits of information extracted from the input, which increases with processing time. The model provides a measure of processing effort as the change in processing depth, which we link to EEG signals and reading times. We validate our theory against a large-scale dataset of garden path sentence reading times, and EEG experiments featuring N400, P600 and biphasic ERP effects. By quantifying the timecourse of language processing as it proceeds from shallow to deep, our model provides a unified framework to explain behavioral and neural signatures of language comprehension.
- [189] arXiv:2405.08253 (cross-list from stat.ML) [pdf, ps, other]
-
Title: Thompson Sampling for Infinite-Horizon Discounted Decision ProcessesSubjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
We model a Markov decision process, parametrized by an unknown parameter, and study the asymptotic behavior of a sampling-based algorithm, called Thompson sampling. The standard definition of regret is not always suitable to evaluate a policy, especially when the underlying chain structure is general. We show that the standard (expected) regret can grow (super-)linearly and fails to capture the notion of learning in realistic settings with non-trivial state evolution. By decomposing the standard (expected) regret, we develop a new metric, called the expected residual regret, which forgets the immutable consequences of past actions. Instead, it measures regret against the optimal reward moving forward from the current period. We show that the expected residual regret of the Thompson sampling algorithm is upper bounded by a term which converges exponentially fast to 0. We present conditions under which the posterior sampling error of Thompson sampling converges to 0 almost surely. We then introduce the probabilistic version of the expected residual regret and present conditions under which it converges to 0 almost surely. Thus, we provide a viable concept of learning for sampling algorithms which will serve useful in broader settings than had been considered previously.
- [190] arXiv:2405.08285 (cross-list from cs.NE) [pdf, ps, other]
-
Title: Future Trends in the Design of Memetic Algorithms: the Case of the Linear Ordering ProblemSubjects: Neural and Evolutionary Computing (cs.NE); Optimization and Control (math.OC)
The way heuristic optimizers are designed has evolved over the decades, as computing power has increased. Initially, trajectory metaheuristics used to shape the state of the art in many problems, whereas today, population-based mechanisms tend to be more effective.Such has been the case for the Linear Ordering Problem (LOP), a field in which strategies such as Iterated Local Search and Variable Neighborhood Search led the way during the 1990s, but which have now been surpassed by evolutionary and memetic schemes. This paper focuses on understanding how the design of LOP optimizers will change in the future, as computing power continues to increase, yielding two main contributions. On the one hand, a metaheuristic was designed that is capable of effectively exploiting a large amount of computational resources, specifically, computing power equivalent to what a recent core can output during runs lasting over four months. Our analysis of this aspect relied on parallelization, and allowed us to conclude that as the power of the computational resources increases, it will be necessary to boost the capacities of the intensification methods applied in the memetic algorithms to keep the population from stagnating. And on the other, the best-known results for today's most challenging set of instances (xLOLIB2) were significantly outperformed. Instances with sizes ranging from 300 to 1000 were analyzed, and new bounds were established that provide a frame of reference for future research.
- [191] arXiv:2405.08307 (cross-list from stat.ME) [pdf, ps, other]
-
Title: Sequential Maximal Updated Density Parameter Estimation for Dynamical Systems with Parameter DriftComments: 29 pages, 9 Figures, Code available at this https URLSubjects: Methodology (stat.ME); Numerical Analysis (math.NA); Other Statistics (stat.OT)
We present a novel method for generating sequential parameter estimates and quantifying epistemic uncertainty in dynamical systems within a data-consistent (DC) framework. The DC framework differs from traditional Bayesian approaches due to the incorporation of the push-forward of an initial density, which performs selective regularization in parameter directions not informed by the data in the resulting updated density. This extends a previous study that included the linear Gaussian theory within the DC framework and introduced the maximal updated density (MUD) estimate as an alternative to both least squares and maximum a posterior (MAP) estimates. In this work, we introduce algorithms for operational settings of MUD estimation in real or near-real time where spatio-temporal datasets arrive in packets to provide updated estimates of parameters and identify potential parameter drift. Computational diagnostics within the DC framework prove critical for evaluating (1) the quality of the DC update and MUD estimate and (2) the detection of parameter value drift. The algorithms are applied to estimate (1) wind drag parameters in a high-fidelity storm surge model, (2) thermal diffusivity field for a heat conductivity problem, and (3) changing infection and incubation rates of an epidemiological model.
- [192] arXiv:2405.08343 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Accuracy Evaluation of a Lightweight Analytic Vehicle Dynamics Model for Maneuver PlanningComments: 9 pages, 13 figuresJournal-ref: 2020 5th International Conference on Robotics and Automation Engineering (ICRAE)Subjects: Robotics (cs.RO); Systems and Control (eess.SY); Numerical Analysis (math.NA)
Models for vehicle dynamics play an important role in maneuver planning for automated driving. They are used to derive trajectories from given control inputs, or to evaluate a given trajectory in terms of constraint violation or optimality criteria such as safety, comfort or ecology. Depending on the computation process, models with different assumptions and levels of detail are used; since maneuver planning usually has strong requirements for computation speed at a potentially high number of trajectory evaluations per planning cycle, most of the applied models aim to reduce complexity by implicitly or explicitly introducing simplifying assumptions. While evaluations show that these assumptions may be sufficiently valid under typical conditions, their effect has yet to be studied conclusively.
We propose a model for vehicle dynamics that is convenient for maneuver planning by supporting both an analytic approach of extracting parameters from a given trajectory, and a generative approach of establishing a trajectory from given control inputs. Both applications of the model are evaluated in real-world test drives under dynamic conditions, both on a closed-off test track and on public roads, and effects arising from the simplifying assumptions are analyzed. - [193] arXiv:2405.08401 (cross-list from cs.RO) [pdf, ps, other]
-
Title: Realtime Global Optimization of a Fail-Safe Emergency Stop Maneuver for Arbitrary Electrical / Electronical Failures in Automated DrivingComments: 8 pages, 7 figuresJournal-ref: 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC)Subjects: Robotics (cs.RO); Systems and Control (eess.SY); Numerical Analysis (math.NA)
In the event of a critical system failures in auto-mated vehicles, fail-operational or fail-safe measures provide minimum guarantees for the vehicle's performance, depending on which of its subsystems remain operational. Various such methods have been proposed which, upon failure, use different remaining sets of operational subsystems to execute maneuvers that bring the vehicle into a safe state under different environmental conditions. One particular such method proposes a fail-safe emergency stop system that requires no particular electric or electronic subsystem to be available after failure, and still provides a basic situation-dependent emergency stop maneuver. This is achieved by preemptively setting parameters to a hydraulic / mechanical system prior to failure, which after failure executes the preset maneuver "blindly". The focus of this paper is the particular challenge of implementing a lightweight planning algorithm that can cope with the complex uncertainties of the given task while still providing a globally optimal solution at regular intervals, based on the perceived and predicted environment of the automated vehicle.
- [194] arXiv:2405.08421 (cross-list from cond-mat.dis-nn) [pdf, ps, other]
-
Title: Faster algorithms for the alignment of sparse correlated Erd\"os-R\'enyi random graphsComments: 31 pagesSubjects: Disordered Systems and Neural Networks (cond-mat.dis-nn); Data Structures and Algorithms (cs.DS); Probability (math.PR); Statistics Theory (math.ST)
The correlated Erdös-Rényi random graph ensemble is a probability law on pairs of graphs with $n$ vertices, parametrized by their average degree $\lambda$ and their correlation coefficient $s$. It can be used as a benchmark for the graph alignment problem, in which the labels of the vertices of one of the graphs are reshuffled by an unknown permutation; the goal is to infer this permutation and thus properly match the pairs of vertices in both graphs. A series of recent works has unveiled the role of Otter's constant $\alpha$ (that controls the exponential rate of growth of the number of unlabeled rooted trees as a function of their sizes) in this problem: for $s>\sqrt{\alpha}$ and $\lambda$ large enough it is possible to recover in a time polynomial in $n$ a positive fraction of the hidden permutation. The exponent of this polynomial growth is however quite large and depends on the other parameters, which limits the range of applications of the algorithm. In this work we present a family of faster algorithms for this task, show through numerical simulations that their accuracy is only slightly reduced with respect to the original one, and conjecture that they undergo, in the large $\lambda$ limit, phase transitions at modified Otter's thresholds $\sqrt{\widehat{\alpha}}>\sqrt{\alpha}$, with $\widehat{\alpha}$ related to the enumeration of a restricted family of trees.
- [195] arXiv:2405.08424 (cross-list from cs.LG) [pdf, ps, html, other]
-
Title: Tackling Prevalent Conditions in Unsupervised Combinatorial Optimization: Cardinality, Minimum, Covering, and MoreComments: ICML 2024Subjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
Combinatorial optimization (CO) is naturally discrete, making machine learning based on differentiable optimization inapplicable. Karalias & Loukas (2020) adapted the probabilistic method to incorporate CO into differentiable optimization. Their work ignited the research on unsupervised learning for CO, composed of two main components: probabilistic objectives and derandomization. However, each component confronts unique challenges. First, deriving objectives under various conditions (e.g., cardinality constraints and minimum) is nontrivial. Second, the derandomization process is underexplored, and the existing derandomization methods are either random sampling or naive rounding. In this work, we aim to tackle prevalent (i.e., commonly involved) conditions in unsupervised CO. First, we concretize the targets for objective construction and derandomization with theoretical justification. Then, for various conditions commonly involved in different CO problems, we derive nontrivial objectives and derandomization to meet the targets. Finally, we apply the derivations to various CO problems. Via extensive experiments on synthetic and real-world graphs, we validate the correctness of our derivations and show our empirical superiority w.r.t. both optimization quality and speed.
- [196] arXiv:2405.08523 (cross-list from q-bio.PE) [pdf, ps, html, other]
-
Title: How forest insect outbreaks depend on forest size and tree distribution: an individual-based model resultsSubjects: Populations and Evolution (q-bio.PE); Probability (math.PR)
In this work, an individual-based model of forest insect outbreaks is presented. The results obtained show that the outbreak is an emerging feature of the system. It is a common product of the characteristics of insects, the environment in which the insects live, and the way insects behave in it. The outbreak dynamics is an effect of scale. In a sufficiently large forest regardless of the density of trees and their spatial distribution, provided that the range of insect dispersion is large enough, it develops in the form of an outbreak. In very small forests, the dynamics becomes more chaotic. It loses the outbreak character and, especially in the forest with random tree distribution, there is a possibility that the insect population goes extinct. The local dynamics of the number of insects on one tree in a forest, where the dynamics of all insects has the character of outbreak, is characterized by a rapid increase in number and then a rapid decrease until the extinction of the local population. It is the result of the influx of immigrants from neighboring trees. The type of tree distribution in the forest becomes visible when the density of trees becomes low and/or the range of insect dispersion is small. When trees are uniformly distributed and the range of insect dispersion is small, the system persists as a set of more or less isolated local populations. In the forest with randomly distributed trees, the insect population becomes more susceptible to extinction when the tree density and/or range of insect dispersion are small.
- [197] arXiv:2405.08568 (cross-list from quant-ph) [pdf, ps, other]
-
Title: Generating quantum dissonance via local operationsComments: 13 pages, 2 figuresSubjects: Quantum Physics (quant-ph); Mathematical Physics (math-ph)
Correlations may arise in quantum systems through various means, of which the most remarkable one is quantum entanglement. Additionally, there are systems that exhibit non-classical correlations even in the absence of entanglement. Quantum dissonance refers to how quantum discord (QD) -- the difference between the total correlation and the classical correlation in a given quantum state -- appears as a non-classical correlation in a system without entanglement. It could be said that QD has the potential to provide a more inclusive viewpoint for discerning the non-classical correlations. In this work, we address the problem of manipulating the QD between two subsystems through local operations. We propose two explicit procedures for obtaining separable Werner states, a type of mixed state with nonzero QD. Both approaches involve performing local operations on classically correlated states and offers a step-by-step method for obtaining separable Werner states with nonzero discord, providing an alternative (explicit and user-friendly) to existing methods.
- [198] arXiv:2405.08599 (cross-list from eess.SY) [pdf, ps, other]
-
Title: The distributed biased min-consensus protocol revisited: pre-specified finite time control strategies and small-gain based analysisSubjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
Unlike the classical distributed consensus protocols enabling the group of agents as a whole to reach an agreement regarding a certain quantity of interest in a distributed fashion, the distributed biased min-consensus protocol (DBMC) has been proven to generate advanced complexity pertaining to solving the shortest path problem. As such a protocol is commonly incorporated as the first step of a hierarchical architecture in real applications, e.g., robots path planning, management of dispersed computing services, an impedance limiting the application potential of DBMC lies in, the lack of results regarding to its convergence within a user-assigned time. In this paper, we first propose two control strategies ensuring the state error of DBMC decrease exactly to zero or a desired level manipulated by the user, respectively. To compensate the high feedback gains incurred by these two control strategies, this paper further investigates the nominal DBMC itself. By leveraging small gain based stability tools, this paper also proves the global exponential input-to-state stability of DBMC, outperforming its current stability results. Simulations have been provided to validate the efficacy of our theoretical result.
- [199] arXiv:2405.08659 (cross-list from gr-qc) [pdf, ps, other]
-
Title: Conformal scattering of the wave equation in the Vaidya spacetimeSubjects: General Relativity and Quantum Cosmology (gr-qc); Mathematical Physics (math-ph)
We construct the conformal scattering operator for the scalar wave equation on the Vaidya spacetime using vector field methods. The spacetime we consider is Schwarzschild, near both past and future timelike infinities, in order to use existing decay results for the scalar field, ensuring our energy estimates. These estimates guarantee the injectivity of the trace operator and the closure of its range. Finally, we solve a Goursat problem for the scalar waves on null infinities, demonstrating that the range of the trace operator is dense. Consequently, this implies that the scattering operator is an isomorphism.
- [200] arXiv:2405.08661 (cross-list from cs.LG) [pdf, ps, other]
-
Title: Gradient Estimation and Variance Reduction in Stochastic and Deterministic ModelsComments: cornell university dissertationSubjects: Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)
It seems that in the current age, computers, computation, and data have an increasingly important role to play in scientific research and discovery. This is reflected in part by the rise of machine learning and artificial intelligence, which have become great areas of interest not just for computer science but also for many other fields of study. More generally, there have been trends moving towards the use of bigger, more complex and higher capacity models. It also seems that stochastic models, and stochastic variants of existing deterministic models, have become important research directions in various fields. For all of these types of models, gradient-based optimization remains as the dominant paradigm for model fitting, control, and more. This dissertation considers unconstrained, nonlinear optimization problems, with a focus on the gradient itself, that key quantity which enables the solution of such problems.
In chapter 1, we introduce the notion of reverse differentiation, a term which describes the body of techniques which enables the efficient computation of gradients. We cover relevant techniques both in the deterministic and stochastic cases. We present a new framework for calculating the gradient of problems which involve both deterministic and stochastic elements. In chapter 2, we analyze the properties of the gradient estimator, with a focus on those properties which are typically assumed in convergence proofs of optimization algorithms. Chapter 3 gives various examples of applying our new gradient estimator. We further explore the idea of working with piecewise continuous models, that is, models with distinct branches and if statements which define what specific branch to use. - [201] arXiv:2405.08680 (cross-list from gr-qc) [pdf, ps, other]
-
Title: Generalized uncertainty principle distorted quintessence dynamicsComments: 12. arXiv admin note: text overlap with arXiv:2404.09049Subjects: General Relativity and Quantum Cosmology (gr-qc); High Energy Physics - Theory (hep-th); Mathematical Physics (math-ph); Quantum Physics (quant-ph)
In this paper, we invoke a generalized uncertainty principle (GUP) in the symmetry-reduced cosmological Hamiltonian for a universe driven by a quintessence scalar field with potential. Our study focuses on semi-classical regime. In particular, we derive the GUP-distorted Friedmann, Raychaudhuri, and the Klein-Gordon equation. This is followed by a systematic analysis of the qualitative dynamics for the choice of potential $V(\phi)= V_0 \sinh^{-n}{(\mu \phi)}$. This involves constructing an autonomous dynamical system of equations by choosing appropriate dynamical variables, followed by a qualitative study using linear stability theory. Our analysis shows that incorporating GUP significantly changes the existing fixed points compared to the limiting case without quantum effects by switching off the GUP.
- [202] arXiv:2405.08735 (cross-list from q-bio.PE) [pdf, ps, other]
-
Title: Competition in the nutrient-driven self-cycling fermentation processComments: 17 pages, 2 figuresSubjects: Populations and Evolution (q-bio.PE); Dynamical Systems (math.DS)
Self-cycling fermentation is an automated process used for culturing microorganisms. We consider a model of $n$ distinct species competing for a single non-reproducing nutrient in a self-cycling fermentor in which the nutrient level is used as the decanting condition. The model is formulated in terms of impulsive ordinary differential equations. We prove that two species are able to coexist in the fermentor under certain conditions. We also provide numerical simulations that suggest coexistence of three species is possible and that competitor-mediated coexistence can occur in this case. These results are in contrast to the chemostat, the continuous analogue, where multiple species cannot coexist on a single nonreproducing nutrient.
- [203] arXiv:2405.08741 (cross-list from cs.DM) [pdf, ps, other]
-
Title: On Maximal Families of Binary Polynomials with Pairwise Linear Common FactorsComments: 5 pages. Extended abstract submitted to BFA 2024Subjects: Discrete Mathematics (cs.DM); Cryptography and Security (cs.CR); Combinatorics (math.CO)
We consider the construction of maximal families of polynomials over the finite field $\mathbb{F}_q$, all having the same degree $n$ and a nonzero constant term, where the degree of the GCD of any two polynomials is $d$ with $1 \le d\le n$. The motivation for this problem lies in a recent construction for subspace codes based on cellular automata. More precisely, the minimum distance of such subspace codes relates to the maximum degree $d$ of the pairwise GCD in this family of polynomials. Hence, characterizing the maximal families of such polynomials is equivalent to determining the maximum cardinality of the corresponding subspace codes for a given minimum distance. We first show a lower bound on the cardinality of such families, and then focus on the specific case where $d=1$. There, we characterize the maximal families of polynomials over the binary field $\mathbb{F}_2$. Our findings prompt several more open questions, which we plan to address in an extended version of this work.
- [204] arXiv:2405.08787 (cross-list from cs.DS) [pdf, ps, other]
-
Title: Explicit Orthogonal Arrays and Universal Hashing with Arbitrary ParametersSubjects: Data Structures and Algorithms (cs.DS); Computational Complexity (cs.CC); Combinatorics (math.CO); Statistics Theory (math.ST)
Orthogonal arrays are a type of combinatorial design that were developed in the 1940s in the design of statistical experiments. In 1947, Rao proved a lower bound on the size of any orthogonal array, and raised the problem of constructing arrays of minimum size. Kuperberg, Lovett and Peled (2017) gave a non-constructive existence proof of orthogonal arrays whose size is near-optimal (i.e., within a polynomial of Rao's lower bound), leaving open the question of an algorithmic construction. We give the first explicit, deterministic, algorithmic construction of orthogonal arrays achieving near-optimal size for all parameters. Our construction uses algebraic geometry codes.
In pseudorandomness, the notions of $t$-independent generators or $t$-independent hash functions are equivalent to orthogonal arrays. Classical constructions of $t$-independent hash functions are known when the size of the codomain is a prime power, but very few constructions are known for an arbitrary codomain. Our construction yields algorithmically efficient $t$-independent hash functions for arbitrary domain and codomain. - [205] arXiv:2405.08809 (cross-list from hep-th) [pdf, ps, other]
-
Title: Giving a $KO$ to 8D Gauge AnomaliesComments: 16 pages + referencesSubjects: High Energy Physics - Theory (hep-th); Algebraic Topology (math.AT)
In \cite{Garcia-Etxebarria:2017crf}, it was found that the system of $k$ D7-branes probing an $O7^+$-plane suffers from an $\mathfrak{sp}(k)$ gauge anomaly when $k>1$. These authors then conjectured that this 8D $\mathcal{N}=1$ gauge theory couples to an 8D topological field theory (TFT) such that the total system is anomaly-free, thus acting as a "topological" Green-Schwarz mechanism. In this note, we construct such an 8D TFT and show that it indeed cancels the gauge anomaly. The key step is to engineer the relevant topological operators from D3-branes and fluxbranes placed infinitely far away from the stack of 7-branes. Such symmetry operators have topological vector bundles defined on them whose $KO/KSp$-homology classes play a role in the anomaly cancellation.
- [206] arXiv:nlin/0408039 (cross-list from nlin.AO) [pdf, ps, other]
-
Title: Stability and Diversity in Collective AdaptationComments: 22 pages, 23 figures; updated references, corrected typos, changed contentSubjects: Adaptation and Self-Organizing Systems (nlin.AO); Machine Learning (cs.LG); Dynamical Systems (math.DS); Chaotic Dynamics (nlin.CD); Machine Learning (stat.ML)
We derive a class of macroscopic differential equations that describe collective adaptation, starting from a discrete-time stochastic microscopic model. The behavior of each agent is a dynamic balance between adaptation that locally achieves the best action and memory loss that leads to randomized behavior. We show that, although individual agents interact with their environment and other agents in a purely self-interested way, macroscopic behavior can be interpreted as game dynamics. Application to several familiar, explicit game interactions shows that the adaptation dynamics exhibits a diversity of collective behaviors. The simplicity of the assumptions underlying the macroscopic equations suggests that these behaviors should be expected broadly in collective adaptation. We also analyze the adaptation dynamics from an information-theoretic viewpoint and discuss self-organization induced by information flux between agents, giving a novel view of collective adaptation.
Cross submissions for Wednesday, 15 May 2024 (showing 28 of 28 entries )
- [207] arXiv:2001.01108 (replaced) [pdf, ps, html, other]
-
Title: Normal crossing immersions, cobordisms and flipsSubjects: Geometric Topology (math.GT); Combinatorics (math.CO)
We study various analogues of theorems from PL topology for cubical complexes. In particular, we characterize when two PL homeomorphic cubulations are equivalent by Pachner moves by showing the question to be equivalent to the existence of cobordisms between generic immersions of hypersurfaces. This solves a question and conjecture of Habegger and Funar.
- [208] arXiv:2005.10928 (replaced) [pdf, ps, other]
-
Title: The effect of perturbations on the convergence of attractors for reaction-diffusion equations concerning variations of nonlinear boundary conditionsSubjects: Analysis of PDEs (math.AP)
This paper presents estimates of the convergence of asymptotic dynamics of reaction-diffusion equations with nonlinear boundary conditions. We show how the convergence of the global attractors can be affected by the variations of diffusion coefficients, boundary conditions, and vector fields.
- [209] arXiv:2005.14157 (replaced) [pdf, ps, other]
-
Title: Higher R\'edei reciprocity and integral points on conicsComments: Some results and method superseded by arXiv:2201.13424Subjects: Number Theory (math.NT)
Fix an integer $l$ such that $|l|$ is a prime $3$ modulo $4$. Let $d > 0$ be a squarefree integer and let $N_d(x, y)$ be the principal binary quadratic form of $\mathbb{Q}(\sqrt{d})$. Building on a breakthrough of Alexander Smith, we give an asymptotic formula for the solubility of $N_d(x, y) = l$ in integers $x$ and $y$ as $d$ varies among squarefree integers divisible by $l$.
As a corollary we give, in case $l > 0$, an asymptotic formula for the event that the Hasse Unit Index of the field $\mathbb{Q}(\sqrt{-l}, \sqrt{d})$ is $2$ as $d$ varies over all positive squarefree integers. We also improve the results of Fouvry and Klüners and recent results of Chan, Milovic and the authors on the solubility of the negative Pell equation. Our main new tool is a generalization of a classical reciprocity law due to Rédei. - [210] arXiv:2006.04302 (replaced) [pdf, ps, other]
-
Title: Archimedean Zeta Integrals for Unitary GroupsComments: Accepted for publication in the Journal für die reine und angewandte Mathematik (Crelle's Journal)Subjects: Number Theory (math.NT); Representation Theory (math.RT)
We derive precise formulas for the archimedean Euler factors occurring in certain standard Langlands $L$-functions for unitary groups. In the 1980s, Paul Garrett, as well as Ilya Piatetski-Shapiro and Stephen Rallis (independently of Garrett), discovered integral representations of automorphic $L$-functions that are Eulerian but, in contrast to the Rankin--Selberg and Langlands--Shahidi methods, do not require that the automorphic representations to which the $L$-functions are associated are globally generic. Their approach, the doubling method, opened the door to a variety of applications that could not be handled by prior methods.
For over three decades, though, the integrals occurring in the Euler factors at archimedean places for unitary groups eluded precise computation, except under particular simplifications (such as requiring certain representations to be one-dimensional, as Garrett did in the first major progress on this computation and only prior progress for general signatures). We compute these integrals for holomorphic discrete series of general vector weights for unitary groups of any signature. This has consequences not only for special values of $L$-functions in the archimedean setting, but also for $p$-adic $L$-functions, where the corresponding term had remained open. - [211] arXiv:2006.08144 (replaced) [pdf, ps, other]
-
Title: A Matrix Generalization of the Hardy-Littlewood-P\'olya Rearrangement Inequality and Its ApplicationsSubjects: Functional Analysis (math.FA)
We prove a generalization of the Hardy-Littlewood-Pólya rearrangement inequality to positive definite matrices. The inequality can be seen as a commutation principle in the sense of Iusem and Seeger. An important instrument in the proof is a first-order perturbation formula for a certain spectral function, which could be of independent interests. The inequality is then extended to rectangular matrices. Using our main results, we derive new inequalities for several distance-like functions encountered in various signal processing or machine learning applications.
- [212] arXiv:2012.10290 (replaced) [pdf, ps, other]
-
Title: Building Data for Stacky CoversComments: With changed suggested by the referee. Published in Selecta Mathematica. ArXiv version 46 pagesJournal-ref: Sel. Math. New Ser. 30, 50 (2024)Subjects: Algebraic Geometry (math.AG); Number Theory (math.NT)
We define stacky building data for stacky covers in the spirit of Pardini and give an equivalence of (2,1)-categories between the category of stacky covers and the category of stacky building data. We show that every stacky cover is a flat root stack in the sense of Olsson and Borne--Vistoli and give an intrinsic description of it as a root stack using stacky building data. When the base scheme S is defined over a field, we give a criterion for when a birational building datum comes from a tamely ramified cover for a finite abelian group scheme, generalizing a result of Biswas--Borne.
- [213] arXiv:2102.07196 (replaced) [pdf, ps, html, other]
-
Title: Remarks on the Stanley depth and Hilbert depth of monomial ideals with linear quotientsComments: 11 pages; major revision - we corrected several proofsSubjects: Commutative Algebra (math.AC)
We prove that if $I$ is a monomial ideal with linear quotients in a ring of polynomials $S$ in $n$ indeterminates and $\operatorname{depth}(S/I)=n-2$, then $\operatorname{sdepth}(S/I)=n-2$ and, if $I$ is squarefree, $\operatorname{hdepth}(S/I)=n-2$.
Also, we prove that $\operatorname{sdepth}(S/I)\geq \operatorname{depth}(S/I)$ for a monomial ideal $I$ with linear quotients which satisfies certain technical conditions. - [214] arXiv:2108.09497 (replaced) [pdf, ps, other]
-
Title: Internal and String Stability of an Observer-based Controller for Vehicle Platooning under the MPF TopologySubjects: Optimization and Control (math.OC)
In this paper, we study the internal stability and string stability of a vehicle platoon under the constant time headway spacing (CTHS) policy and the multiple-predecessor-following (MPF) vehicle-to-vehicle information flow topology. More specifically, we depart from the conventional Proportional-Integral-Derivative (PID) controller design for such systems and we propose the design of an observer-based controller. For designing our observer-based controller, we first design a distributed observer, with which each follower estimates their position, speed and acceleration error with respect to the leader. The observer is designed by means of constructing an observer matrix whose parameters should be determined. Next, we simplify the design of the matrix of the observer in such a way that the design boils down to choosing a single scalar value; this design further simplifies the structure of the controller, whose simplicity enables the derivation of string stability conditions by means of a frequency response method. Subsequently, the string stability conditions for a given time headway, are transformed to conditions for the controller parameters. We obtain controller parameters that satisfy the stability conditions by designing a novel heuristic search algorithm. Furthermore, we extend the search algorithm by incorporating a bisection-like algorithm, which allows to obtain (within some deviation tolerance) the minimum available value of the time headway. Finally, we provide insights about how to finalize the observer-based controller parameters from above algorithms to avoid the peaking phenomenon. The performance of the proposed observer-based controller is demonstrated via illustrative examples. Additionally, a comparison with a widely-used PID controller for MPF topology shows that our proposed observer-based controller has better convergence performance.
- [215] arXiv:2109.07463 (replaced) [pdf, ps, other]
-
Title: Bias in cubic Gauss sums: Patterson's conjectureComments: 79 pages; minor corrections, accepted Annals of MathSubjects: Number Theory (math.NT)
Let $W$ be a smooth test function with compact support in $(0,\infty)$. Conditional on the Generalized Riemann Hypothesis for Hecke $L$-functions over $\mathbb{Q}(\omega)$, we prove that $$\sum_{p \equiv 1 \pmod{3}} \frac{1}{2 \sqrt{p}} \cdot \Big ( \sum_{x \pmod{p}} e^{2\pi i x^3 / p} \Big ) W \Big ( \frac{p}{X} \Big ) \sim \frac{(2\pi)^{2/3}}{3 \Gamma(\tfrac 23)} \int_{0}^{\infty} W(x) x^{-1/6} dx \cdot \frac{X^{5/6}}{\log X},$$ as $X \rightarrow \infty$ and $p$ runs over primes. This explains a well-known numerical bias in the distribution of cubic Gauss sums first observed by Kummer in 1846 and confirms (conditionally on the Generalized Riemann Hypothesis) a conjecture of Patterson from 1978.
There are two important byproducts of our proof. The first is an explicit level aspect Voronoi summation formula for cubic Gauss sums, extending computations of Patterson and Yoshimoto. Secondly, we show that Heath-Brown's cubic large sieve is sharp up to factors of $X^{o(1)}$ under the Generalized Riemann Hypothesis. This disproves the popular belief that the cubic large sieve can be improved.
An important ingredient in our proof is a dispersion estimate for cubic Gauss sums. It can be interpreted as a cubic large sieve with correction by a non-trivial asymptotic main term. This estimate relies on the Generalized Riemann Hypothesis, and is one of the fundamental reasons why our result is conditional. - [216] arXiv:2109.09055 (replaced) [pdf, ps, html, other]
-
Title: Combining Learning and Control for Data-driven Approaches of Cyber-Physical SystemsComments: 17Subjects: Optimization and Control (math.OC)
Cyber-physical systems (CPS), in most instances, represent systems of systems with an informationally decentralized structure such as emerging mobility systems, networked control systems, sustainable manufacturing, smart power grids, power systems, mobility markets, social media platforms, cooperation of robots, and internet of things. To optimize the operation of such systems, we typically assume an ideal model. Such model-based control approaches cannot effectively facilitate optimal solutions with performance guarantees due to the discrepancy between the model and the actual CPS. On the other hand, in most CPS there is a large volume of data with a dynamic nature which is added to the system gradually in real time and not altogether in advance. Thus, traditional supervised learning approaches cannot always facilitate robust solutions using data derived offline. By contrast, applying reinforcement learning approaches directly to the actual CPS might impose significant implications on the safety and robust operation of the system. The overarching goal of the Information and Decision Science (IDS) Lab is to investigate how to circumvent these challenges by developing data-driven system approaches at the intersection of learning and control. The emphasis is on how to improve energy efficiency and reduce greenhouse gas emissions in applications related to emerging mobility systems, e.g., connected and automated vehicles (CAVs), shared mobility, sociotechnical systems, and smart cities, and thus contribute to the health of the planet.
- [217] arXiv:2111.13452 (replaced) [pdf, ps, other]
-
Title: Multiplier Submodule Sheaves and a problem of LempertComments: 56 pagesSubjects: Complex Variables (math.CV); Algebraic Geometry (math.AG); Differential Geometry (math.DG)
In this article, we establish an $L^2$ extension theorem for Nakano semi-positive singular Hermitian metrics on holomorphic vector bundles, and the strong openness and stability properties of the multiplier submodule sheaves associated to Nakano semi-positive singular Hermitian metrics on holomorphic vector bundles.
We solve affirmatively a question of Lempert on the preservation of Nakano semi-positivity under limit of an increasing metrics based on Deng-Ning-Wang-Zhou's characterization of Nakano positivity. - [218] arXiv:2204.04568 (replaced) [pdf, ps, other]
-
Title: Generalized Tuza's conjecture for random hypergraphsComments: 32 pages including references and appendix; minor corrections throughout the article; accepted to SIAM J. Discrete MathSubjects: Combinatorics (math.CO)
A celebrated conjecture of Tuza states that in any finite graph the minimum size of a cover of triangles by edges is at most twice the maximum size of a set of edge-disjoint triangles. For an $r$-uniform hypergraph ($r$-graph) $G$, let $\tau(G)$ be the minimum size of a cover of edges by $(r-1)$-sets of vertices, and let $\nu(G)$ be the maximum size of a set of edges pairwise intersecting in fewer than $r-1$ vertices. Aharoni and Zerbib proposed the following generalization of Tuza's conjecture: $$ \text{For any $r$-graph $G$, $\tau(G)/\nu(G) \leq \lceil(r+1)/2\rceil$.} $$
Let $H_r(n,p)$ be the uniformly random $r$-graph on $n$ vertices. We show that, for $r \in \{3, 4, 5\}$ and any $p = p(n)$, $H_r(n,p)$ satisfies the Aharoni-Zerbib conjecture with high probability (i.e., with probability approaching 1 as $n \rightarrow \infty$). We also show that there is a $C < 1$ such that, for any $r \geq 6$ and any $p = p(n)$, $\tau(H_r(n, p))/\nu(H_r(n, p)) \leq C r$ with high probability. Furthermore, we may take $C < 1/2 + \varepsilon$, for any $\varepsilon > 0$, by restricting to sufficiently large $r$ (depending on $\varepsilon$). - [219] arXiv:2205.03379 (replaced) [pdf, ps, other]
-
Title: The Module Structure of a Group Action on a RingComments: Rewrote part of section 3 to clarify argument. Corrected calculations of examples in section 9Subjects: Commutative Algebra (math.AC); Representation Theory (math.RT)
Consider a finite group $G$ acting on a graded Noetherian $k$-algebra $S$, for some field $k$ of characteristic $p$; for example $S$ might be a polynomial ring. Regard $S$ as a $kG$-module and consider the multiplicity of a particular indecomposable module as a summand in each degree. We show how this can be described in terms of homological algebra and how it is linked to the geometry of the group action on the spectrum of $S$.
- [220] arXiv:2207.11093 (replaced) [pdf, ps, other]
-
Title: On moments of integrals with respect to Markov additive processes and of Markov modulated generalized Ornstein-Uhlenbeck processesSubjects: Probability (math.PR)
We establish sufficient conditions for the existence, and derive explicit formulas for the $\kappa$'th moments, $\kappa \geq 1$, of Markov modulated generalized Ornstein-Uhlenbeck processes as well as their stationary distributions. In particular, the running mean, the autocovariance function, and integer moments of the stationary distribution are derived in terms of the characteristics of the driving Markov additive process. Our derivations rely on new general results on moments of Markov additive processes and (multidimensional) integrals with respect to Markov additive processes.
- [221] arXiv:2209.01669 (replaced) [pdf, ps, other]
-
Title: Lambda-invariants of Mazur--Tate elements attached to Ramanujan's tau function and congruences with Eisenstein seriesComments: Added calculations on mu-invariants and made several corrections. To appear in Research in Number TheorySubjects: Number Theory (math.NT)
Let $p\in\{3,5,7\}$ and let $\Delta$ denote the weight twelve modular form arising from Ramanujan's tau function. We show that $\Delta$ is congruent to an Eisenstein series $E_{k,\chi, \psi}$ modulo $p$ for explicit choices of $k$ and Dirichlet characters $\chi$ and $\psi$. We then prove formulae describing the Iwasawa invariants of the Mazur--Tate elements attached to $\Delta$, confirming numerical data gathered by the authors in a previous work.
- [222] arXiv:2210.04999 (replaced) [pdf, ps, other]
-
Title: Edgeworth-type expansion for the one-point distribution of the KPZ fixed point with a large height at a prior locationComments: We fixed all the typos in the notations and rephrased Lemma 3.4Subjects: Probability (math.PR); Mathematical Physics (math-ph)
We consider the Kardar-Parisi-Zhang (KPZ) fixed point $\mathrm{H}(x,\tau)$ with the step initial condition and investigate the distribution of $\mathrm{H}(x,\tau)$ conditioned on a large height at an earlier space-time point $\mathrm{H}(x',\tau')$. As $\mathrm{H}(x',\tau')$ tends to infinity, we prove that the conditional one-point distribution of $\mathrm{H}(x,\tau)$ in the regime $\tau>\tau'$ converges to the Gaussian Unitary Ensemble (GUE) Tracy-Widom distribution and the next two lower-order error terms can be expressed as derivatives of the GUE Tracy-Widom distribution, which is a conceptual analogue to the Edgeworth expansion in the central limit theorem. These KPZ-type limiting behaviors are different from the Gaussian-type ones obtained in \cite{Liu-Wang22} where they study the finite-dimensional distribution of $\mathrm{H}(x,\tau)$ conditioned on a large height at a later space-time point $\mathrm{H}(x',\tau')$. They show, with the step initial condition, that the conditional random field $\mathrm{H}(x,\tau)$ in the regime $\tau<\tau'$ converges to the minimum of two independent Brownian bridges modified by linear drifts as $\mathrm{H}(x',\tau')$ goes to infinity. The two results stated above provide the phase diagram of the asymptotic behaviors of a conditional law of KPZ fixed point in the regimes $\tau>\tau'$ and $\tau<\tau'$ when $\mathrm{H}(x',\tau')$ goes to infinity.
- [223] arXiv:2210.16908 (replaced) [pdf, ps, other]
-
Title: Statistical properties for mixing Markov chains with applications to dynamical systemsComments: 42 pages, 1 figure. Compared to the previous version we added several references and rephrased the main result in a more general settingSubjects: Dynamical Systems (math.DS); Probability (math.PR)
We establish an abstract, effective, exponential large deviations type estimate for Markov systems satisfying a weaker form of mixing. We employ this result to derive such estimates, as well as a central limit theorem, for the skew product encoding a random torus translation, a model we call a mixed random-quasiperiodic dynamical system. This abstract scheme is applicable to many other types of skew product dynamics, including systems for which the spectral gap property for the transition or the transfer operator does not hold.
- [224] arXiv:2211.11234 (replaced) [pdf, ps, other]
-
Title: The measure transfer for subshifts induced by a morphism of free monoidsComments: The second half of section 5, concerning the discussion around "recognizable for aperiodic points" (including Figure 1), is new. Small errors have been corrected, and parts of the introduction has been rewrittenSubjects: Dynamical Systems (math.DS)
Every non-erasing monoid morphism $\sigma: {\cal A}^* \to {\cal B}^*$ induces a measure transfer map $\sigma_X^{\cal M}: {\cal M}(X) \to {\cal M}(\sigma(X))$ between the measure cones ${\cal M}(X)$ and ${\cal M}(\sigma(X))$, associated to any subshift $X \subseteq {\cal A}^\mathbb Z$ and its image subshift $\sigma(X) \subseteq {\cal B}^\mathbb Z$ respectively. We define and study this map in detail and show that it is continuous, linear and functorial. It also turns out to be surjective. Furthermore, an efficient technique to compute the value of the transferred measure $\sigma_X^{\cal M}(\mu)$ on any cylinder $[w]$ (for $w \in {\cal B}^\mathbb Z$) is presented.
Theorem: If a non-erasing morphism $\sigma: {\cal A}^* \to {\cal B}^*$ is injective on the shift-orbits of some subshift $X \subseteq {\cal A}^\mathbb Z$, then $\sigma^{\cal M}_X$ is bijective.
The assumption on $\sigma$ that it is "injective on the shift-orbits of $X$" is strictly weaker than "recognizable in $X$", and strictly stronger than "recognizable for aperiodic points in $X$". The last assumption does in general not suffice to obtain the injectivity of the measure transfer map $\sigma_X^{\cal M}$. - [225] arXiv:2212.02388 (replaced) [pdf, ps, html, other]
-
Title: Bounded-Degree Planar Graphs Do Not Have Bounded-Degree Product StructureComments: Small corrections, clearer notationSubjects: Combinatorics (math.CO)
Product structure theorems are a collection of recent results that have been used to resolve a number of longstanding open problems on planar graphs and related graph classes. One particularly useful version states that every planar graph $G$ is contained in the strong product of a $3$-tree $H$, a path $P$, and a $3$-cycle $K_3$; written as $G\subseteq H\boxtimes P\boxtimes K_3$. A number of researchers have asked if this theorem can be strengthened so that the maximum degree in $H$ can be bounded by a function of the maximum degree in $G$. We show that no such strengthening is possible. Specifically, we describe an infinite family $\mathcal{G}$ of planar graphs of maximum degree $5$ such that, if an $n$-vertex member $G$ of $\mathcal{G}$ is isomorphic to a subgraph of $H\boxtimes P\boxtimes K_c$ where $P$ is a path and $H$ is a graph of maximum degree $\Delta$ and treewidth $t$, then $t\Delta c \ge 2^{\Omega(\sqrt{\log\log n})}$.
- [226] arXiv:2212.07227 (replaced) [pdf, ps, html, other]
-
Title: Hyperelliptic curves and Ulrich sheaves on the complete intersection of two quadricsComments: 26 pages, improved expositionSubjects: Algebraic Geometry (math.AG)
Using the connection between hyperelliptic curves, Clifford algebras, and complete intersections $X$ of two quadrics, we describe Ulrich bundles on $X$ and construct some of minimal possible rank.
- [227] arXiv:2212.07561 (replaced) [pdf, ps, html, other]
-
Title: Monotonicity of the period map for the equation $-\varphi''+\varphi-\varphi^{k}=0$Comments: The correction of a small imperfection in the proof of Lemma 3.3 has been made. Accepted for publication in Monatshefte für Mathematik (2024)Subjects: Dynamical Systems (math.DS); Classical Analysis and ODEs (math.CA)
In this paper, we establish the monotonicity of the period map in terms of the energy levels for certain periodic solutions of the equation $-\varphi''+\varphi-\varphi^{k}=0$, where $k>1$ is a real number. We present a new approach to demonstrate this property, utilizing spectral information of the corresponding linearized operator around the periodic solution and tools related to Floquet theory.
- [228] arXiv:2301.00712 (replaced) [pdf, ps, html, other]
-
Title: On Finding Small Hyper-Gradients in Bilevel Optimization: Hardness Results and Improved AnalysisComments: Add new upper bounds of nonconvex-PL bilevel problems compared to arXiv version 1 in 2023.1Subjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Bilevel optimization reveals the inner structure of otherwise oblique optimization problems, such as hyperparameter tuning, neural architecture search, and meta-learning. A common goal in bilevel optimization is to minimize a hyper-objective that implicitly depends on the solution set of the lower-level function. Although this hyper-objective approach is widely used, its theoretical properties have not been thoroughly investigated in cases where the lower-level functions lack strong convexity. In this work, we first provide hardness results to show that the goal of finding stationary points of the hyper-objective for nonconvex-convex bilevel optimization can be intractable for zero-respecting algorithms. Then we study a class of tractable nonconvex-nonconvex bilevel problems when the lower-level function satisfies the Polyak-Łojasiewicz (PL) condition. We show a simple first-order algorithm can achieve better complexity bounds of $\tilde{\mathcal{O}}(\epsilon^{-2})$, $\tilde{\mathcal{O}}(\epsilon^{-4})$ and $\tilde{\mathcal{O}}(\epsilon^{-6})$ in the deterministic, partially stochastic, and fully stochastic setting respectively.
- [229] arXiv:2301.06428 (replaced) [pdf, ps, other]
-
Title: Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic OptimizationComments: ICML 2023Subjects: Optimization and Control (math.OC); Machine Learning (cs.LG)
We consider the optimization problem of the form $\min_{x \in \mathbb{R}^d} f(x) \triangleq \mathbb{E}_{\xi} [F(x; \xi)]$, where the component $F(x;\xi)$ is $L$-mean-squared Lipschitz but possibly nonconvex and nonsmooth. The recently proposed gradient-free method requires at most $\mathcal{O}( L^4 d^{3/2} \epsilon^{-4} + \Delta L^3 d^{3/2} \delta^{-1} \epsilon^{-4})$ stochastic zeroth-order oracle complexity to find a $(\delta,\epsilon)$-Goldstein stationary point of objective function, where $\Delta = f(x_0) - \inf_{x \in \mathbb{R}^d} f(x)$ and $x_0$ is the initial point of the algorithm. This paper proposes a more efficient algorithm using stochastic recursive gradient estimators, which improves the complexity to $\mathcal{O}(L^3 d^{3/2} \epsilon^{-3}+ \Delta L^2 d^{3/2} \delta^{-1} \epsilon^{-3})$.
- [230] arXiv:2301.08662 (replaced) [pdf, ps, html, other]
-
Title: Identification and existence of Boltzmann processesSubjects: Probability (math.PR)
The stochastic differential equation of McKean-Vlasov type is identified such that the Fokker-Planck equation associated to it is the Boltzmann equation. Hence, we call its solutions as Boltzmann processes. They describe the dynamics (in position and velocity) of particles expanding in vacuum in accordance with the Boltzmann equation. Given a solution $f:=$ $\{f(t,x,v\}_{0 \leq t \leq T} $ of the Boltzmann equation, the existence of solutions to the McKean-Vlasov SDE is established for the non-cutoff hard sphere case.
- [231] arXiv:2302.07382 (replaced) [pdf, ps, other]
-
Title: Free extreme points span generalized free spectrahedra given by compact coefficientsComments: Version 1 and 2 incorrectly claimed to prove that free spectrahedrops are spanned by free extreme points. The proof had an error and there are counterexamples to the claimed dimension bound. The error was in Lemma 3.1 and Theorem 3.7 in version 2. A compactness argument was used on the "smallest nonzero eigenvalue" function. This function is not continuous so the conclusion does not holdSubjects: Operator Algebras (math.OA)
Matrix convexity generalizes convexity to the dimension free setting and has connections to many mathematical and applied pursuits including operator theory, quantum information, noncommutative optimization, and linear control systems. In the setting of classical convex sets, extreme points are central objects which exhibit many important properties. For example, the Minkowski theorem shows that any element of a closed bounded convex set can be expressed as a convex combination of extreme points. Extreme points are also of great interest in the dimension free setting of matrix convex sets; however, here the situation requires more nuance.
In the dimension free setting, there are many different types of extreme points. Of particular importance are free extreme points, a highly restricted type of extreme point that is closely connected to the dilation theoretic Arveson boundary. If free extreme points span a matrix convex set through matrix convex combinations, then they satisfy a strong notion of minimality in doing so. However, not all closed bounded matrix convex sets even have free extreme points. Thus, a major goal is to determine which matrix convex sets are spanned by their free extreme points.
Building on a recent work of J. W. Helton and the author which shows that free spectrahedra, i.e., dimension free solution sets to linear matrix inequalities, are spanned by their free extreme points, we establish two additional classes of matrix convex sets which are the matrix convex hull of their free extreme points. Namely, we show that closed bounded free spectrahedrops, i.e, closed bounded projections of free spectrahedra, are the span of their free extreme points. Furthermore, we show that if one considers linear operator inequalities that have compact operator defining tuples, then the resulting ``generalized" free spectrahedra are spanned by their free extreme points. - [232] arXiv:2303.08108 (replaced) [pdf, ps, other]
-
Title: Clairaut semi-invariant Riemannian maps to Kaehler manifoldsComments: 19 pagesJournal-ref: Mediterranean Journal of Mathematics, Vol. 21, No. 3, Article 121 (2024)Subjects: Differential Geometry (math.DG)
In this paper, first, we recall the notion of Clairaut Riemannian map (CRM) ${F}$ using a geodesic curve on the base manifold and give the Ricci equation. We also show that if base manifold of CRM is space form then leaves of $(ker{F}_\ast)^\perp$ become space forms and symmetric as well. Secondly, we define Clairaut semi-invariant Riemannian map (CSIRM) from a Riemannian manifold $(M, g_{M})$ to a Kähler manifold $(N, g_{N}, P)$ with a non-trivial example. We find necessary and sufficient conditions for a curve on the base manifold of semi-invariant Riemannian map (SIRM) to be geodesic. Further, we obtain necessary and sufficient conditions for a SIRM to be CSIRM. Moreover, we find necessary and sufficient condition for CSIRM to be harmonic and totally geodesic. In addition, we find necessary and sufficient condition for the distributions $\bar{D_1}$ and $\bar{D_2}$ of $(ker{F}_\ast)^\bot$ (which are arisen from the definition of CSIRM) to define totally geodesic foliations. Finally, we obtain necessary and sufficient conditions for $(ker{F}_\ast)^\bot$ and base manifold to be locally product manifold $\bar{D_1} \times \bar{D_2}$ and ${(range{F}_\ast)} \times {(range{F}_\ast)^\bot}$, respectively.
- [233] arXiv:2305.01290 (replaced) [pdf, ps, other]
-
Title: A Construction of Arbitrarily Large Type-II $Z$ Complementary Code SetSubjects: Information Theory (cs.IT)
For a type-I $(K,M,Z,N)$-ZCCS, it follows $K \leq M \left\lfloor \frac{N}{Z}\right\rfloor$. In this paper, we propose a construction of type-II $(p^{k+n},p^k,p^{n+r}-p^r+1,p^{n+r})$-$Z$ complementary code set (ZCCS) using an extended Boolean function, its properties of Hamiltonian paths and the concept of isolated vertices, where $p\ge 2$. However, the proposed type-II ZCCS provides $K = M(N-Z+1)$ codes, where as for type-I $(K,M,N,Z)$-ZCCS, it is $K \leq M \left\lfloor \frac{N}{Z}\right\rfloor$. Therefore, the proposed type-II ZCCS provides a larger number of codes compared to type-I ZCCS. Further, as a special case of the proposed construction, $(p^k,p^k,p^n)$-CCC can be generated, for any integral value of $p\ge2$ and $k\le n$.
- [234] arXiv:2305.04452 (replaced) [pdf, ps, html, other]
-
Title: On the regular representation of solvable Lie groups with open coadjoint quasi-orbitsComments: 15 pages; we have overhauled the material of the former versions of the preprints 2111.01034 [math.RT] and 2305.04452 [math.RT]Subjects: Representation Theory (math.RT)
We obtain a Lie theoretic intrinsic characterization of the connected and simply connected solvable Lie groups whose regular representation is a factor representation. When this is the case, the corresponding von Neumann algebras are isomorphic to the hyperfinite II$_\infty$ factor, and every Casimir function is constant. We thus obtain a family of geometric models for the standard representation of that factor. Finally, we show that the regular representation of any connected and simply connected solvable Lie group with open coadjoint orbits is always of type I, though the group needs not be of type I, and include some relevant examples.
- [235] arXiv:2305.17693 (replaced) [pdf, ps, other]
-
Title: Deflation for the off-diagonal block in symmetric saddle point systemsComments: 28 pages, 13 figuresSubjects: Numerical Analysis (math.NA)
Deflation techniques are typically used to shift isolated clusters of small eigenvalues in order to obtain a tighter distribution and a smaller condition number. Such changes induce a positive effect in the convergence behavior of Krylov subspace methods, which are among the most popular iterative solvers for large sparse linear systems. We develop a deflation strategy for symmetric saddle point matrices by taking advantage of their underlying block structure. The vectors used for deflation come from an elliptic singular value decomposition relying on the generalized Golub-Kahan bidiagonalization process. The block targeted by deflation is the off-diagonal one since it features a problematic singular value distribution for certain applications. One example is the Stokes flow in elongated channels, where the off-diagonal block has several small, isolated singular values, depending on the length of the channel. Applying deflation to specific parts of the saddle point system is important when using solvers such as CRAIG, which operates on individual blocks rather than the whole system. The theory is developed by extending the existing framework for deflating square matrices before applying a Krylov subspace method like MINRES. Numerical experiments confirm the merits of our strategy and lead to interesting questions about using approximate vectors for deflation.
- [236] arXiv:2306.02345 (replaced) [pdf, ps, html, other]
-
Title: Configuration spaces as commutative monoidsComments: 11 pages, including an appendix with Quoc P. Ho. v2: 13 pages. Accepted version, to appear in the Bulletin of the LMSSubjects: Algebraic Topology (math.AT)
After 1-point compactification, the collection of all unordered configuration spaces of a manifold admits a commutative multiplication by superposition of configurations. We explain a simple (derived) presentation for this commutative monoid object. Using this presentation, one can quickly deduce Knudsen's formula for the rational cohomology of configuration spaces, prove rational homological stability, and understand how automorphisms of the manifold act on the cohomology of configuration spaces. Similar considerations reproduce the work of Farb--Wolfson--Wood on homological densities.
- [237] arXiv:2306.04981 (replaced) [pdf, ps, other]
-
Title: Computation of a Unified Graph-Based Rate Optimization ProblemSubjects: Information Theory (cs.IT)
We define a graph-based rate optimization problem and consider its computation, which provides a unified approach to the computation of various theoretical limits, such as the (conditional) graph entropy, rate-distortion functions and capacity-cost functions with two-sided information. Our contributions are twofold.
On the theoretical side, we simplify the graph-based problem by constructing explicit graph contractions in some special cases. These efforts reduce the number of decision variables in the optimization problem. Graph characterizations for rate-distortion and capacity-cost functions with two-sided information are simplified by specializing the results.
On the computational side, we design an alternating minimization algorithm for the graph-based problem, which deals with the inequality constraint by a flexible multiplier update strategy. Moreover, deflation techniques are introduced, so that the computing time can be largely reduced. Theoretical analysis shows that the algorithm converges to an optimal solution. The accuracy and efficiency of the algorithm are illustrated by numerical experiments. - [238] arXiv:2306.06045 (replaced) [pdf, ps, other]
-
Title: Global solution and blow-up for the SKT model in Population DynamicsSubjects: Analysis of PDEs (math.AP); Dynamical Systems (math.DS)
In this paper, we prove the existence and uniqueness of the global solution to the reaction diffusion system SKT with homogeneous Newmann boundary conditions. We use the lower and upper solution method and its associated monotone iterations where the reaction functions are locally Lipschitz .We study the blowing-up property of the solution, we give a sufficient condition on the reaction parameters of the model to ensure the blow-up of the solution continuous functions spaces.
- [239] arXiv:2306.11286 (replaced) [pdf, ps, html, other]
-
Title: Globally optimal solutions to a class of fractional optimization problems based on proximity gradient algorithmComments: 29 pages, 2 figuresSubjects: Optimization and Control (math.OC); Numerical Analysis (math.NA)
In this paper, we investigate a category of constrained fractional optimization problems that emerge in various practical applications. The objective function for this category is characterized by the ratio of a numerator and denominator, both being convex, semi-algebraic, Lipschitz continuous, and differentiable with Lipschitz continuous gradients over the constraint sets. The constrained sets associated with these problems are closed, convex, and semi-algebraic. We propose an efficient algorithm that is inspired by the proximal gradient method, and we provide a thorough convergence analysis. Our algorithm offers several benefits compared to existing methods. It requires only a single proximal gradient operation per iteration, thus avoiding the complicated inner-loop concave maximization usually required. Additionally, our method converges to a critical point without the typical need for a nonnegative numerator, and this critical point becomes a globally optimal solution with an appropriate condition. Our approach is adaptable to unbounded constraint sets as well. Therefore, our approach is viable for many more practical models. Numerical experiments show that our method not only reliably reaches ground-truth solutions in some model problems but also outperforms several existing methods in maximizing the Sharpe ratio with real-world financial data.
- [240] arXiv:2306.13618 (replaced) [pdf, ps, other]
-
Title: Unbalanced Optimal Transport and Maximum Mean Discrepancies: Interconnections and Rapid EvaluationSubjects: Optimization and Control (math.OC)
This contribution presents substantial computational advancements to compare measures even with varying masses. Specifically, we utilize the nonequispaced fast Fourier transform to accelerate the radial kernel convolution in unbalanced optimal transport approximation, built upon the Sinkhorn algorithm. We also present accelerated schemes for maximum mean discrepancies involving kernels. Our approaches reduce the arithmetic operations needed to compute distances from $\mathcal O(n^2)$ to $\mathcal O(n\log n)$, opening the door to handle large and high-dimensional datasets efficiently. Furthermore, we establish robust connections between transportation problems, encompassing Wasserstein distance and unbalanced optimal transport, and maximum mean discrepancies. This empowers practitioners with compelling rationale to opt for adaptable distances.
- [241] arXiv:2307.01335 (replaced) [pdf, ps, other]
-
Title: Waves in cosmological background with static Schwarzschild radius in the expanding universeSubjects: Analysis of PDEs (math.AP); Mathematical Physics (math-ph)
In this paper, we prove the existence of global in time small data solutions of semilinear Klein-Gordon equations in space-time with a static Schwarzschild radius in the expanding universe.
- [242] arXiv:2307.02769 (replaced) [pdf, ps, other]
-
Title: The Goldman bracket characterizes homeomorphisms between non-compact surfacesComments: to appear in Algebraic and Geometric TopologySubjects: Geometric Topology (math.GT)
We show that a homotopy equivalence between two non-compact orientable surfaces is homotopic to a homeomorphism if and only if it preserves the Goldman bracket, provided our surfaces are neither the plane nor the punctured plane.
- [243] arXiv:2307.06317 (replaced) [pdf, ps, other]
-
Title: A geometric classification of rod complements in the 3-torusComments: 15 pages, 1 figure. Two typos corrected, acknowledgements updated. To appear in Proc AMSSubjects: Geometric Topology (math.GT)
Rod packings are used in crystallography to describe crystal structures with linear or zigzag chains of particles, and each rod packing can be topologically viewed as a collection of disjoint geodesics in the 3-torus. Hui and Purcell developed a method to study the complements of rods in the 3-torus with the use of 3-dimensional geometry and tools from the 3-sphere, and they partially classified the geometry of some families of rod complements in the 3-torus. In this paper, we provide a complete classification of the geometry of all rod complements in the 3-torus using topological arguments.
- [244] arXiv:2307.10053 (replaced) [pdf, ps, other]
-
Title: SGD-type Methods with Guaranteed Global Stability in Nonsmooth Nonconvex OptimizationComments: 36 pagesSubjects: Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
In this paper, we focus on providing convergence guarantees for variants of the stochastic subgradient descent (SGD) method in minimizing nonsmooth nonconvex functions. We first develop a general framework to establish global stability for general stochastic subgradient methods, where the corresponding differential inclusion admits a coercive Lyapunov function. We prove that, with sufficiently small stepsizes and controlled noises, the iterates asymptotically stabilize around the stable set of its corresponding differential inclusion. Then we introduce a scheme for developing SGD-type methods with regularized update directions for the primal variables. Based on our developed framework, we prove the global stability of our proposed scheme under mild conditions. We further illustrate that our scheme yields variants of SGD-type methods, which enjoy guaranteed convergence in training nonsmooth neural networks. In particular, by employing the sign map to regularize the update directions, we propose a novel subgradient method named the Sign-map Regularized SGD method (SRSGD). Preliminary numerical experiments exhibit the high efficiency of SRSGD in training deep neural networks.
- [245] arXiv:2309.00405 (replaced) [pdf, ps, other]
-
Title: Hamiltonian for the Hilbert-P\'olya ConjectureComments: 15 pagesSubjects: Mathematical Physics (math-ph); Quantum Physics (quant-ph)
We introduce a Hamiltonian to address the Hilbert-Pólya conjecture. The eigenfunctions of the introduced Hamiltonian, subject to the Dirichlet boundary conditions on the positive half-line, vanish at the origin by the nontrivial zeros of the Riemann zeta function. Consequently, the eigenvalues are determined by these nontrivial Riemann zeros. If the Riemann hypothesis (RH) is true, the eigenvalues become real and represent the imaginary parts of the nontrivial zeros. Conversely, if the Hamiltonian is self-adjoint, or more generally, admits only real eigenvalues, then the RH follows. In our attempt to demonstrate the latter, we establish the existence of a similarity transformation of the introduced Hamiltonian that is self-adjoint on the domain specified by an appropriate boundary condition, which is satisfied by the eigenfunctions through the vanishing of the Riemann zeta function. Our result can be extended to a broader class of functions whose zeros lie on the critical line.
- [246] arXiv:2309.15449 (replaced) [pdf, ps, other]
-
Title: Spinal constructions for continuous type-space branching processes with interactionsComments: 44 pages, 4 figuresSubjects: Probability (math.PR)
We consider branching processes describing structured, interacting populations in continuous time. Dynamics of each individuals characteristics and branching properties can be influenced by the entire population. We propose a Girsanov-type result based on a spinal construction, and establish a many-to-one formula. By combining this result with the spinal decomposition, we derive a generalized continuous-time version of the Kesten-Stigum theorem that incorporates interactions. Additionally, we propose an alternative approach of the spine construction for exact simulations of stochastic size-dependent populations.
- [247] arXiv:2309.15615 (replaced) [pdf, ps, other]
-
Title: Mappings of finite distortion on metric surfacesSubjects: Metric Geometry (math.MG); Complex Variables (math.CV)
We investigate basic properties of mappings of finite distortion $f:X \to \mathbb{R}^2$, where $X$ is any metric surface, i.e., metric space homeomorphic to a planar domain with locally finite $2$-dimensional Hausdorff measure. We introduce lower gradients, which complement the upper gradients of Heinonen and Koskela, to study the distortion of non-homeomorphic maps on metric spaces. We extend the Iwaniec-Šverák theorem to metric surfaces: a non-constant $f:X \to \mathbb{R}^2$ with locally square integrable upper gradient and locally integrable distortion is continuous, open and discrete. We also extend the Hencl-Koskela theorem by showing that if $f$ is moreover injective then $f^{-1}$ is a Sobolev map.
- [248] arXiv:2310.03722 (replaced) [pdf, ps, html, other]
-
Title: Anytime-valid t-tests and confidence sequences for Gaussian means with unknown varianceComments: Substantive revision in v3 (Apr 23 2024)Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG); Methodology (stat.ME); Machine Learning (stat.ML)
In 1976, Lai constructed a nontrivial confidence sequence for the mean $\mu$ of a Gaussian distribution with unknown variance $\sigma^2$. Curiously, he employed both an improper (right Haar) mixture over $\sigma$ and an improper (flat) mixture over $\mu$. Here, we elaborate carefully on the details of his construction, which use generalized nonintegrable martingales and an extended Ville's inequality. While this does yield a sequential t-test, it does not yield an "e-process" (due to the nonintegrability of his martingale). In this paper, we develop two new e-processes and confidence sequences for the same setting: one is a test martingale in a reduced filtration, while the other is an e-process in the canonical data filtration. These are respectively obtained by swapping Lai's flat mixture for a Gaussian mixture, and swapping the right Haar mixture over $\sigma$ with the maximum likelihood estimate under the null, as done in universal inference. We also analyze the width of resulting confidence sequences, which have a curious polynomial dependence on the error probability $\alpha$ that we prove to be not only unavoidable, but (for universal inference) even better than the classical fixed-sample t-test. Numerical experiments are provided along the way to compare and contrast the various approaches, including some recent suboptimal ones.
- [249] arXiv:2310.05726 (replaced) [pdf, ps, html, other]
-
Title: New Partial Trace Inequalities and Distillability of Werner StatesSubjects: Mathematical Physics (math-ph); Quantum Physics (quant-ph)
One of the oldest problems in quantum information theory is to study whether any undistillable state has a positive partial transpose (PPT). This problem has been open for almost 30 years, and still no one has been able to give a complete answer to it. This work presents a new strategy to try to solve this problem by translating the distillability condition on the family of Werner states into a problem of partial trace inequalities. We present our two main results and a new bound for the 2-distillability, $\alpha \geq -\frac{1}{4}$. Moreover, we present throughout this work numerous partial trace inequalities, which are valid for many families of matrices.
- [250] arXiv:2310.13507 (replaced) [pdf, ps, other]
-
Title: Matsumoto theorem for skeletaComments: 10 pages, minor change in ntroductionSubjects: Representation Theory (math.RT)
We present a proof of a generalization of the theorem of H.~Matsumoto on Coxeter groups. Our generalized version is applicable to "graphs admitting geometric realization". The original version of the theorem for Coxeter groups is a special case when applied to the Cayley graph and the geometric representation of a Coxeter group. Our version of Matsumoto theorem is also applicable to skeleta, graphs that were defined in the recent paper by the authors on root Lie superalgebras.
- [251] arXiv:2310.16016 (replaced) [pdf, ps, html, other]
-
Title: Uniform asymptotic expansions for the zeros of Bessel functionsSubjects: Classical Analysis and ODEs (math.CA)
Reformulated uniform asymptotic expansions are derived for ordinary differential equations having a large parameter and a simple turning point. These involve Airy functions, but not their derivatives, unlike traditional asymptotic expansions. From these, asymptotic expansions are derived for the zeros of Bessel functions that are valid for large positive values of the order, uniformly valid for all the zeros. The coefficients in the expansions are explicitly given elementary functions, and similar expansions are derived for the zeros of the derivatives of Bessel functions.
- [252] arXiv:2310.20674 (replaced) [pdf, ps, other]
-
Title: Linear and nonlinear instability of vortex columnsComments: 39 pages, 2 figuresSubjects: Analysis of PDEs (math.AP)
We consider vortex column solutions $v = V(r) e_\theta + W(r) e_z$ to the $3$D Euler equations. We give a mathematically rigorous construction of the countable family of unstable modes discovered by Liebovich and Stewartson (J. Fluid Mech. 126, 1983) via formal asymptotic analysis. The unstable modes exhibit $O(1)$ growth rates and concentrate on a ring $r= r_0$ asymptotically as the azimuthal and axial wavenumbers $n, \alpha \to \infty$ with a fixed ratio. We construct these so-called ring modes with an inner-outer gluing procedure. Finally, we prove that each linear instability implies nonlinear instability for vortex columns. In particular, our analysis yields nonlinear instability for the Batchelor trailing line vortex $V(r) := \frac{q}{r} (1-\mathrm{e}^{-r^2})$ and $W(r) := \mathrm{e}^{-r^2}$ when $0 < q \ll 1$.
- [253] arXiv:2311.00435 (replaced) [pdf, ps, html, other]
-
Title: Stabilization of divisors in high-dimensional contact manifoldsComments: 33 pages, 10 figures. v2 Updates: New title and notation updates. Theorem 1.8 added. Improved exposition including description of surgeries on transverse linksSubjects: Symplectic Geometry (math.SG); Geometric Topology (math.GT)
A stabilization operation is defined for codimension $2$ contact submanifolds in $\dim \geq 5$ contact manifolds $(M, \xi)$. The definition is such that (1) a given $(M, \xi)$ is overtwisted iff its standard transverse unknot is stabilized and (2) transverse stabilization preserves the formal contact isotopy class and intrinsic contact structure of a link. We prove that many transverse links are non-simple.
- [254] arXiv:2311.05603 (replaced) [pdf, ps, html, other]
-
Title: On the relaxation of Gauss's capillarity theory under spanning conditionsJournal-ref: J Geom Anal 34, 228 (2024)Subjects: Analysis of PDEs (math.AP)
We study a variational model for soap films in which the films are represented by sets with fixed small volume rather than surfaces. In this problem, a minimizing sequence of completely "wet" films, or sets of finite perimeter spanning a wire frame, may converge to a film containing both wet regions of positive volume and collapsed (dry) surfaces. When collapsing occurs, these limiting objects lie outside the original minimization class and instead are admissible for a relaxed problem. Here we show that the relaxation and the original formulation are equivalent by approximating the collapsed films in the relaxed class by wet films in the original class.
- [255] arXiv:2311.14201 (replaced) [pdf, ps, other]
-
Title: On the convergence of adaptive approximations for stochastic differential equationsComments: 33 pages, 4 figuresSubjects: Numerical Analysis (math.NA); Probability (math.PR)
In this paper, we study numerical approximations for stochastic differential equations (SDEs) that use adaptive step sizes. In particular, we consider a general setting where decisions to reduce step sizes are allowed to depend on the future trajectory of the underlying Brownian motion. Since these adaptive step sizes may not be previsible, the standard mean squared error analysis cannot be directly applied to show that the numerical method converges to the solution of the SDE. Building upon the pioneering work of Gaines and Lyons, we shall instead use rough path theory to establish convergence for a wide class of adaptive numerical methods on general Stratonovich SDEs (with sufficiently smooth vector fields). To our knowledge, this is the first convergence guarantee that applies to standard solvers, such as the Milstein and Heun methods, with non-previsible step sizes. In our analysis, we require the sequence of adaptive step sizes to be nested and the SDE solver to have unbiased "Lévy area" terms in its Taylor expansion. We conjecture that for adaptive SDE solvers more generally, convergence is still possible provided the method does not introduce "Lévy area bias". We present a simple example where the step size control can skip over previously considered times, resulting in the numerical method converging to an incorrect limit (i.e. not the Stratonovich SDE). Finally, we conclude with an experiment demonstrating the accuracy of Heun's method and a newly introduced Splitting Path-based Runge-Kutta scheme (SPaRK) when used with adaptive step sizes.
- [256] arXiv:2311.15457 (replaced) [pdf, ps, other]
-
Title: Fonctions d'une variable $p$-adique et repr\'esentations de ${\rm GL}_2(\mathbf{Q}_p)$Comments: 20 pages, in French language, accepted by JNTSubjects: Number Theory (math.NT)
We extend the dictionary between Fontaine rings and $p$-adic functionnal analysis, and we give a refinement of the $p$-adic local Langlands correspondence for principal series representations of ${\rm GL}_2(\mathbf{Q}_p)$.
- [257] arXiv:2312.00172 (replaced) [pdf, ps, html, other]
-
Title: Projected exponential methods for stiff dynamical low-rank approximation problemsComments: 39 pages, 9 figuresSubjects: Numerical Analysis (math.NA)
The numerical integration of stiff equations is a challenging problem that needs to be approached by specialized numerical methods. Exponential integrators form a popular class of such methods since they are provably robust to stiffness and have been successfully applied to a variety of problems. The dynamical low- \rank approximation is a recent technique for solving high-dimensional differential equations by means of low-rank approximations. However, the domain is lacking numerical methods for stiff equations since existing methods are either not robust-to-stiffness or have unreasonably large hidden constants. In this paper, we focus on solving large-scale stiff matrix differential equations with a Sylvester-like structure, that admit good low-rank approximations. We propose two new methods that have good convergence properties, small memory footprint and that are fast to compute. The theoretical analysis shows that the new methods have order one and two, respectively. We also propose a practical implementation based on Krylov techniques. The approximation error is analyzed, leading to a priori error bounds and, therefore, a mean for choosing the size of the Krylov space. Numerical experiments are performed on several examples, confirming the theory and showing good speedup in comparison to existing techniques.
- [258] arXiv:2312.00301 (replaced) [pdf, ps, other]
-
Title: A Simple Formula for Binomial Coefficients Revealed Through Polynomial EncodingComments: Revision includes acknowledgment of Adi Shamir's 1978 work. Clarified the unique contributions of this paperSubjects: General Mathematics (math.GM)
We revisit an unconventional formula for binomial coefficients by applying an underexplored property of polynomial encoding. The formula, $\binom{n}{k} = \left\lfloor\frac{(1 + 2^{n})^{n}}{2^{n k}}\right\rfloor \bmod{2^{n}}$, is valid for $n > 0$ and $0 \leq k \leq n$. We relate this formula to existing mathematical methods via Kronecker substitution. Additionally, we generalize this formula to compute multinomial coefficients. A baseline computational complexity analysis identifies opportunities for optimization. We conclude by positing an open problem concerning the efficient computation of $\binom{n}{k}$ modulo $n$ using the formula.
- [259] arXiv:2312.05601 (replaced) [pdf, ps, other]
-
Title: A Meshless Solver for Blood Flow Simulations in Elastic Vessels Using Physics-Informed Neural NetworkSubjects: Numerical Analysis (math.NA); Fluid Dynamics (physics.flu-dyn)
Investigating blood flow in the cardiovascular system is crucial for assessing cardiovascular health. Computational approaches offer some non-invasive alternatives to measure blood flow dynamics. Numerical simulations based on traditional methods such as finite-element and other numerical discretizations have been extensively studied and have yielded excellent results. However, adapting these methods to real-life simulations remains a complex task. In this paper, we propose a method that offers flexibility and can efficiently handle real-life simulations. We suggest utilizing the physics-informed neural network (PINN) to solve the Navier-Stokes equation in a deformable domain, specifically addressing the simulation of blood flow in elastic vessels. Our approach models blood flow using an incompressible, viscous Navier-Stokes equation in an Arbitrary Lagrangian-Eulerian form. The mechanical model for the vessel wall structure is formulated by an equation of Newton's second law of momentum and linear elasticity to the force exerted by the fluid flow. Our method is a mesh-free approach that eliminates the need for discretization and meshing of the computational domain. This makes it highly efficient in solving simulations involving complex geometries. Additionally, with the availability of well-developed open-source machine learning framework packages and parallel modules, our method can easily be accelerated through GPU computing and parallel computing. To evaluate our approach, we conducted experiments on regular cylinder vessels as well as vessels with plaque on their walls. We compared our results to a solution calculated by Finite Element Methods using a dense grid and small time steps, which we considered as the ground truth solution. We report the relative error and the time consumed to solve the problem, highlighting the advantages of our method.
- [260] arXiv:2312.09552 (replaced) [pdf, ps, html, other]
-
Title: The number of inscribed and circumscribed graphs of a convex polyhedronSubjects: General Mathematics (math.GM)
In the paper we prove that the number of graphs inscribed into graph of a convex polyhedron and circumscribed around another graph does not exceed 4. For this we first studied Poncelet type problem about the number of convex $n$-gons inscribed into one convex $n$-gon and circumscribed around another convex $n$-gon. It is proved that their number is also at most 4. This contrasts with Poncelet type porisms where usually infinitude of such polygons is proved, provided that one such polygon already exists. An inequality involving ratio of lengths of line segments is used. Alternative way of using Maclaurin-Braikenridge's conic generation method is also discussed. Properties related to constructibility with straightedge and compass are also studied. A new proof, based on mathematical induction, of generalized Maclaurin- Braikenridge's theorem is given. We also gave examples of regular polygons and a polyhedron for which number 4 is realized.
- [261] arXiv:2312.11016 (replaced) [pdf, ps, other]
-
Title: Asymptotic stability of small standing solitary waves of the one-dimensional cubic-quintic Schr\"odinger equationComments: Minor changesSubjects: Analysis of PDEs (math.AP)
For the Schrödinger equation with a cubic-quintic, focusing-focusing nonlinearity in one space dimension, this article proves the local asymptotic completeness of the family of small standing solitary waves under even perturbations in the energy space. For this model, perturbative of the integrable cubic Schrödinger equation for small solutions, the linearized equation around a small solitary wave has an internal mode, whose contribution to the dynamics is handled by the Fermi golden rule.
- [262] arXiv:2312.16657 (replaced) [pdf, ps, other]
-
Title: On a finite sum of cosecants appearing in various problemsSubjects: Number Theory (math.NT); Classical Analysis and ODEs (math.CA); Numerical Analysis (math.NA)
In this paper we investigate the finite sum of cosecants $\sum\csc\big(\varphi+a\pi l/n\big),$ where the index $l$ runs through 1 to $n-1$ and $\varphi$ and $a$ are arbitrary parameters, as well as several closely related sums, such as similar sums of a series of secants, of tangents and of cotangents. These trigonometric sums appear in various problems in mathematics, physics, and a variety of related disciplines. Their particular cases were fragmentarily considered in previous works, and it was noted that even a simple particular case $\sum\csc\big(\pi l/n\big)$ does not have a closed-form, i.e. a compact summation formula. In the paper, we derive several alternative representations for the above-mentioned sums, study their properties, relate them to many other finite and infinite sums, obtain their complete asymptotic expansions for large $n$ and provide accurate upper and lower bounds (e.g. the typical relative error for the upper bound is lesser than $2\times10^{-9}$ for $n\geqslant10$ and lesser than $7\times10^{-14}$ for $n\geqslant50$, which is much better than the bounds we could find in previous works). Our researches reveal that these sums are deeply related to several special numbers and functions, especially to the digamma function (furthermore, as a by-product, we obtain several interesting summations formulae for the digamma function). Asymptotical studies show that these sums may have qualitatively different behaviour depending on the choice of $\varphi$ and $a$; in particular, as $n$ increases some of them may become sporadically large. Finally, we also provide several historical remarks related to various sums considered in the paper. We show that some results in the field either were rediscovered several times or can easily be deduced from various known formulae, including some formulae dating back to the XVIIIth century.
- [263] arXiv:2312.16958 (replaced) [pdf, ps, other]
-
Title: Set-theoretical solutions to the pentagon equation: a surveySubjects: Quantum Algebra (math.QA)
This survey aims to collect the main results of the theory of the set-theoretical solutions to the pentagon equation obtained up to now in the literature. In particular, we present some classes of solutions and raise some questions.
- [264] arXiv:2312.17112 (replaced) [pdf, ps, other]
-
Title: Lipschitz regularity of sub-elliptic harmonic maps into CAT(0) spacesComments: 27 pages, no figures, comments are welcome. In version 2, a missing bibliographical information has been addedSubjects: Differential Geometry (math.DG); Analysis of PDEs (math.AP); Metric Geometry (math.MG)
We prove the local Lipschitz continuity of sub-elliptic harmonic maps between certain singular spaces, more specifically from the $n$-dimensional Heisenberg group into $CAT(0)$ spaces. Our main theorem establishes that these maps have the desired Lipschitz regularity, extending the Hölder regularity in this setting proven by Y. Gui et. al and obtaining same regularity as in the work of H-C. Zhang and X-P. Zhu for certain sub-Riemannian geometries, see also the works of A. Mondino and D. Semola, as well as N. Gigli, for generalisations to RCD spaces. The present result paves the way for a general regularity theory of sub-elliptic harmonic maps, providing a versatile approach applicable beyond the Heisenberg group.
- [265] arXiv:2401.12152 (replaced) [pdf, ps, html, other]
-
Title: Maximum principles for weakly $1$-coercive operators with applications to capillary and prescribed mean curvature graphsComments: 28 pages. Invited contribution to a special issue in honour of Professor Marcos Dajczer on the occasion of his 75th birthday. Some typos corrected in versions 2 and 3. In version 3 we also added Remark 5 and we corrected Remark 7 and a statement following Theorem 7. Comments are welcome!Subjects: Analysis of PDEs (math.AP); Differential Geometry (math.DG)
In this paper we establish maximum principles for weakly 1-coercive operators $L$ on complete, non-compact Riemannian manifolds $M$. In particular, we search for conditions under which one can guarantee that solutions $u$ of differential equations of the form $L(u)\geq f(u)$ satisfy $f(u)\leq 0$ on $M$. The case of weakly $p$-coercive operators with $p>1$, including the $p$-Laplacian and in particular the Laplace-Beltrami operator for $p=2$, has been considered in a recent paper of ours. As a consequence of the main results we infer comparison principles for that kind of operators. Furthermore we apply them to geometric situations dealing with the mean curvature operator, which is a typical weakly 1-coercive operator. We first consider the case of $\mathcal C^1$ operators $L$ acting on functions $u$ of class $\mathcal C^2$ and, in the last section of the paper, we show how our results can be extended to the case of less regular operators $L$ acting on functions $u$ which are just continuous and locally $W^{1,1}$ regular.
- [266] arXiv:2401.16987 (replaced) [pdf, ps, other]
-
Title: The higher order chain rule in Sobolev spacesSubjects: Functional Analysis (math.FA)
We establish the Faà di Bruno formula, in the sense of almost everywhere equality, for derivatives of the composed function $f \circ g$, for all function $f : R \rightarrow R$ such that $f$ acts on $W^m_p(R^n)$ by composition, and all $g \in W^m_p(R^n)$.
- [267] arXiv:2402.04486 (replaced) [pdf, ps, html, other]
-
Title: Outer Code Designs for Augmented and Local-Global Polar Code ArchitecturesComments: 8 pages, 8 figures. Accepted by ISIT 2024Subjects: Information Theory (cs.IT)
In this paper, we introduce two novel methods to design outer polar codes for two previously proposed concatenated polar code architectures: augmented polar codes and local-global polar codes. These methods include a stopping set (SS) construction and a nonstationary density evolution (NDE) construction. Simulation results demonstrate the advantage of these methods over previously proposed constructions based on density evolution (DE) and LLR evolution.
- [268] arXiv:2402.06183 (replaced) [pdf, ps, other]
-
Title: On operadic open-closed maps in characteristic $p$Comments: 65 pages, 6 figures. Fixed typos, updated references, changed notation for finite p-cyclic category (to avoid confusion with a different category due to Kaledin)Subjects: Symplectic Geometry (math.SG); K-Theory and Homology (math.KT)
Consider a closed monotone symplectic manifold $(M,\omega)$. \cite{Gan2} constructed a cyclic open-closed map, which goes from the cyclic homology of the Fukaya category of $M$ to the $S^1$-equivariant quantum cohomology of $M$. In this paper, we show that with mod $p$ coefficients, Ganatra's cyclic open-closed map is compatible with a certain $\mathbb{Z}/p$-equivariant open-closed map under the natural $\mathbb{Z}/p$-Gysin type comparison map for Hochschild homology. Along with the proof, this paper gives a new homotopy theoretic framework for studying open-closed maps in symplectic topology. These will be used in an upcoming work \cite{Che} to study mod $p$ equivariant enumerative invariants such as the Quantum Steenrod operations. The main insights of this paper are: 1) a $\mathbb{Z}/p$-Gysin comparison result for ($\mathcal{A}_{\infty}$-) cyclic objects, 2) a new construction of the open-closed map using operadic Floer theory of \cite{AGV}, which gives rise to a new interpretation of its `$S^1$-equivariant' property, and 3) comparison of the new construction with its classical counterpart.
- [269] arXiv:2402.08438 (replaced) [pdf, ps, other]
-
Title: T-semidefinite programming relaxation with third-order tensors for constrained polynomial optimizationComments: 31 pages, 4 tablesSubjects: Optimization and Control (math.OC)
We study T-semidefinite programming (SDP) relaxation for constrained polynomial optimization problems (POPs). T-SDP relaxation for unconstrained POPs was introduced by Zheng, Huang and Hu in 2022. In this work, we propose a T-SDP relaxation for POPs with polynomial inequality constraints and show that the resulting T-SDP relaxation formulated with third-order tensors can be transformed into the standard SDP relaxation with block-diagonal structures. The convergence of the T-SDP relaxation to the optimal value of a given constrained POP is established under moderate assumptions as the relaxation level increases. Additionally, the feasibility and optimality of the T-SDP relaxation are discussed. Numerical results illustrate that the proposed T-SDP relaxation enhances numerical efficiency.
- [270] arXiv:2402.11184 (replaced) [pdf, ps, other]
-
Title: A modified version of the PRESB preconditioner for a class of non-Hermitian complex systems of linear equationsComments: 12 pages, 1 Figure, SubmittedSubjects: Numerical Analysis (math.NA)
We present a modified version of the PRESB preconditioner for two-by-two block system of linear equations with the coefficient matrix $$\textbf{A}=\left(\begin{array}{cc}
F & -G^*
G & F \end{array}\right),$$ where $F\in\mathbb{C}^{n\times n}$ is Hermitian positive definite and $G\in\mathbb{C}^{n\times n}$ is positive semidefinite. Spectral analysis of the preconditioned matrix is analyzed. In each iteration of a Krylov subspace method, like GMRES, for solving the preconditioned system in conjunction with proposed preconditioner two subsystems with Hermitian positive definite coefficient matrix should be solved which can be accomplished exactly using the Cholesky factorization or inexactly utilizing the conjugate gradient method. Application of the proposed preconditioner to the systems arising from finite element discretization of PDE-constrained optimization problems is presented. Numerical results are given to demonstrate the efficiency of the preconditioner. - [271] arXiv:2402.11460 (replaced) [pdf, ps, other]
-
Title: Drazin and group invertibility in algebras spanned by two idempotentsSubjects: Functional Analysis (math.FA); Operator Algebras (math.OA); Rings and Algebras (math.RA)
For two given idempotents $p\text{ and }q$ from an associative algebra $\mathcal{A},$ in this paper, we offer a comprehensive classification of algebras spanned by the idempotents $p\text{ and }q$. This classification is based on the condition that $p\text{ and }q$ are not tightly coupled and satisfies $(pq)^{m-1}=(pq)^{m}$ but $(pq)^{m-2}p\neq (pq)^{m-1}p$ for some $m(\geq2)\in\mathbb{N}.$ Subsequently, we categorized all the group invertible elements and established an upper bound for Drazin index of any elements in these algebras spanned by $p,q$. Moreover, we formulate a new representation for the Drazin inverse of $(\alpha p+q)$ under two different assumptions, $(pq)^{m-1}=(pq)^m$ and $\lambda(pq)^{m-1}=(pq)^m,$ here $\alpha$ is a non-zero and $\lambda$ is a non-unit real or complex number.
- [272] arXiv:2403.02434 (replaced) [pdf, ps, html, other]
-
Title: On the character tables of the finite reductive groups $E_6(q)_{\text{ad}}$ and ${^2\!E}_6(q)_{\text{ad}}$Comments: 30 pages; added appendix by Jonas Hetz on groups of type D5Subjects: Representation Theory (math.RT)
We show how the character tables of the groups $E_6(q)_{\text{ad}}$ and ${^2\!E}_6(q)_{\text{ad}}$ can be constructed, where $q$ is a power of~$2$. (Partial results are also obtained for any $q$ not divisible by~$3$.) This is based on previous work by Hetz, Lusztig, Malle, Mizuno and Shoji, plus computations using Michel's version of {\sf CHEVIE}. We also need some general results that are specific to semisimple groups which are not of simply connected type. A further crucial ingredient is the determination of the values of the unipotent characters on unipotent elements for groups of type $D_4$ and $D_5$ (in characteristic~$2$).
- [273] arXiv:2403.03104 (replaced) [pdf, ps, other]
-
Title: Low-rank approximated Kalman-Bucy filters using Oja's principal component flow for linear time-invariant systemsComments: 6 pages, fixed typographical errors and clarified some unclear statementsSubjects: Optimization and Control (math.OC); Systems and Control (eess.SY)
The Kalman-Bucy filter is extensively utilized across various applications. However, its computational complexity increases significantly in large-scale systems. To mitigate this challenge, a low-rank approximated Kalman--Bucy filter was proposed, comprising Oja's principal component flow and a low-dimensional Riccati differential equation. Previously, the estimation error was confirmed solely for linear time-invariant systems with a symmetric system matrix. This study extends the application by eliminating the constraint on the symmetricity of the system matrix and describes the equilibrium points of the Oja flow along with their stability for general matrices. In addition, the domain of attraction for a set of stable equilibrium points is estimated. Based on these findings, we demonstrate that the low-rank approximated Kalman--Bucy filter with a suitable rank maintains a bounded estimation error covariance matrix if the system is controllable and observable.
- [274] arXiv:2403.04544 (replaced) [pdf, ps, other]
-
Title: On products of K-moduli spacesComments: Strengthened main results, including main theorem, and added a section on \Q-Gorenstein deformations of products of Fano varieties. Comments very welcome!Subjects: Algebraic Geometry (math.AG)
We study the K-moduli space of products of Fano varieties in relation to the product of K-moduli spaces of the product components. We show that there exists a well-defined morphism from the product of K-moduli stacks of Fano varieties to the K-moduli stack of their product. Furthermore, we show that this morphism is an isomorphism if any two varieties with different irreducible components are non-isomorphic, and a torsor if they are. Our results rely on the theory of stacks and previous work by Zhuang.
We use our main result to obtain an explicit description of the K-moduli stack/ space of Fano threefolds with Picard rank greater than 6, along with a wall-crossing description, and a detailed polyhedral wall-crossing description for K-moduli of log Fano pairs. - [275] arXiv:2403.10383 (replaced) [pdf, ps, html, other]
-
Title: A new canonical reduction of three-vortex motion and its application to vortex-dipole scatteringComments: This is a major restructuring on the paper. Whereas the initial submission focused on the scattering problem, the revision focuses on the method of the reduction, its mathematical structure, and its advantages. It treats the scattering problem as an application of the method and adds a second application: the problem of three identical vortices. The title has been changed to reflect thisSubjects: Dynamical Systems (math.DS)
We introduce a new reduction of the motion of three point vortices in a two-dimensional ideal fluid. This proceeds in two stages: a change of variables to Jacobi coordinates and then a Nambu reduction. The new coordinates demonstrate that the dynamics evolve on a two-dimensional manifold whose topology depends on the sign of a parameter $\kappa_2$ that arises in the reduction. For $\kappa_2>0$, the phase space is spherical, while for $\kappa_2<0$, the dynamics are confined to the upper sheet of a two-sheeted hyperboloid. We contrast this reduction with earlier reduced systems derived by Gröbli, Aref, and others in which the dynamics are determined from the pairwise distances between the vortices. The new coordinate system overcomes two related shortcomings of Gröbli's reduction that have made understanding the dynamics difficult: their lack of a standard phase plane and their singularity at all configurations in which the vortices are collinear. We apply this to two canonical problems. We first discuss the dynamics of three identical vortices and then consider the scattering of a propagating dipole by a stationary vortex. We show that the points dividing direct and exchange scattering solutions correspond to the locations of the invariant manifolds of equilibria of the reduced equations and relate changes in the scattering diagram as the circulation of one vortex is varied to bifurcations of these equilibria.
- [276] arXiv:2403.11937 (replaced) [pdf, ps, other]
-
Title: Regularity and nondegeneracy for nonlocal Bernoulli problems with variable kernelsComments: 16 pages. Some minor errors correctedSubjects: Analysis of PDEs (math.AP)
We consider a generalization of the Bernoulli free boundary problem where the underlying differential operator is a nonlocal, non-translation-invariant elliptic operator of order $2s\in (0,2)$. Because of the lack of translation invariance, the Caffarelli-Silvestre extension is unavailable, and we must work with the nonlocal problem directly instead of transforming to a thin free boundary problem. We prove global Hölder continuity of minimizers for both the one- and two-phase problems. Next, for the one-phase problem, we show Hölder continuity at the free boundary with the optimal exponent $s$. We also prove matching nondegeneracy estimates. A key novelty of our work is that all our findings hold without requiring any regularity assumptions on the kernel of the nonlocal operator. This characteristic makes them crucial in the development of a universal regularity theory for nonlocal free boundary problems.
- [277] arXiv:2403.17957 (replaced) [pdf, ps, html, other]
-
Title: The Density of Borromean PrimesComments: 19 pages. To appear in Commentarii Mathematici Universitatis Sancti Pauli. Added acknowledgementsSubjects: Number Theory (math.NT)
In this paper, we study an asymptotic distribution of sets of primes satisfying certain "linking conditions" in arithmetic topology, namely, conditions given by the Legendre and Rédei symbols among sets of primes. As our Main Theorem, we prove an asymptotic density formula for Borromean primes among all primes. For the proof, we use the effective Chebotarev density formula under the Generalized Riemann Hypothesis and explicit computations of discriminants of the number fields involved in Rédei's extension.
- [278] arXiv:2403.18027 (replaced) [pdf, ps, html, other]
-
Title: Notes on chain conditions in $C_p(X)$Subjects: General Topology (math.GN)
We present new results regarding calibers in the function spaces $C_p(X)$. We give sufficient conditions to characterize the calibers of $C_p(X)$ when $X$ is a topological sum, and we calculate the calibers of $C_p(X)$ when $X = \prod_{\xi < \lambda}X_\xi$ is a product of non-trivial Tychonoff spaces with $i$-weight $\leq \lambda$. Moreover, we calculate the calibers of $C_p(X)$ when $X$ is an interval of ordinals and when $X$ is the one-point $\lambda$-Lindelöf extension of a discrete space of cardinality $\geq \lambda$. This allows to give examples of compact Hausdorff spaces $Z$ such that $iw(Z)=\kappa^{+}$ and $C_p(Z^{\kappa})$ does not have caliber $iw(Z)$; and examples of spaces $\{Z_\alpha : \alpha<cf(\kappa)\}$ such that $\kappa$ is a caliber for $C_p(Z_\alpha)$ whenever $\alpha<cf(\kappa)$ but it is not a caliber for $C_p(\bigoplus_{\alpha<cf(\kappa)} Z_\alpha)$.
- [279] arXiv:2403.19396 (replaced) [pdf, ps, other]
-
Title: Persistence Diagram Estimation of Multivariate Piecewise H\"older-continuous SignalsComments: 33 pagesSubjects: Statistics Theory (math.ST); Algebraic Topology (math.AT)
To our knowledge, the analysis of convergence rates for persistence diagram estimation from noisy signals had predominantly relied on lifting signal estimation results through sup norm (or other functional norm) stability theorems. We believe that moving forward from this approach can lead to considerable gains. We illustrate it in the setting of Gaussian white noise model. We examine from a minimax perspective, the inference of persistence diagram (for sublevel sets filtration). We show that for piecewise Hölder-continuous functions, with control over the reach of the discontinuities set, taking the persistence diagram coming from a simple histogram estimator of the signal, permit to achieve the minimax rates known for Hölder-continuous functions.
- [280] arXiv:2404.09295 (replaced) [pdf, ps, other]
-
Title: Estimates for trilinear and quadrilinear character sumsComments: 30 pages. All comments are welcome!Subjects: Number Theory (math.NT)
We obtain new bounds on some trilinear and quadrilinear character sums, which are non-trivial starting from very short ranges of the variables. An application to an apparently new problem on oscillations of characters on differences between Farey fractions is given. Other applications include a modular analogue of a multiplicative hybrid problem of Iwaniec and Sárközy (1987) and the solvability of some prime type equations with constraints.
- [281] arXiv:2404.10366 (replaced) [pdf, ps, other]
-
Title: The lowest discriminant ideal of central extensions of Abelian groupsComments: 13 pagesSubjects: Representation Theory (math.RT); Rings and Algebras (math.RA)
In a previous joint paper with Wu and Yakimov, we gave an explicit description of the lowest discriminant ideal in a Cayley-Hamilton Hopf algebra (H,C,tr) of degree d over an algebraically closed field k, char k $\notin[1, d]$ with basic identity fiber, i.e. all irreducible representations over the kernel of the counit of the central Hopf subalgebra C are one-dimensional. Using results developed in that paper, we compute relevant quantities associated with irreducible representations to explicitly describe the zero set of the lowest discriminant ideal in the group algebra of a central extension of the product of two arbitrary finitely generated Abelian groups by any finite Abelian group under some conditions. Over a fixed maximal ideal of C the representations are tensor products of representations each corresponding to a central extension of a subgroup isomorphic to the product of two cyclic groups of the same order. A description of the orbit of the identity, i.e. the kernel of the counit of C, under winding automorphisms is also given.
- [282] arXiv:2404.11966 (replaced) [pdf, ps, other]
-
Title: A $\delta$-first Whitehead Lemma for Jordan algebrasComments: 8 pages; v2: added a missing case of simple Jordan algebrasSubjects: Rings and Algebras (math.RA)
We compute $\delta$-derivations of simple Jordan algebras with values in irreducible bimodules. They turn out to be either ordinary derivations ($\delta = 1$), or scalar multiples of the identity map ($\delta = \frac 12$). This can be considered as a generalization of the "First Whitehead Lemma" for Jordan algebras which claims that all such ordinary derivations are inner. The proof amounts to simple calculations in matrix algebras, or, in the case of Jordan algebras of a symmetric bilinear form, to more elaborated calculations in Clifford algebras.
- [283] arXiv:2404.12795 (replaced) [pdf, ps, html, other]
-
Title: Stability for a class of three-tori with small negative scalar curvatureComments: In the second version, the abstract is updated; some typos are correctedSubjects: Differential Geometry (math.DG)
We define a flexible class of Riemmanian metrics on the three-torus. Then, using Stern's inequality relating scalar curvature to harmonic one-forms, we show that any sequence of metrics in this family whose negative part of the scalar curvature tends to zero in $L^2$ norm has a subsequence which converges to some flat metric on the three-torus in the sense of Dong-Song.
- [284] arXiv:2404.13624 (replaced) [pdf, ps, other]
-
Title: Necessary and Sufficient Conditions for Capacity-Achieving Private Information Retrieval with Non-Colluding and Colluding ServersComments: 16 pagesSubjects: Information Theory (cs.IT)
Private Information Retrieval (PIR) is a mechanism for efficiently downloading messages while keeping the index secret. Here, PIRs in which servers do not communicate with each other are called standard PIRs, and PIRs in which some servers communicate with each other are called colluding PIRs. The information-theoretic upper bound on efficiency has been given in previous studies. However, the conditions for PIRs to keep privacy, to decode the desired message, and to achieve that upper bound have not been clarified in matrix form. In this paper, we prove the necessary and sufficient conditions for the properties of standard PIR and colluding PIR. Further, we represent the properties in matrix form.
- [285] arXiv:2404.16258 (replaced) [pdf, ps, other]
-
Title: Central charges in local mirror symmetry via hypergeometric dualityComments: 32 pages. Comparison with related results and references added; typos fixed; exposition improvedSubjects: Algebraic Geometry (math.AG)
We apply the better-behaved GKZ hypergeometric systems to study toric Calabi-Yau Deligne-Mumford stacks and their Hori-Vafa mirrors given by affine hypersurfaces in algebraic tori. We show the equality between A-brane and B-brane central charges, in terms of period integrals and hypergeometric series respectively. This settles a conjecture of Hosono, which could also be considered as a generalization of the Gamma conjecture for local mirror symmetry.
- [286] arXiv:2404.16438 (replaced) [pdf, ps, other]
-
Title: Exponential decay for fractional Schr\"odinger parabolic problemsSubjects: Analysis of PDEs (math.AP)
We discuss exponential decay in $L^p(R^N)$, $1\leq p \leq \infty$, of solutions of a fractional Schrödinger parabolic equation with a locally uniformly integrable potential. The exponential type of the semigroup of solutions is considered and its dependence in
$1\leq p \leq \infty$ is addressed. We characterise a large class of potentials for which solutions decay exponentially. - [287] arXiv:2404.17533 (replaced) [pdf, ps, other]
-
Title: Rigidity of spin fill-ins with non-negative scalar curvatureComments: 22 pages; v2: additional result in section 3 and miscellaneous minor improvementsSubjects: Differential Geometry (math.DG); General Relativity and Quantum Cosmology (gr-qc)
We establish new mean curvature rigidity theorems of spin fill-ins with non-negative scalar curvature using two different spinorial techniques. Our results address two questions by Miao and Gromov, respectively. The first technique is based on extending boundary spinors satisfying a generalized eigenvalue equation via the Fredholm alternative for an APS boundary value problem, while the second is a comparison result in the spirit of Llarull and Lott using index theory. We also show that the latter implies a new Witten-type integral inequality for the mass of an asymptotically Schwarzschild manifold which holds even when the scalar curvature is not assumed to be non-negative.
- [288] arXiv:2404.18841 (replaced) [pdf, ps, other]
-
Title: Deep orthogonal decomposition: a continuously adaptive data-driven approach to model order reductionSubjects: Numerical Analysis (math.NA)
We develop a novel deep learning technique, termed Deep Orthogonal Decomposition (DOD), for dimensionality reduction and reduced order modeling of parameter dependent partial differential equations. The approach consists in the construction of a deep neural network model that approximates the solution manifold through a continuously adaptive local basis. In contrast to global methods, such as Principal Orthogonal Decomposition (POD), the adaptivity allows the DOD to overcome the Kolmogorov barrier, making the approach applicable to a wide spectrum of parametric problems. Furthermore, due to its hybrid linear-nonlinear nature, the DOD can accommodate both intrusive and nonintrusive techniques, providing highly interpretable latent representations and tighter control on error propagation. For this reason, the proposed approach stands out as a valuable alternative to other nonlinear techniques, such as deep autoencoders. The methodology is discussed both theoretically and practically, evaluating its performances on problems featuring nonlinear PDEs, singularities, and parametrized geometries.
- [289] arXiv:2404.19607 (replaced) [pdf, ps, other]
-
Title: Strong minimal model theorem and Massey productsComments: Typos corrected, references added. 14 pagesSubjects: Algebraic Topology (math.AT)
Kadeishvili's minimal model theorem establishes the existence of an $A_\infty$-structure, unique up to isomorphism, on the cohomology of a dg associative algebra, which captures its homotopy type. In this note we prove the existence of minimal models that are unique up to isotopy, a stronger result obviously known to T. Kadeishvili and certainly to others, yet seemingly overlooked by mankind. We will explore how this stronger result can help in the study of Massey products. First, we show that the attempts to extract a local information from the ternary operation $\mu_3$ of our minimal model leads directly to the rediscovery of the triple Massey product. The motto is: "The triple Massey product is an invariant manifestation of $\mu_3$." We then prove that, under reasonable assumptions, the higher Massey product $\langle x_1,\ldots,x_n\rangle$ equals the set of all values $\mu_n(x_1,\ldots,x_n)$, where $\mu_n$ runs over the $n$-ary products of our minimal models. We believe that this note will help to elucidate the still somewhat enigmatic relationship between minimal models and Massey products.
- [290] arXiv:2405.00065 (replaced) [pdf, ps, other]
-
Title: From Linear to Linearizable Optimization: A Novel Framework with Applications to Stationary and Non-stationary DR-submodular OptimizationComments: arXiv admin note: text overlap with arXiv:2402.08621Subjects: Optimization and Control (math.OC); Computational Complexity (cs.CC); Machine Learning (cs.LG); Machine Learning (stat.ML)
This paper introduces the notion of upper linearizable/quadratizable functions, a class that extends concavity and DR-submodularity in various settings, including monotone and non-monotone cases over different convex sets. A general meta-algorithm is devised to convert algorithms for linear/quadratic maximization into ones that optimize upper quadratizable functions, offering a unified approach to tackling concave and DR-submodular optimization problems. The paper extends these results to multiple feedback settings, facilitating conversions between semi-bandit/first-order feedback and bandit/zeroth-order feedback, as well as between first/zeroth-order feedback and semi-bandit/bandit feedback. Leveraging this framework, new algorithms are derived using existing results as base algorithms for convex optimization, improving upon state-of-the-art results in various cases. Dynamic and adaptive regret guarantees are obtained for DR-submodular maximization, marking the first algorithms to achieve such guarantees in these settings. Notably, the paper achieves these advancements with fewer assumptions compared to existing state-of-the-art results, underscoring its broad applicability and theoretical contributions to non-convex optimization.
- [291] arXiv:2405.00610 (replaced) [pdf, ps, other]
-
Title: Growth in products of matrices: fastest, average, and genericComments: 10 pages. Comments are welcomeSubjects: Group Theory (math.GR); Cryptography and Security (cs.CR); Combinatorics (math.CO); Dynamical Systems (math.DS); Probability (math.PR)
The problems that we consider in this paper are as follows. Let A and B be 2x2 matrices (over reals). Let w(A, B) be a word of length n. After evaluating w(A, B) as a product of matrices, we get a 2x2 matrix, call it W. What is the largest (by the absolute value) possible entry of W, over all w(A, B) of length n, as a function of n? What is the expected absolute value of the largest (by the absolute value) entry in a random product of n matrices, where each matrix is A or B with probability 0.5? What is the Lyapunov exponent for a random matrix product like that? We give partial answer to the first of these questions and an essentially complete answer to the second question. For the third question (the most difficult of the three), we offer a very simple method to produce an upper bound on the Lyapunov exponent in the case where all entries of the matrices A and B are nonnegative.
- [292] arXiv:2405.00618 (replaced) [pdf, ps, other]
-
Title: An axiomatisation of the temporal logic of two dimensional Minkowski spacetimeSubjects: Logic (math.LO)
We define temporal axioms that are sound and complete for the temporal validities over $(\reals^2, <)$.
- [293] arXiv:2405.01291 (replaced) [pdf, ps, other]
-
Title: On Hodge structures of compact complex manifolds with semistable degenerationsComments: 23 pagesSubjects: Algebraic Geometry (math.AG)
Compact Kähler manifolds satisfy several nice Hodge-theoretic properties such as the Hodge symmetry, the Hard Lefschetz property and the Hodge-Riemann bilinear relations, etc. In this note, we investigate when such nice properties hold on compact complex manifolds with semistable degenerations. For compact complex manifolds which can be obtained as smoothings of SNC varieties without triple intersection locus, we show the Hodge symmetry when the monodromy logarithm induces isomorphisms on the associated graded. We also show the Hodge-Riemann relations on H^3 of compact complex 3-folds with such semistable degenerations under some conditions.
- [294] arXiv:2405.01804 (replaced) [pdf, ps, html, other]
-
Title: Generalized Ramsey-Tur\'an NumbersComments: Fixed an icorrect row in Table 1 and corresponding computation in proof of Theorem 1.5Subjects: Combinatorics (math.CO)
The Ramsey-Turán problem for $K_p$ asks for the maximum number of edges in an $n$-vertex $K_p$-free graph with independence number $o(n)$. In a natural generalization of the problem, cliques larger than the edge $K_2$ are counted. Let {\bf RT}$(n,\#K_q,K_p,o(n))$ denote the maximum number of copies of $K_q$ in an $n$-vertex $K_p$-free graph with independence number $o(n)$. Balogh, Liu and Sharifzadeh determined the asymptotics of {\bf RT}$(n,\# K_3,K_p,o(n))$. In this paper we will establish the asymptotics for counting copies of $K_4$, $K_5$, and for the case $p \geq 5q$. We also provide a family of counterexamples to a conjecture of Balogh, Liu and Sharifzadeh.
- [295] arXiv:2405.04047 (replaced) [pdf, ps, html, other]
-
Title: Uniform-in-time estimates for mean-field type SDEs and applicationsSubjects: Probability (math.PR)
Via constructing an asymptotic coupling by reflection, in this paper we establish uniform-in-time estimates on probability distances for mean-field type SDEs, where the drift terms under consideration are dissipative merely in the long distance. As applications, we (i) explore the long time probability distance estimate between an SDE and its delay version; (ii) investigate the issue on uniform-in-time propagation of chaos for McKean-Vlasov SDEs, where the drifts might be singular with respect to the spatial variables and need not to be of convolution type; (iii) tackle the discretization error bounds in an infinite-time horizon for stochastic algorithms (e.g. backward/tamed/adaptive Euler-Maruyama schemes as three typical candidates) associated with McKean-Vlasov SDEs.
- [296] arXiv:2405.04051 (replaced) [pdf, ps, html, other]
-
Title: On the quantization goodness of polar latticesComments: 12 pages, 5 figures, submitted to IEEE for possible publicationSubjects: Information Theory (cs.IT)
In this work, we prove that polar lattices, when tailored for lossy compression, are quantization-good in the sense that their normalized second moments approach $\frac{1}{2\pi e}$ as the dimension of lattices increases. It has been predicted by Zamir et al. \cite{ZamirQZ96} that the Entropy Coded Dithered Quantization (ECDQ) system using quantization-good lattices can achieve the rate-distortion bound of i.i.d. Gaussian sources. In our previous work \cite{LingQZ}, we established that polar lattices are indeed capable of attaining the same objective. It is reasonable to conjecture that polar lattices also demonstrate quantization goodness in the context of lossy compression. This study confirms this hypothesis.
- [297] arXiv:2405.04363 (replaced) [pdf, ps, other]
-
Title: Some Notes on the Sample Complexity of Approximate Channel SimulationComments: Accepted as a spotlight paper at the first 'Learn to Compress' Workshop@ ISIT 2024. V2: corrected some typos and simplified Appendix CSubjects: Information Theory (cs.IT)
Channel simulation algorithms can efficiently encode random samples from a prescribed target distribution $Q$ and find applications in machine learning-based lossy data compression. However, algorithms that encode exact samples usually have random runtime, limiting their applicability when a consistent encoding time is desirable. Thus, this paper considers approximate schemes with a fixed runtime instead. First, we strengthen a result of Agustsson and Theis and show that there is a class of pairs of target distribution $Q$ and coding distribution $P$, for which the runtime of any approximate scheme scales at least super-polynomially in $D_\infty[Q \Vert P]$. We then show, by contrast, that if we have access to an unnormalised Radon-Nikodym derivative $r \propto dQ/dP$ and knowledge of $D_{KL}[Q \Vert P]$, we can exploit global-bound, depth-limited A* coding to ensure $\mathrm{TV}[Q \Vert P] \leq \epsilon$ and maintain optimal coding performance with a sample complexity of only $\exp_2\big((D_{KL}[Q \Vert P] + o(1)) \big/ \epsilon\big)$.
- [298] arXiv:2405.04999 (replaced) [pdf, ps, other]
-
Title: Small ball probability for multiple singular values of symmetric random matricesSubjects: Probability (math.PR)
Let $A_n$ be an $n\times n$ random symmetric matrix with $(A_{ij})_{i< j}$ i.i.d. mean $0$, variance 1, following a subGaussian distribution and diagonal elements i.i.d. following a subGaussian distribution with a fixed variance. We investigate the joint small ball probability that $A_n$ has eigenvalues near two fixed locations $\lambda_1$ and $\lambda_2$, where $\lambda_1$ and $\lambda_2$ are sufficiently separated and in the bulk of the semicircle law. More precisely we prove that for a wide class of entry distributions of $A_{ij}$ that involve all Gaussian convolutions (where $\sigma_{min}(\cdot)$ denotes the least singular value of a square matrix), $$\mathbb{P}(\sigma_{min}(A_n-\lambda_1 I_n)\leq\delta_1n^{-1/2},\sigma_{min}(A_n-\lambda_2 I_n)\leq\delta_2n^{-1/2})\leq c\delta_1\delta_2+e^{-cn}.$$ The given estimate approximately factorizes as the product of the estimates for the two individual events, which is an indication of quantitative independence. The estimate readily generalizes to $d$ distinct locations. As an application, we upper bound the probability that there exist $d$ eigenvalues of $A_n$ asymptotically satisfying any fixed linear equation, which in particular gives a lower bound of the distance to this linear relation from any possible eigenvalue pair that holds with probability $1-o(1)$, and rules out the existence of two equal singular values in generic regions of the spectrum.
- [299] arXiv:2405.05834 (replaced) [pdf, ps, other]
-
Title: The Riemann hypothesis and dynamics of Backtracking New Q-Newton's methodComments: 19 pages. Some typos are fixed, references are updated. Exposition is improved, Section 4 is expanded. Comments are welcome!Subjects: Dynamical Systems (math.DS); Complex Variables (math.CV); Number Theory (math.NT); Optimization and Control (math.OC)
A new variant of Newton's method -- named Backtracking New Q-Newton's method (BNQN) -- was recently introduced by the second author. This method has good convergence guarantees, specially concerning finding roots of meromorphic functions. This paper explores using BNQN for the Riemann xi function. We show in particular that the Riemann hypothesis is equivalent to that all attractors of BNQN lie on the critical line. We also explain how an apparent relation between the basins of attraction of BNQN and Voronoi's diagram can be helpful for verifying the Riemann hypothesis or finding a counterexample to it. Some illustrating experimental results are included, which convey some interesting phenomena. The experiments show that BNQN works very stably with highly transcendental functions like the Riemann xi function and its derivatives. Based on insights from the experiments, we discuss some concrete steps on using BNQN towards the Riemann hypothesis. Ideas and results from this paper can be extended to other zeta functions.
- [300] arXiv:2405.05887 (replaced) [pdf, ps, other]
-
Title: Convergence Rates of Online Critic Value Function Approximation in Native SpacesShengyuan Niu, Ali Bouland, Haoran Wang, Filippos Fotiadis, Andrew Kurdila, Andrea L'Afflitto, Sai Tej Paruchuri, Kyriakos G. VamvoudakisSubjects: Optimization and Control (math.OC)
In this paper, the evolution equation that defines the online critic for the approximation of the optimal value function is cast in a general class of reproducing kernel Hilbert spaces (RKHSs). Exploiting some core tools of RKHS theory, this formulation allows deriving explicit bounds on the performance of the critic in terms of the kernel and definition of the RKHS, the number of basis functions, and the location of centers used to define scattered bases. The performance of the critic is precisely measured in terms of the power function of the scattered basis used in approximations, and it can be used either in an a priori evaluation of potential bases or in an a posteriori assessments of value function error for basis enrichment or pruning. The most concise bounds in the paper describe explicitly how the critic performance depends on the placement of centers, as measured by their fill distance in a subset that contains the trajectory of the critic.
- [301] arXiv:2405.05932 (replaced) [pdf, ps, other]
-
Title: Non-symplectic automorphisms of prime order of O'Grady's tenfolds and cubic fourfoldsComments: Minor modifications, comments are very welcome!Subjects: Algebraic Geometry (math.AG)
We give a lattice-theoretic classification of non-symplectic automorphisms of prime order of irreducible holomorphic symplectic manifolds of OG10 type. We determine which automorphisms are induced by a non-symplectic automorphism of prime order of a cubic fourfold on the associated LSV manifolds, giving a geometric and lattice-theoretic description of the algebraic and transcendental lattices of the cubic fourfold. As an application we discuss the rationality conjecture for a general cubic fourfold with a non-symplectic automorphism of prime order.
- [302] arXiv:2405.06114 (replaced) [pdf, ps, other]
-
Title: A Specialisation Theorem for Lang-N\'eron GroupsComments: Main result improved and exposition rewritten. Comments welcomeSubjects: Algebraic Geometry (math.AG); Number Theory (math.NT)
We show that, for a polarised smooth projective variety $B \hookrightarrow \mathbb{P}^n_k$ of dimension $\geq 2$ over an infinite field $k$ and an abelian variety $A$ over the function field of $B$, there exists a dense Zariski open set of smooth geometrically connected hyperplane sections $h$ of $B$ such that $A$ has good reduction at $h$ and the specialisation homomorphism of Lang-Néron groups at $h$ is injective (up to a finite $p$-group in positive characteristic $p$). This gives a positive answer to a conjecture of the first author, which is used to deduce a negative definiteness result on his refined height pairing. This also sheds a new light on Néron's specialisation theorem.
- [303] arXiv:2405.06139 (replaced) [pdf, ps, other]
-
Title: Proof of the Complete Presence of a Modulo 4 Bias for the SemiprimesComments: 10 pages, 1 figureSubjects: Number Theory (math.NT)
Dummit, Granville and Kisilevsky have recently shown that the proportion of semiprimes (products of two primes) not exceeding a given $x$, whose factors are congruent to $3$ modulo $4$, is more than a quarter when $x$ is sufficiently large. They have also conjectured that this holds from the very beginning, that is, for all $x \geq 9$. We give a proof for $x\geq 10^{21}$ via an explicit approach based on their work. Together with their data for the remaining $x$, this results in a full proof of the conjecture. Our method consists of techniques with cancellations of sums over primes with different remainders. We also rely on classical estimates for prime counting functions, as well as on very recent explicit improvements by Bennet, Martin, O'Bryant and Rechnitzer, which have wide applications in essentially any setting involving estimations of sums over primes.
- [304] arXiv:2405.07217 (replaced) [pdf, ps, other]
-
Title: Improved bounds for polylogarithmic graph distances in scale-free percolation and related modelsComments: 21 pagesSubjects: Probability (math.PR); Social and Information Networks (cs.SI); Combinatorics (math.CO)
In this paper, we study graph distances in the geometric random graph models scale-free percolation SFP, geometric inhomogeneous random graphs GIRG, and hyperbolic random graphs HRG. Despite the wide success of the models, the parameter regime in which graph distances are polylogarithmic is poorly understood. We provide new and improved lower bounds. In a certain portion of the parameter regime, those match the known upper bounds.
Compared to the best previous lower bounds by Hao and Heydenreich, our result has several advantages: it gives matching bounds for a larger range of parameters, thus settling the question for a larger portion of the parameter space. It strictly improves the lower bounds by Hao and Heydenreich for all parameters settings in which those bounds were not tight. It gives tail bounds on the probability of having short paths, which imply shape theorems for the $k$-neighbourhood of a vertex whenever our lower bounds are tight, and tight bounds for the size of this $k$-neighbourhood. And last but not least, our proof is much simpler and not much longer than two pages, and we demonstrate that it generalizes well by showing that the same technique also works for first passage percolation. - [305] arXiv:2405.07315 (replaced) [pdf, ps, other]
-
Title: A Sharp condition on global wellposedness of Chern-Simons-Schr\"odinger equationSubjects: Analysis of PDEs (math.AP)
In this work, we derive a sharp condition on the mass of the initial data for the global existence of the Chern-Simons-Schrödinger equation. As a corollary, we prove that if the strength of interaction is less than the Bogomolny bound, then, for a large enough mass of initial data, there exists a globally defined solution. On the other hand, for the interactions which are above the Bogomolny bound, the critical mass condition on the initial data for the global existence depends on the strength of the self-interacting field. Then, we show that the states with the initial critical mass and zero energy are standing wave solutions and globally well-posed. Moreover, they are static if the self-interacting field is large enough and non-static for small self-interacting field.
- [306] arXiv:2405.07645 (replaced) [pdf, ps, other]
-
Title: Ergodicity of skew-products over typical IETsSubjects: Dynamical Systems (math.DS)
We prove ergodicity of a class of infinite measure preserving systems, called skew-products. More precisely, we consider systems of the form \[ {T_f}:{[0, 1) \times \mathbb{R}}\to{[0, 1) \times \mathbb{R}},\quad {T_f(x, t)}:={(T(x), t+f(x))}, \] where $T$ is an interval exchange transformation and $f$ is a piece-wise constant function with a finite number of discontinuities. We show that such system is ergodic with respect to ${Leb}_{[0,1)\times \mathbb{R}}$ for a typical choice of parameters of $T$ and $f$.
- [307] arXiv:2405.07683 (replaced) [pdf, ps, other]
-
Title: Integral means spectrum functionals on Teichmuller spacesComments: 22 pagesSubjects: Complex Variables (math.CV)
In this paper, we introduce and study the integral means spectrum (IMS) functionals on Teichmuller spaces. We show that the IMS functionals on the closure of the universal Teichmuller space and on the asymptotic universal Teichmuller space are both continuous. During the proof, we obtain some new results about the universal asymptotic Teichmuller space.
- [308] arXiv:2405.07910 (replaced) [pdf, ps, other]
-
Title: A Unification of Exchangeability and Continuous Exposure and Confounder Measurement Errors: Probabilistic ExchangeabilitySubjects: Statistics Theory (math.ST); Methodology (stat.ME)
Exchangeability concerning a continuous exposure, X, implies no confounding bias when identifying average exposure effects of X, AEE(X). When X is measured with error (Xep), two challenges arise in identifying AEE(X). Firstly, exchangeability regarding Xep does not equal exchangeability regarding X. Secondly, the necessity of the non-differential error assumption (NDEA), overly stringent in practice, remains uncertain. To address them, this article proposes unifying exchangeability and exposure and confounder measurement errors with three novel concepts. The first, Probabilistic Exchangeability (PE), states that the outcomes of those with Xep=e are probabilistically exchangeable with the outcomes of those truly exposed to X=eT. The relationship between AEE(Xep) and AEE(X) in risk difference and ratio scales is mathematically expressed as a probabilistic certainty, termed exchangeability probability (Pe). Squared Pe (Pe.sq) quantifies the extent to which AEE(Xep) differs from AEE(X) due to exposure measurement error through mechanisms not akin to confounding mechanisms. The coefficient of determination (R.sq) in the regression of X against Xep may sometimes be sufficient to measure Pe.sq. The second concept, Emergent Pseudo Confounding (EPC), describes the bias introduced by exposure measurement error through mechanisms akin to confounding mechanisms. PE can hold when EPC is controlled for, which is weaker than NDEA. The third, Emergent Confounding, describes when bias due to confounder measurement error arises. Adjustment for E(P)C can be performed like confounding adjustment to ensure PE. This paper provides formal justifications for using AEE(Xep) and maximum insight into potential divergence of AEE(Xep) from AEE(X) and how to measure it.
- [309] arXiv:2006.01830 (replaced) [pdf, ps, other]
-
Title: Tangles: a structural approach to artificial intelligence in the empirical sciences (Part I)Comments: The print edition of the book will appear later this year with CUP. An enhanced eBook edition and open-source software, with tutorials, are available already from this http URLSubjects: Artificial Intelligence (cs.AI); Combinatorics (math.CO)
Traditional clustering identifies groups of objects that share certain qualities. Tangles do the converse: they identify groups of qualities that often occur together. They can thereby discover, relate, and structure types: of behaviour, political views, texts, or viruses.
If desired, tangles can also be used as a new method for traditional clustering. They offer a precise, quantitative paradigm suited particularly to fuzzy clusters, since they do not require any assignment of objects to the clusters which these collectively form.
This is the first of four parts of a book with the above title. The book explores applications outside mathematics of the notion and theory of tangles generalised from the graph tangles know from graph minor theory. - [310] arXiv:2104.07694 (replaced) [pdf, ps, html, other]
-
Title: Zigzag path connects two Monte Carlo samplers: Hamiltonian counterpart to a piecewise deterministic Markov processSubjects: Computation (stat.CO); Probability (math.PR)
Zigzag and other piecewise deterministic Markov process samplers have attracted significant interest for their non-reversibility and other appealing properties for Bayesian posterior computation. Hamiltonian Monte Carlo is another state-of-the-art sampler, exploiting fictitious momentum to guide Markov chains through complex target distributions. We establish an important connection between the zigzag sampler and a variant of Hamiltonian Monte Carlo based on Laplace-distributed momentum. The position and velocity component of the corresponding Hamiltonian dynamics travels along a zigzag path paralleling the Markovian zigzag process; however, the dynamics is non-Markovian in this position-velocity space as the momentum component encodes non-immediate pasts. This information is partially lost during a momentum refreshment step, in which we preserve its direction but re-sample magnitude. In the limit of increasingly frequent momentum refreshments, we prove that Hamiltonian zigzag converges strongly to its Markovian counterpart. This theoretical insight suggests that, when retaining full momentum information, Hamiltonian zigzag can better explore target distributions with highly correlated parameters by suppressing the diffusive behavior of Markovian zigzag. We corroborate this intuition by comparing performance of the two zigzag cousins on high-dimensional truncated multivariate Gaussians, including a 11,235-dimensional target arising from a Bayesian phylogenetic multivariate probit modeling of HIV virus data.
- [311] arXiv:2104.13753 (replaced) [pdf, ps, other]
-
Title: Sum-of-norms clustering does not separate nearby ballsComments: 40 pages, 17 figures, published versionJournal-ref: Journal of Machine Learning Research, volume 25 (2024), no. 143, pp. 1--40Subjects: Machine Learning (cs.LG); Statistics Theory (math.ST)
Sum-of-norms clustering is a popular convexification of $K$-means clustering. We show that, if the dataset is made of a large number of independent random variables distributed according to the uniform measure on the union of two disjoint balls of unit radius, and if the balls are sufficiently close to one another, then sum-of-norms clustering will typically fail to recover the decomposition of the dataset into two clusters. As the dimension tends to infinity, this happens even when the distance between the centers of the two balls is taken to be as large as $2\sqrt{2}$. In order to show this, we introduce and analyze a continuous version of sum-of-norms clustering, where the dataset is replaced by a general measure. In particular, we state and prove a local-global characterization of the clustering that seems to be new even in the case of discrete datapoints.
- [312] arXiv:2107.10955 (replaced) [pdf, ps, other]
-
Title: Learning Linear Polytree Structural Equation ModelsComments: 35 pages, 5 figures, 4 tablesSubjects: Machine Learning (stat.ML); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
We are interested in the problem of learning the directed acyclic graph (DAG) when data are generated from a linear structural equation model (SEM) and the causal structure can be characterized by a polytree. Under the Gaussian polytree models, we study sufficient conditions on the sample sizes for the well-known Chow-Liu algorithm to exactly recover both the skeleton and the equivalence class of the polytree, which is uniquely represented by a CPDAG. On the other hand, necessary conditions on the required sample sizes for both skeleton and CPDAG recovery are also derived in terms of information-theoretic lower bounds, which match the respective sufficient conditions and thereby give a sharp characterization of the difficulty of these tasks. We also consider the problem of inverse correlation matrix estimation under the linear polytree models, and establish the estimation error bound in terms of the dimension and the total number of v-structures. We also consider an extension of group linear polytree models, in which each node represents a group of variables. Our theoretical findings are illustrated by comprehensive numerical simulations, and experiments on benchmark data also demonstrate the robustness of polytree learning when the true graphical structures can only be approximated by polytrees.
- [313] arXiv:2202.01103 (replaced) [pdf, ps, other]
-
Title: A New Temporal Interpretation of Cluster EditingComments: 27 pages, 2 figures. Extended abstract appeared at IWOCA 2022Subjects: Discrete Mathematics (cs.DM); Computational Complexity (cs.CC); Data Structures and Algorithms (cs.DS); Combinatorics (math.CO)
The NP-complete graph problem Cluster Editing seeks to transform a static graph into a disjoint union of cliques by making the fewest possible edits to the edges. We introduce a natural interpretation of this problem in temporal graphs, whose edge sets change over time. This problem is NP-complete even when restricted to temporal graphs whose underlying graph is a path, but we obtain two polynomial-time algorithms for restricted cases. In the static setting, it is well-known that a graph is a disjoint union of cliques if and only if it contains no induced copy of $P_3$; we demonstrate that no general characterisation involving sets of at most four vertices can exist in the temporal setting, but obtain a complete characterisation involving forbidden configurations on at most five vertices. This characterisation gives rise to an FPT algorithm parameterised simultaneously by the permitted number of modifications and the lifetime of the temporal graph.
- [314] arXiv:2208.03123 (replaced) [pdf, ps, html, other]
-
Title: Watson-Crick conjugates of words and languagesSubjects: Formal Languages and Automata Theory (cs.FL); Combinatorics (math.CO)
This paper explores the concept of Watson-Crick conjugates, also known as $\theta$-conjugates, of words and languages. This concept extends the classical idea of conjugates by incorporating the Watson-Crick complementarity of DNA sequences, from the perspective of DNA computing. Our investigation initially focuses on the properties of $\theta$-conjugates of words. We then define $\theta$-conjugates of a language and study closure properties of certain families of languages under the $\theta$-conjugate operation. Furthermore, we analyze the iterated $\theta$-conjugate of both words and languages. Finally, we delve into the idea of $\theta$-conjugate-free languages and examine the decidability problems surrounding $\theta$-conjugate-freeness for different classes of languages
- [315] arXiv:2208.03411 (replaced) [pdf, ps, other]
-
Title: Expanded-clique graphs and the domination problemComments: 17 pages, 5 figuresSubjects: Discrete Mathematics (cs.DM); Combinatorics (math.CO)
Given a graph $G$ such that each vertex $v_i$ has a value $f(v_i)$, the expanded-clique graph $H$ is the graph where each vertex $v_i$ of $G$ becomes a clique $V_i$ of size $f(v_i)$ and for each edge $v_iv_j \in E(G)$, there is a vertex of $V_i$ adjacent to an exclusive vertex of $V_j$. In this work, among the results, we present two characterizations of the expanded-clique graphs, one of them leads to a linear-time recognition algorithm. Regarding the domination number, we show that this problem is \NP-complete for planar bipartite $3$-expanded-clique graphs and for cubic line graphs of bipartite graphs.
- [316] arXiv:2209.07548 (replaced) [pdf, ps, other]
-
Title: Open Set Recognition For Music Genre ClassificationComments: 9 pages, 5 figures, 4 tablesSubjects: Audio and Speech Processing (eess.AS); Optimization and Control (math.OC)
We explore segmentation of known and unknown genre classes using the open source GTZAN and FMA datasets. For each, we begin with best-case closed set genre classification, then we apply open set recognition methods. We offer an algorithm for the music genre classification task using OSR. We demonstrate the ability to retrieve known genres and as well identification of aural patterns for novel genres (not appearing in a training set). We conduct four experiments, each containing a different set of known and unknown classes, using the GTZAN and the FMA datasets to establish a baseline capacity for novel genre detection. We employ grid search on both OpenMax and softmax to determine the optimal total classification accuracy for each experimental setup, and illustrate interaction between genre labelling and open set recognition accuracy.
- [317] arXiv:2211.11810 (replaced) [pdf, ps, html, other]
-
Title: Sample-optimal classical shadows for pure statesComments: 34 pages; v2 - journal versionSubjects: Quantum Physics (quant-ph); Information Theory (cs.IT); Machine Learning (cs.LG)
We consider the classical shadows task for pure states in the setting of both joint and independent measurements. The task is to measure few copies of an unknown pure state $\rho$ in order to learn a classical description which suffices to later estimate expectation values of observables. Specifically, the goal is to approximate $\mathrm{Tr}(O \rho)$ for any Hermitian observable $O$ to within additive error $\epsilon$ provided $\mathrm{Tr}(O^2)\leq B$ and $\lVert O \rVert = 1$. Our main result applies to the joint measurement setting, where we show $\tilde{\Theta}(\sqrt{B}\epsilon^{-1} + \epsilon^{-2})$ samples of $\rho$ are necessary and sufficient to succeed with high probability. The upper bound is a quadratic improvement on the previous best sample complexity known for this problem. For the lower bound, we see that the bottleneck is not how fast we can learn the state but rather how much any classical description of $\rho$ can be compressed for observable estimation. In the independent measurement setting, we show that $\mathcal O(\sqrt{Bd} \epsilon^{-1} + \epsilon^{-2})$ samples suffice. Notably, this implies that the random Clifford measurements algorithm of Huang, Kueng, and Preskill, which is sample-optimal for mixed states, is not optimal for pure states. Interestingly, our result also uses the same random Clifford measurements but employs a different estimator.
- [318] arXiv:2212.02387 (replaced) [pdf, ps, other]
-
Title: An Efficient Stochastic Algorithm for Decentralized Nonconvex-Strongly-Concave Minimax OptimizationSubjects: Machine Learning (cs.LG); Optimization and Control (math.OC)
This paper studies the stochastic nonconvex-strongly-concave minimax optimization over a multi-agent network. We propose an efficient algorithm, called Decentralized Recursive gradient descEnt Ascent Method (DREAM), which achieves the best-known theoretical guarantee for finding the $\epsilon$-stationary points. Concretely, it requires $\mathcal{O}(\min (\kappa^3\epsilon^{-3},\kappa^2 \sqrt{N} \epsilon^{-2} ))$ stochastic first-order oracle (SFO) calls and $\tilde{\mathcal{O}}(\kappa^2 \epsilon^{-2})$ communication rounds, where $\kappa$ is the condition number and $N$ is the total number of individual functions. Our numerical experiments also validate the superiority of DREAM over previous methods.
- [319] arXiv:2304.12690 (replaced) [pdf, ps, other]
-
Title: The Generations of Classical Correlations via Quantum SchemesComments: 18 pages, no figures. To appear in IEEE Transactions on Information Theory. Comments are welcomeSubjects: Quantum Physics (quant-ph); Information Theory (cs.IT)
Suppose two separated parties, Alice and Bob, share a bipartite quantum state or a classical correlation called a \emph{seed}, and they try to generate a target classical correlation by performing local quantum or classical operations on the seed, i.e., any communications are not allowed. We consider the following fundamental problem about this setting: whether Alice and Bob can use a given seed to generate a target classical correlation. We show that this problem has rich mathematical structures. Firstly, we prove that even if the seed is a pure bipartite state, the above decision problem is already NP-hard and a similar conclusion can also be drawn when the seed is also a classical correlation, implying that this problem is hard to solve generally. Furthermore, we prove that when the seed is a pure quantum state, solving the problem is equivalent to finding out whether the target classical correlation has some diagonal form of positive semi-definite factorizations that matches the seed pure state, revealing an interesting connection between the current problem and optimization theory. Based on this observation and other insights, we give several necessary conditions where the seed pure state has to satisfy to generate the target classical correlation, and it turns out that these conditions can also be generalized to the case that the seed is a mixed quantum state. Lastly, since diagonal forms of positive semi-definite factorizations play a crucial role in solving the problem, we develop an algorithm that can compute them for an arbitrary classical correlation, which has decent performance on the cases we test.
- [320] arXiv:2305.10849 (replaced) [pdf, ps, html, other]
-
Title: Extreme ATM skew in a local volatility model with discontinuity: joint density approachComments: To appear in Finance and StochasticsSubjects: Mathematical Finance (q-fin.MF); Probability (math.PR)
This paper concerns a local volatility model in which volatility takes two possible values, and the specific value depends on whether the underlying price is above or below a given threshold value. The model is known, and a number of results have been obtained for it. In particular, a power law behaviour of the implied volatility skew has been established in the case when the threshold is taken at the money. This result as well as some others have been obtained by techniques based on the Laplace transform. The purpose of this paper is to demonstrate how to obtain similar results by another method. The proposed alternative approach is based on the natural relationship of the model with Skew Brownian motion and consists of the systematic use of the joint distribution of this stochastic process and some of its functionals.
- [321] arXiv:2307.06306 (replaced) [pdf, ps, other]
-
Title: Locally Adaptive Federated LearningComments: 29 pages, 9 figuresSubjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Federated learning is a paradigm of distributed machine learning in which multiple clients coordinate with a central server to learn a model, without sharing their own training data. Standard federated optimization methods such as Federated Averaging (FedAvg) ensure balance among the clients by using the same stepsize for local updates on all clients. However, this means that all clients need to respect the global geometry of the function which could yield slow convergence. In this work, we propose locally adaptive federated learning algorithms, that leverage the local geometric information for each client function. We show that such locally adaptive methods with uncoordinated stepsizes across all clients can be particularly efficient in interpolated (overparameterized) settings, and analyze their convergence in the presence of heterogeneous data for convex and strongly convex settings. We validate our theoretical claims by performing illustrative experiments for both i.i.d. non-i.i.d. cases. Our proposed algorithms match the optimization performance of tuned FedAvg in the convex setting, outperform FedAvg as well as state-of-the-art adaptive federated algorithms like FedAMS for non-convex experiments, and come with superior generalization performance.
- [322] arXiv:2307.12597 (replaced) [pdf, ps, other]
-
Title: A unified perspective on exponential tilt and bridge algorithms for rare trajectories of discrete Markov processesSubjects: Statistical Mechanics (cond-mat.stat-mech); Probability (math.PR); Computational Physics (physics.comp-ph)
This article analyzes and compares two general techniques of rare event simulation for generating paths of Markov processes over fixed time horizons: exponential tilting and stochastic bridge. These two methods allow to accurately compute the probability that a Markov process ends within a rare region, which is unlikely to be attained. Exponential tilting is a general technique for obtaining an alternative or tilted sampling probability measure, under which the Markov process becomes likely to hit the rare region at terminal time. The stochastic bridge technique involves conditioning paths towards two endpoints: the terminal point and the initial one. The terminal point is generated from some appropriately chosen probability distribution that covers well the rare region. We show that both methods belong to the class of importance sampling procedures, by providing a common mathematical framework of these two conceptually different methods of sampling rare trajectories. We also conduct a numerical comparison of these two methods, revealing distinct areas of application for each Monte Carlo method, where they exhibit superior efficiency. Detailed simulation algorithms are provided.
- [323] arXiv:2308.01886 (replaced) [pdf, ps, html, other]
-
Title: Magic of quantum hypergraph statesComments: published version: 20+17 pages, 4 figures, comments are welcomeSubjects: Quantum Physics (quant-ph); Statistical Mechanics (cond-mat.stat-mech); Mathematical Physics (math-ph)
Magic, or nonstabilizerness, characterizes the deviation of a quantum state from the set of stabilizer states and plays a fundamental role from quantum state complexity to universal fault-tolerant quantum computing. However, analytical or even numerical characterizations of magic are very challenging, especially in the multi-qubit system, even with a moderate qubit number. Here we systemically and analytically investigate the magic resource of archetypal multipartite quantum states -- quantum hypergraph states, which can be generated by multi-qubit Controlled-phase gates encoded by hypergraphs. We first give the magic formula in terms of the stabilizer R$\mathrm{\acute{e}}$nyi-$\alpha$ entropies for general quantum hypergraph states and prove the magic can not reach the maximal value, if the average degree of the corresponding hypergraph is constant. Then we investigate the statistical behaviors of random hypergraph states and prove the concentration result that typically random hypergraph states can reach the maximal magic. This also suggests an efficient way to generate maximal magic states with random diagonal circuits. Finally, we study some highly symmetric hypergraph states with permutation-symmetry, such as the one whose associated hypergraph is $3$-complete, i.e., any three vertices are connected by a hyperedge. Counterintuitively, such states can only possess constant or even exponentially small magic for $\alpha\geq 2$. Our study advances the understanding of multipartite quantum magic and could lead to applications in quantum computing and quantum many-body physics.
- [324] arXiv:2310.02700 (replaced) [pdf, ps, html, other]
-
Title: Insights of using Control Theory for minimizing Induced Seismicity in Underground ReservoirsSubjects: Systems and Control (eess.SY); Optimization and Control (math.OC)
Deep Geothermal Energy, Carbon Capture, and Storage and Hydrogen Storage have significant potential to meet the large-scale needs of the energy sector and reduce the CO$_2$ emissions. However, the injection of fluids into the earth's crust, upon which these activities rely, can lead to the formation of new seismogenic faults or the reactivation of existing ones, thereby causing earthquakes. In this study, we propose a novel approach based on control theory to address this issue. First, we obtain a simplified model of induced seismicity due to fluid injections in an underground reservoir using a diffusion equation in three dimensions. Then, we design a robust tracking control approach to force the seismicity rate to follow desired references. In this way, the induced seismicity is minimized while ensuring fluid circulation for the needs of renewable energy production and storage. The designed control guarantees the achievement of the control objectives even in the presence of system uncertainties and unknown dynamics. Finally, we present simulations of a simplified geothermal reservoir under different scenarios of energy demand to show the reliability and performance of the control approach, opening new perspectives for field experiments based on real-time regulators.
- [325] arXiv:2310.19100 (replaced) [pdf, ps, other]
-
Title: The allocation of FIFA World Cup slots based on the ranking of confederationsComments: 21 pages, 2 figures, 6 tablesSubjects: General Economics (econ.GN); Optimization and Control (math.OC)
Qualifications for several world championships in sports are organised such that distinct sets of teams play in their own tournament for a predetermined number of slots. Inspired by a recent work studying the problem with the tools from the literature on fair allocation, this paper provides an alternative approach based on historical matches between these sets of teams. We focus on the FIFA World Cup due to the existence of an official rating system and its recent expansion to 48 teams, as well as to allow for a comparison with the already suggested allocations. Our proposal extends the methodology of the FIFA World Ranking to compare the strengths of five confederations. Various allocations are presented depending on the length of the sample, the set of teams considered, as well as the frequency of rating updates. The results show that more European and South American teams should play in the FIFA World Cup. The ranking of continents by the number of deserved slots is different from the ranking implied by FIFA policy. We recommend allocating at least some slots transparently, based on historical performances, similar to the access list of the UEFA Champions League.
- [326] arXiv:2311.07287 (replaced) [pdf, ps, other]
-
Title: Integral of depth zero to three basis of Modular Graph FunctionsComments: 38 pages, v3: reference added, major corrections in final resultsSubjects: High Energy Physics - Theory (hep-th); Number Theory (math.NT)
Modular Graph Functions (MGFs) are SL(2,$\mathbb{Z}$)-invariant functions that emerge in the study of the low-energy expansion of the one-loop closed string amplitude. To find the string scattering amplitude, we must integrate MGFs over the moduli space of the torus. In this paper, we use the iterated integral representation of MGFs to establish a depth-dependent basis for them, where "depth" refers to the number of iterations in the integral. This basis has a suitable Laplace equation. We integrate this basis from depth zero to depth three over the fundamental domain of SL(2,$\mathbb{Z}$) with a cut-off.
- [327] arXiv:2312.01991 (replaced) [pdf, ps, html, other]
-
Title: Information Modified K-Nearest NeighborMohammad Ali Vahedifar, Azim Akhtarshenas, Maryam Sabbaghian, Mohammad Mohammadi Rafatpanah, Ramin ToosiSubjects: Machine Learning (cs.LG); Information Theory (cs.IT)
The fundamental concept underlying K-Nearest Neighbors (KNN) is the classification of samples based on the majority through their nearest neighbors. Although distance and neighbors' labels are critical in KNN, traditional KNN treats all samples equally. However, some KNN variants weigh neighbors differently based on a specific rule, considering each neighbor's distance and label. Many KNN methodologies introduce complex algorithms that do not significantly outperform the traditional KNN, often leading to less satisfactory outcomes. The gap in reliably extracting information for accurately predicting true weights remains an open research challenge. In our proposed method, information-modified KNN (IMKNN), we bridge the gap by presenting a straightforward algorithm that achieves effective results. To this end, we introduce a classification method to improve the performance of the KNN algorithm. By exploiting mutual information (MI) and incorporating ideas from Shapley's values, we improve the traditional KNN performance in accuracy, precision, and recall, offering a more refined and effective solution.
To evaluate the effectiveness of our method, it is compared with eight variants of KNN. We conduct experiments on 12 widely-used datasets, achieving 11.05\%, 12.42\%, and 12.07\% in accuracy, precision, and recall performance, respectively, compared to traditional KNN. Additionally, we compared IMKNN with traditional KNN across four large-scale datasets to highlight the distinct advantages of IMKNN in the impact of monotonicity, noise, density, subclusters, and skewed distributions. Our research indicates that IMKNN consistently surpasses other methods in diverse datasets. - [328] arXiv:2402.08555 (replaced) [pdf, ps, other]
-
Title: In-in correlators and scattering amplitudes on a causal setComments: 34 pagesJournal-ref: Phys. Rev. D 109, 106014 (2024)Subjects: High Energy Physics - Theory (hep-th); General Relativity and Quantum Cosmology (gr-qc); Mathematical Physics (math-ph)
Causal set theory is an approach to quantum gravity in which spacetime is fundamentally discrete at the Planck scale and takes the form of a Lorentzian lattice, or "causal set", from which continuum spacetime emerges in a large-scale (low-energy) approximation. In this work, we present new developments in the framework of interacting quantum field theory on causal sets. We derive a diagrammatic expansion for in-in correlators in local scalar field theories with finite polynomial interactions. We outline how these same correlators can be computed using the double-path integral which acts as a generating functional for the in-in correlators. We modify the in-in generating functional to obtain a generating functional for in-out correlators. We define a notion of scattering amplitudes on causal sets with non-interacting past and future regions and verify that they are given by S-matrix elements (matrix elements of the time-evolution operator). We describe how these formal developments can be implemented to compute early universe observables under the assumption that spacetime is fundamentally discrete.
- [329] arXiv:2402.08621 (replaced) [pdf, ps, html, other]
-
Title: A Generalized Approach to Online Convex OptimizationSubjects: Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
In this paper, we analyze the problem of online convex optimization in different settings. We show that any algorithm for online linear optimization with fully adaptive adversaries is an algorithm for online convex optimization. We also show that any such algorithm that requires full-information feedback may be transformed to an algorithm with semi-bandit feedback with comparable regret bound. We further show that algorithms that are designed for fully adaptive adversaries using deterministic semi-bandit feedback can obtain similar bounds using only stochastic semi-bandit feedback when facing oblivious adversaries. We use this to describe general meta-algorithms to convert first order algorithms to zeroth order algorithms with comparable regret bounds. Our framework allows us to analyze online optimization in various settings, such full-information feedback, bandit feedback, stochastic regret, adversarial regret and various forms of non-stationary regret.
- [330] arXiv:2403.04446 (replaced) [pdf, ps, other]
-
Title: Weak Hopf symmetry and tube algebra of the generalized multifusion string-net modelComments: v1: 64 pagesSubjects: High Energy Physics - Theory (hep-th); Strongly Correlated Electrons (cond-mat.str-el); Mathematical Physics (math-ph); Quantum Algebra (math.QA); Quantum Physics (quant-ph)
We investigate the multifusion generalization of string-net ground states and lattice Hamiltonians, delving into its associated weak Hopf symmetry. For the multifusion string-net, the gauge symmetry manifests as a general weak Hopf algebra, leading to a reducible vacuum string label; the charge symmetry, serving as a quantum double of gauge symmetry, constitutes a connected weak Hopf algebra. This implies that the associated topological phase retains its characterization by a unitary modular tensor category (UMTC). The bulk charge symmetry can also be captured by a weak Hopf tube algebra. We offer an explicit construction of the weak Hopf tube algebra structure and thoroughly discuss its properties. The gapped boundary and domain wall models are extensively discussed, with these $1d$ phases characterized by unitary multifusion categories (UMFCs). We delve into the gauge and charge symmetries of these $1d$ phases, as well as the construction of the boundary and domain wall tube algebras. Additionally, we illustrate that the domain wall tube algebra can be regarded as a cross product of two boundary tube algebras. As an application of our model, we elucidate how to interpret the defective string-net as a restricted multifusion string-net.
- [331] arXiv:2403.08409 (replaced) [pdf, ps, other]
-
Title: On the universal properties of stochastic processes under optimally tuned Poisson restartComments: 5 pagesSubjects: Statistical Mechanics (cond-mat.stat-mech); Probability (math.PR)
Poisson restart assumes that a stochastic process is interrupted and starts again at random time moments. A number of studies have demonstrated that this strategy may minimize the expected completion time in some classes of random search tasks. What is more, it turned out that under optimally tuned restart rate, any stochastic process, regardless of its nature and statistical details, satisfies a number of universal relations for the statistical moments of completion time. In this paper, we describe several new universal properties of optimally restarted processes. Also we obtain a universal inequality for the quadratic statistical moments of completion time in the optimization problem where stochastic process has several possible completion scenarios.
- [332] arXiv:2403.13871 (replaced) [pdf, ps, other]
-
Title: Exact solution for the collective non-Markovian decay of two fully excited quantum emittersSubjects: Quantum Physics (quant-ph); Mathematical Physics (math-ph)
Waveguide quantum electrodynamics constitutes a modern paradigm for the interaction of light and matter, in which strong coupling, bath structure, and propagation delays can break the radiative conditions that quantum emitters typically encounter in free space. These characteristics intertwine the excitations of quantum emitters and guided radiation modes to form complex multiphoton dynamics. So far, combining the collective decay of the emitters with the non-Markovian effects induced by the modes has escaped a full solution and the detailed physics behind these systems remains unknown. Here we analyze such a collective non-Markovian decay in a minimal system of two excited emitters coupled to a one-dimensional single-band waveguide. We develop an exact solution for this system in terms of elementary functions that unveils hidden symmetries and predicts new forms of spontaneous decay. The collective non-Markovian dynamics, which are strongly dependent on the vacuum coupling and the detuning from the center of the band, show exotic features that can be characterized with a simple and readily available criterion. Our analytic methods shed light on the complexity of collective light-matter interactions and open up a pathway for understanding multiparticle open quantum systems.
- [333] arXiv:2404.00793 (replaced) [pdf, ps, other]
-
Title: Learning the mechanisms of network growthComments: Main text: 13 pages, 4 figures. Supplementary: 12 pages; Rewording throughout and elaboration in Section 3Subjects: Social and Information Networks (cs.SI); Probability (math.PR); Machine Learning (stat.ML)
We propose a novel model-selection method for dynamic networks. Our approach involves training a classifier on a large body of synthetic network data. The data is generated by simulating nine state-of-the-art random graph models for dynamic networks, with parameter range chosen to ensure exponential growth of the network size in time. We design a conceptually novel type of dynamic features that count new links received by a group of vertices in a particular time interval. The proposed features are easy to compute, analytically tractable, and interpretable. Our approach achieves a near-perfect classification of synthetic networks, exceeding the state-of-the-art by a large margin. Applying our classification method to real-world citation networks gives credibility to the claims in the literature that models with preferential attachment, fitness and aging fit real-world citation networks best, although sometimes, the predicted model does not involve vertex fitness.
- [334] arXiv:2404.15964 (replaced) [pdf, ps, html, other]
-
Title: Complex Stochastic Optimal Control Foundation of Quantum MechanicsSubjects: Quantum Physics (quant-ph); Mathematical Physics (math-ph)
Recent studies have extended the use of the stochastic Hamilton-Jacobi-Bellman (HJB) equation to include complex variables for deriving quantum mechanical equations. However, these studies often assume that it is valid to apply the HJB equation directly to complex numbers, an approach that overlooks the fundamental problem of comparing complex numbers to find optimal controls. This paper explores how to correctly apply the HJB equation in the context of complex variables. Our findings significantly reevaluate the stochastic movement of quantum particles within the framework of stochastic optimal control theory. We derived the complex diffusion coefficient in the stochastic equation of motion using the Cauchy-Riemann theorem, considering that the particle's stochastic movement is described by two perfectly correlated real and imaginary stochastic processes. We demonstrated that the derived diffusion coefficient took a form that allowed the HJB equation to be linearized, thereby leading to the derivation of the Dirac equations. These insights deepen our understanding of quantum dynamics and enhance the mathematical rigor of the framework for applying stochastic optimal control to quantum mechanics.
- [335] arXiv:2404.18324 (replaced) [pdf, ps, other]
-
Title: Casimir force within Ising chain with competing interactionsComments: 6 pages, 3 figuresSubjects: Statistical Mechanics (cond-mat.stat-mech); Soft Condensed Matter (cond-mat.soft); Mathematical Physics (math-ph)
We derive exact results for the critical Casimir force (CCF) within the one-dimensional Ising model with periodic boundary conditions (PBC's) and long-range equivalent-neighbor ferromagnetic interactions of strength $J_{l}/N>0$ superimposed on the nearest-neighbor interactions of strength $J_{s}$ which could be either ferromagnetic ($J_{s}>0$) or antiferromagnetic ($J_{s}<0$). In the infinite system limit the model, also known as the Nagle-Kardar model, exhibits in the plane $(K_s=\beta J_s,K_l=\beta J_l)$ a critical line $2 K_l=\exp{\left(-2 K_s\right)}, K_s>-\ln3/4$, which ends at a tricritical point $(K_l=-\sqrt{3}/2, K_s=-\ln3/4)$. The critical Casimir amplitudes are: $\Delta_{\rm Cas}^{\rm (cr)}=1/4$ at the critical line, and $\Delta_{\rm Cas}^{\rm (tr)}=1/3$ at the tricritical point. Quite unexpectedly, with the imposed PBC's the CCF exhibits very unusual behavior as a function of temperature and magnetic field. It is repulsive near the critical line and tricritical point, decaying rapidly with separation from those two singular regimes fast away from them and becoming attractive, displaying in which the maximum amplitude of the attraction exceeds the maximum amplitude of repulsion. This represents a violation of the widely-accepted "boundary condition rule", which holds that the CCF is attractive for equivalent BC's and repulsive for conflicting BC's independently of the actual bulk universality class of the phase transition under investigation.
- [336] arXiv:2405.03564 (replaced) [pdf, ps, other]
-
Title: Connection formulae in the Collision Limit I: Case Studies in Lifshitz GeometryComments: 21+9 pagesSubjects: High Energy Physics - Theory (hep-th); High Energy Astrophysical Phenomena (astro-ph.HE); General Relativity and Quantum Cosmology (gr-qc); Mathematical Physics (math-ph)
The connection formulae provide a systematic way to compute physical quantities, such as the quasinormal modes, Green functions, in blackhole perturbation theories. In this work, we test whether it is possible to consistently take the collision limit, which bring two or more regular singularities into an irregular one, of the connection formulae, and we provide some supportive evidence for it.
- [337] arXiv:2405.05236 (replaced) [pdf, ps, html, other]
-
Title: Stability and Performance Analysis of Discrete-Time ReLU Recurrent Neural NetworksSubjects: Systems and Control (eess.SY); Machine Learning (cs.LG); Optimization and Control (math.OC)
This paper presents sufficient conditions for the stability and $\ell_2$-gain performance of recurrent neural networks (RNNs) with ReLU activation functions. These conditions are derived by combining Lyapunov/dissipativity theory with Quadratic Constraints (QCs) satisfied by repeated ReLUs. We write a general class of QCs for repeated RELUs using known properties for the scalar ReLU. Our stability and performance condition uses these QCs along with a "lifted" representation for the ReLU RNN. We show that the positive homogeneity property satisfied by a scalar ReLU does not expand the class of QCs for the repeated ReLU. We present examples to demonstrate the stability / performance condition and study the effect of the lifting horizon.
- [338] arXiv:2405.07890 (replaced) [pdf, ps, other]
-
Title: Subspace-Informed Matrix CompletionComments: arXiv admin note: text overlap with arXiv:2111.00235Subjects: Signal Processing (eess.SP); Information Theory (cs.IT)
In this work, we consider the matrix completion problem, where the objective is to reconstruct a low-rank matrix from a few observed entries. A commonly employed approach involves nuclear norm minimization. For this method to succeed, the number of observed entries needs to scale at least proportional to both the rank of the ground-truth matrix and the coherence parameter. While the only prior information is oftentimes the low-rank nature of the ground-truth matrix, in various real-world scenarios, additional knowledge about the ground-truth low-rank matrix is available. For instance, in collaborative filtering, Netflix problem, and dynamic channel estimation in wireless communications, we have partial or full knowledge about the signal subspace in advance. Specifically, we are aware of some subspaces that form multiple angles with the column and row spaces of the ground-truth matrix. Leveraging this valuable information has the potential to significantly reduce the required number of observations. To this end, we introduce a multi-weight nuclear norm optimization problem that concurrently promotes the low-rank property as well the information about the available subspaces. The proposed weights are tailored to penalize each angle corresponding to each basis of the prior subspace independently. We further propose an optimal weight selection strategy by minimizing the coherence parameter of the ground-truth matrix, which is equivalent to minimizing the required number of observations. Simulation results validate the advantages of incorporating multiple weights in the completion procedure. Specifically, our proposed multi-weight optimization problem demonstrates a substantial reduction in the required number of observations compared to the state-of-the-art methods.
- [339] arXiv:2405.07979 (replaced) [pdf, ps, other]
-
Title: Low-order outcomes and clustered designs: combining design and analysis for causal inference under network interferenceSubjects: Methodology (stat.ME); Statistics Theory (math.ST)
Variance reduction for causal inference in the presence of network interference is often achieved through either outcome modeling, which is typically analyzed under unit-randomized Bernoulli designs, or clustered experimental designs, which are typically analyzed without strong parametric assumptions. In this work, we study the intersection of these two approaches and consider the problem of estimation in low-order outcome models using data from a general experimental design. Our contributions are threefold. First, we present an estimator of the total treatment effect (also called the global average treatment effect) in a low-degree outcome model when the data are collected under general experimental designs, generalizing previous results for Bernoulli designs. We refer to this estimator as the pseudoinverse estimator and give bounds on its bias and variance in terms of properties of the experimental design. Second, we evaluate these bounds for the case of cluster randomized designs with both Bernoulli and complete randomization. For clustered Bernoulli randomization, we find that our estimator is always unbiased and that its variance scales like the smaller of the variance obtained from a low-order assumption and the variance obtained from cluster randomization, showing that combining these variance reduction strategies is preferable to using either individually. For clustered complete randomization, we find a notable bias-variance trade-off mediated by specific features of the clustering. Third, when choosing a clustered experimental design, our bounds can be used to select a clustering from a set of candidate clusterings. Across a range of graphs and clustering algorithms, we show that our method consistently selects clusterings that perform well on a range of response models, suggesting that our bounds are useful to practitioners.