Bitinforming Intelligence Brief

Track Record · last 365 days

34.1%

5-day hit rate · n=1374

-2.25%

20-day alpha vs SPY · n=969

26%

20-day win rate vs benchmark

1522

of 2042 calls scored

Top call: SPCE +10.9% over 5d (2026-05-01)

Worst call: SPCE -45.2% over 5d (2026-06-01)

Executive Brief

→airflow is the dominant theme today — 8 stories surfaced, running 7× its baseline.

→Lead story: VEXAIoT: Autonomous IoT Vulnerability EXploitation using AI Agents — exposure across MSFT, GOOGL, META, NVDA, MDB, PATH, BBAI, AI.

→Also notable: ConceptSMILE: Auditing the Trustworthiness of Concept-Based Explainable AI — touches NVDA, MSFT, GOOGL, META.

→Cross-cutting: airflow is showing up around AMD, GOOGL, META, MSFT.

Large-cap exposure

Market movers

arXiv Preprints10

AX2026-07-10T17:46:32Z

B-spline Policy: Accelerating Manipulation Policies via B-spline Action Representations

In this work, we present B-spline Policy (BSP), an action representation designed for accelerating robot manipulation policies. Rather than predicting discrete-time action chunks, BSP parameterizes actions as continuous B-spline curves defined by a set of knots and control points. This representation yields smooth, time-continuous trajectories that can be temporally scaled and executed by low-level controllers at higher frequencies and speeds. We show that B-spline-parameterized actions can be seamlessly integrated into standard policy learning pipelines by directly predicting B-spline parameters. Experiments on simulated and real-world tasks demonstrate that BSP significantly reduces task completion time, achieving substantial improvements over baseline methods while maintaining strong success rates. More results: https://b-spline-policy.github.io

AX2026-07-10T16:42:17Z

PAC-ACT: Post-training Actor-Critic for Action Chunking Transformers

Precision industrial contact manipulation requires reliable robot policies under pose perturbations and contact-force constraints. Vision-language-action models offer broad generalization but often introduce high inference latency and GPU-memory cost, while vision-action chunking policies are more suitable for real-time industrial control. However, these policies are usually trained by behavior cloning and suffer from distribution shift in contact-rich tasks. This paper proposes PAC-ACT, a reinforcement-learning post-training framework for pretrained Action Chunking Transformer policies. PAC-ACT reformulates policy optimization at the chunk level, constructs an ACT-transferred actor-critic architecture, and introduces a hybrid behavior-prior constraint to preserve the pretrained action distribution during online fine-tuning. Experiments on industrial precision-contact benchmarks show that PAC-ACT improves task success, contact stability, and force safety while retaining low latency and low GPU-memory usage. On the Contour task, PAC-ACT significantly reduces peak contact force and decreases the proportion of force readings above 60 N by 46 times. Sparse-reward ablations further show that the proposed behavior-prior constraint enables effective exploration under randomized initial poses.

AX2026-07-10T16:36:45Z

CoDiMAD: Diffusion-Based Privileged Distillation for Communication-Free Multi-Robot Coordination

Decentralized multi-robot coordination under partial observability remains challenging, especially in communication-free settings where agents must act solely from local sensor observations. Privileged policy distillation provides a promising approach by transferring knowledge from a globally informed oracle to sensor-constrained students. However, in multi-agent systems, the same local observation may correspond to multiple global configurations requiring qualitatively different cooperative actions, making the conditional action distribution inherently multi-modal. Standard deterministic distillation collapses these modes to their mean, often yielding invalid or hesitant actions. To address this issue, we propose CoDiMAD, a three-stage framework that trains a privileged oracle with MAPPO, constructs an offline dataset of local-observation-oracle-action pairs, and distills the oracle into decentralized students parameterized as conditional denoising diffusion probabilistic models. By approximating the conditional oracle-action distribution through the diffusion reverse process, CoDiMAD samples decisive actions from coherent coordination modes rather than averaging across them. Theoretical analysis characterizes the mode-averaging failure of deterministic distillation and the distributional recovery property of diffusion-based distillation. Experiments on three cooperative tasks show that CoDiMAD consistently outperforms direct local MARL and deterministic distillation baselines. The source code will be made publicly available upon acceptance.

AX2026-07-10T16:02:55Z

CORAL-AUV: CFD Oriented Reinforcement Learning for Autonomous Underwater Vehicles

Fine grain control and positioning of autonomous underwater vehicles (AUVs) is critical for sampling, maintenance, and survey applications. Traditional control methods for AUVs are labor intensive and are not robust to changes in the vehicle configuration or environmental conditions. Reinforcement learning (RL) promises rapid controller development while handling a range of deployment parameters via domain randomization (DR). However, DR is still limited by the capacity of the underlying simulation to model real physics. In particular, drag physics are difficult to model and are a large contributor to sim-to-real gaps. Meanwhile, computational fluid dynamics (CFD) provides high fidelity drag models but is challenging to leverage within reinforcement learning frameworks due to its computational overhead. Thus, in this paper we exploit the idea of training surrogate approximations of CFD models of a given vehicle, enabling fast inference within RL pipelines. We are the first to successfully deploy a zero-shot RL policy on a 6-DOF AUV in which policy training is performed on surrogate drag models (SDMs) trained on CFD data. We find 31% lower energy usage compared to a controller using simplified physics while traversing between waypoints 11% faster with 19% less error. Our SDM based RL controller better predicts zero-shot transfer and is more robust across reward shaping design choices. When using DR to complete a task with perturbed parameters, we find that the CFD policy is the only controller that successfully transfers. The policies are evaluated in a controlled tank environment and in the field providing extensive testing of the policies' capabilities.

AX2026-07-10T15:58:37Z

Task-Adaptive Design of Modular Aerial Manipulators Under Airflow Exposure Constraints

Aerial manipulation with multirotor platforms enables physical interaction in complex environments, but rotor-induced airflow remains a critical limitation for tasks involving airflow-sensitive targets or surroundings. This paper presents an optimization-based design framework for modular aerial manipulators that jointly considers task wrench feasibility, end-effector placement, and airflow exposure constraints. We first introduce a novel categorization of target-side airflow tolerance and formulate the corresponding exposure requirements as geometric constraints. To efficiently model rotor-induced airflow, we introduce a compact cone-sphere envelope that approximates the spreading structure of a quadrotor's airflow while preserving computational tractability for optimization. Building on this formulation, we propose a reconfiguration optimization that adapts a modular aerial manipulator to diverse task wrench requirements while enforcing both target-side airflow exposure and intra-platform airflow interference constraints. Unlike prior designs that assume a fixed end-effector location, the proposed framework optimizes the end-effector placement together with the platform configuration. Scalability experiments and ablation studies validate the effectiveness of the proposed framework.

AX2026-07-10T17:59:45Z

PHINN-EEG: Topological Time-Series Analysis of Dream-State EEG -- Dynamic Betti Curves for Dream Content Classification and Topology-Conditioned Neural Signal Synthesis

Current electroencephalography (EEG)-based dream detection relies on power spectral density (PSD) and statistical moment features, achieving a state-of-the-art area under the receiver operating characteristic curve (AUC) of approximately 0.70 on the DREAM database (Wong et al., 2025, Nature Communications). We introduce PHINN-EEG (Persistent Homology Inspired Neural Network for EEG), the first topological time-series framework for dream mentation analysis. Using sliding-window Takens delay embeddings and Vietoris-Rips filtrations on multichannel pre-awakening EEG epochs, we extract Dynamic Betti Curves that characterize the geometric architecture of neural activity, not merely its energy. These topological invariants, combined with topology-conditioned flow matching, are analytically projected to outperform existing PSD and catch22 benchmarks, targeting AUC = 0.82-0.90 on the 1,462-awakening open-access subset of the DREAM database (drawn from a full registry of 3,191 total awakenings from 263 participants across 20 independent laboratories). We further introduce a topology-conditioned rectified flow model for dream-state EEG synthesis-with a spectral-conditioned flow model of comparable feature dimensionality as an additional ablation baseline to isolate the value of topological conditioning specifically-and propose a set of candidate Betti transition archetypes linking topology to phenomenological dream report categories, presented as an exploratory hypothesis space pending empirical validation. If validated, this work represents a paradigm shift from spectral energy to phase-space geometry in neural rare-event detection, with potential future implications for wearable BCI dream monitoring.

AX2026-07-10T17:57:03Z

Scalable Visual Pretraining for Language Intelligence

The rapid progress of large foundation models has been driven predominantly by pretraining on large-scale text corpora. However, many forms of knowledge are conveyed through visual representations, where figures, typeset equations, and page layouts carry rich information that cannot be faithfully or completely captured by text alone. Yet current pretraining approaches discard these visual cues by converting visually rich sources, such as documents and web pages, into plain text for learning language intelligence. This paper challenges the default assumption that language models must be trained on text-only representations and shows that Visual Pretraining is a scalable learner for foundation model intelligence. To this end, we conduct a systematic study of unsupervised visual pretraining paradigms that directly leverage visual documents without text extraction. Across multiple backbones and benchmarks, visual pretraining on the same underlying corpora consistently outperforms text-only pretraining, offering an efficient pathway to scalable language intelligence.

AX2026-07-10T17:53:37Z

Evolution of Accuracy and Visual-Cognitive Errors in a Decade of Vision-Language AI Models

Vision language models (VLMs) have made remarkable progress in visual reasoning during the last decade. Most evaluations have used simple scenes (MS-COCO) that do not showcase complex human interactions or behaviors, only a handful of non-curated human descriptions as a benchmark, and have not focused on understanding the model's error types. Here, we introduce the Complex Social Behavior (CSB) dataset, containing 100 images depicting complex social interactions/behaviors. We analyze the progression of scene descriptions over a decade (2017-2025) of VLMs (four pre-Multimodal Large Language Models, MLLMs, and five MLLMs). We evaluate the accuracy of the models and 20 human descriptions relative to a gold standard on the CSB dataset and on a sample from MS-COCO. We analyzed five visual-cognitive error types: object detection, recognition, hallucination, scene understanding, and spatial dependence. The CSB dataset showed a more pronounced improvement than MS-COCO in scene description accuracy, with pre-MLLMs achieving much lower accuracy than the bottom-ranked human descriptions and MLLMs attaining accuracies similar to the top-ranked human descriptions. We show that MLLMs have eliminated the gap in scene description accuracy between simpler MS-COCO scenes and scenes depicting complex behaviors (CSB). MLLMs have almost eliminated all error types in our tested datasets, except for occasionally relying on different image regions for scene descriptions than humans do (spatial dependence error). We also show that detection, recognition, and hallucination errors have the highest impact on scene description accuracy. Together, our findings provide a more thorough evaluation of how visual language models have advanced over the last decade.

AX2026-07-10T17:52:29Z

VEXAIoT: Autonomous IoT Vulnerability EXploitation using AI Agents

Internet of Things (IoT) systems are inherently vulnerable due to constrained hardware, outdated firmware, and insecure default configurations, creating a need for scalable and adaptive security testing approaches. While recent adoptions of Large Language Model (LLM) agents have demonstrated promise in penetration testing and Capture-the-Flag (CTF) environments, their application to IoT specific vulnerabilities remains unexplored. This paper presents an autonomous multi-agent framework, referred to as Vulnerability EXploitation using AI Agents (VEXAIoT), for vulnerability discovery and exploitation in IoT environments using LLM-based reasoning and offensive security tools. The framework combines a vulnerability detection agent and an attack execution agent to perform reconnaissance, plan attack sequences, and execute exploits against vulnerable IoT services. The system is evaluated in IoTGoat and Metasploitable environments across ten attack scenarios mapped to OWASP IoT vulnerabilities. Experimental results show attack success rate of up to 100% with low token overhead and average execution times under two minutes for most attacks. Across 260 attack executions, VEXAIoT achieves a 95.0% overall success rate, including 94.5% success in IoTGoat and 96.7% success in Metasploitable2. These results demonstrate the potential for LLM-driven agents to automate IoT vulnerability assessment and offensive security workflows in controlled environments

AX2026-07-10T17:47:38Z

ConceptSMILE: Auditing the Trustworthiness of Concept-Based Explainable AI

Concept-based explainable artificial intelligence (AI) can make model reasoning more human-understandable, but concept-level outputs are not automatically trustworthy. We introduce ConceptSMILE, a model-agnostic perturbation-based auditing framework for evaluating the reliability of concept-based explanations. Rather than replacing SMILE, ConceptSMILE extends its perturbation-based logic from feature- or region-level attribution to the auditing of human-understandable concept explanations. The framework perturbs input regions, measures concept-response shifts, applies locality weighting, and fits an XGBoost surrogate to approximate local concept behaviour. Reliability is assessed through attribution accuracy, surrogate fidelity, faithfulness, stability, and consistency. We evaluate ConceptSMILE on retinal fundus images by comparing MedSAM-derived visual concepts with VLM-based semantic concepts. Results show that reliability varies across concepts and pathways: MedSAM achieves stronger spatial attribution and the highest surrogate fidelity ($R^2 = 0.8503$, $R_w^2 = 0.8465$), while the VLM pathway shows stronger vessel faithfulness and stronger stability under selected artefact conditions. ConceptSMILE provides an independent audit layer for evaluating the trustworthiness of concept-based XAI.

Patent Filings5

PT2020-06-02

Method and system for robotic adaptive production

Some non-limiting examples follow. In modern <b>robotic automation</b>, the part that is dealt with could be very complex and have several features, e.g., holes, on it. The relationship between the features can be used to efficiently learn the robot and/or part positions or other process parameters.

PT2014-12-10

Method for teaching a robot movement

Method for teaching a robot movement (84 - 88 - 90 - 92) using a system comprising . a robot (36, 94), . a robot controller (34, 96) with at least an automatic mode and a teach mode, . a programmable logic controller (PLC) (32) which is connected (38) to the robot controller (34, 96), whereas the …

PT2019-05-07

Method and system for programming a robot

The invention relates to a method for programming a robot, in particular a robot comprising a robotic arm, in which method a movement to be performed by the robot is set up preferably in a robot programme by means of a predefined motion template, the motion template is selected from a database …

PT2025-10-03

Input shaping control of a robotic arm in different reference spaces

A robotic controller for controlling a robotic arm is disclosed, the robotic controller comprising: a first spatial shaping module configured to provide a shaped first-space target motion by convolving a first-space target motion with a pulse train, wherein the first-space target motion defines a …

PT2026-04-28

Numerical control systems and control methods for industrial machinery

The numerical control system (1) includes a numerical control device (5) and a robot control device (6). The numerical control device (5) includes: a robot instruction generation unit (55) that generates robot instructions for each robot instruction block; a robot program start instruction unit ( …

Accelerating Keywords15

#01

airflow

7.75

8↑ / base 1.1

#02

b-spline

7.66

5↑ / base 0.7

#03

iot

7.59

7↑ / base 1.0

#04

conceptsmile

7.50

5↑ / base 0.7

#05

dream

7.46

7↑ / base 1.0

#06

pac-act

7.45

5↑ / base 0.7

#07

visual pretraining

7.44

4↑ / base 0.6

#08

attack

7.42

5↑ / base 0.7

#09

vulnerability

7.42

5↑ / base 0.7

#10

descriptions

7.41

6↑ / base 0.9

#11

concept-based

7.40

4↑ / base 0.6

#12

bsp

7.40

3↑ / base 0.4

#13

surrogate

7.39

5↑ / base 0.7

#14

aerial

7.38

4↑ / base 0.6

#15

exposure

7.38

4↑ / base 0.6

Market Movers5

#01 ARXIV

arXiv

VEXAIoT: Autonomous IoT Vulnerability EXploitation using AI Agents

IMPACT

41.05

MSFT$385.10 (+0.19%) GOOGL$357.18 (-0.48%) META$669.21 (+5.97%) NVDA$210.96 (+4.03%) MDB$342.08 (-5.73%) PATH$11.68 (-1.02%) BBAI$3.27 (-1.51%) AI$8.95 (-0.67%) novelty spike·multi-ticker·large-cap exposure·patent application·safety/alignment

Why it matters — Rising momentum suggests near-term attention and follow-on activity.

Why it matters — Impacts multiple players/supply chain; effects may propagate.

#02 ARXIV

arXiv

ConceptSMILE: Auditing the Trustworthiness of Concept-Based Explainable AI

IMPACT

39.38

NVDA$210.96 (+4.03%) MSFT$385.10 (+0.19%) GOOGL$357.18 (-0.48%) META$669.21 (+5.97%) novelty spike·multi-ticker·large-cap exposure·safety/alignment·hardware

Why it matters — Rising momentum suggests near-term attention and follow-on activity.

Why it matters — Impacts multiple players/supply chain; effects may propagate.

#03 ARXIV

arXiv

Evolution of Accuracy and Visual-Cognitive Errors in a Decade of Vision-Language AI Models

IMPACT

37.72

MSFT$385.10 (+0.19%) GOOGL$357.18 (-0.48%) META$669.21 (+5.97%) NVDA$210.96 (+4.03%) multi-ticker·large-cap exposure·benchmark lead·safety/alignment·data/training

Why it matters — Impacts multiple players/supply chain; effects may propagate.

Why it matters — Large-cap ties can amplify market impact and adoption.

#04 ARXIV

arXiv

Scalable Visual Pretraining for Language Intelligence

IMPACT

28.85

MDB$342.08 (-5.73%) MSFT$385.10 (+0.19%) GOOGL$357.18 (-0.48%) ARM$323.39 (-1.37%) NVDA$210.96 (+4.03%) PLTR$126.79 (-1.74%) AI$8.95 (-0.67%) SMCI$28.31 (+0.25%) multi-ticker·large-cap exposure·benchmark lead·efficiency/cost·real-time/edge

Why it matters — Impacts multiple players/supply chain; effects may propagate.

Why it matters — Large-cap ties can amplify market impact and adoption.

#05 PATENTS

Patents

Numerical control systems and control methods for industrial machinery

IMPACT

27.35

MSFT$385.10 (+0.19%) GOOGL$357.18 (-0.48%) NVDA$210.96 (+4.03%) multi-ticker·large-cap exposure

Why it matters — Impacts multiple players/supply chain; effects may propagate.

Why it matters — Large-cap ties can amplify market impact and adoption.

Equity Signals7

AMD

15.20

airflow, visual pretraining

GOOGL

15.20

airflow, visual pretraining

Term	Last	7dΔ	Corr
airflow	2.0	-0.07	-0.171
b-spline	0.0	—	—
iot	2.0	+0.07	-0.154
conceptsmile	0.0	—	—
dream	56.0	+0.08	-0.096
pac-act	0.0	—	—
visual pretraining	0.0	—	—
attack	100.0	+0.11	-0.004