Berkeley, CA · Founded 2026

Neural
Intelligence
On-Chain

Merkavian builds cryptographically verified AI infrastructure. We make neural network training trustless through zero-knowledge proofs and decentralized federated learning.

ZK+FL Proof-of-Learning Protocol
5.4M Candles Analyzed
Base. Sepolia Deployed
50+ Parallel Experiments
7× Signal Layers

What we build

Two systems at the frontier of AI and decentralized infrastructure.

Research Protocol · UC Berkeley 2026

PoLChain

A proof-of-learning blockchain protocol that uses zero-knowledge proofs to verify gradient updates from distributed miners — making AI model training cryptographically trustless without a central aggregator.

Halo2 / EZKLFederated Learning Base Sepolia$POL Token
Explore protocol →
Live System · Running 24/7

Autonomous Trading

A self-evolving algorithmic trading system with a 7-layer signal stack, reinforcement learning loop, and evolutionary engine running 50 parallel experiments simultaneously across 15 crypto pairs.

Reinforcement LearningEvolution Engine Kraken APIOracle Cloud
View system →

Core infrastructure

Zero-Knowledge Proofs

Halo2 circuits generated via EZKL verify ML training computations on-chain without revealing model weights or training data. Pixels and logits public; parameters private.

Federated Learning

Four miners train on independent MNIST shards with distinct augmentation strategies. Quality-gated FedAvg merges gradients — 50/50 if challenger beats global, 80/20 otherwise.

On-Chain Verification

TaskManager, POLToken, and EZKL-generated Halo2Verifier deployed on Base Sepolia. Every gradient submission, block finalization, and reward is a verified on-chain transaction.

Evolutionary Strategies

50 parallel sandbox experiments with 8 mutation types. Evaluator runs 3-gate validation every 6h. Graduates enter 24h A/B test against champion before promotion.

Multi-Signal Stack

7 independent signal layers: base rules, MTF alignment, VWAP 2σ bands, funding rate, candlestick patterns, volume profile, geometric aggregation with learned weights.

Safe Learning Pipeline

4-gate validation before any signal weight push: 50+ live trades required, ≤30% change magnitude, <14 days old, RSI threshold match. Human review step before every Oracle push.

Building at the frontier

UC Berkeley research at the intersection of zero-knowledge cryptography and machine learning.

jackson@merkavian.com

PoLChain

A proof-of-learning protocol demonstrating that AI model training can be made cryptographically verifiable on a live blockchain — without a trusted aggregator.

The Problem

Leading AI companies face a fundamental economic crisis. In 2024, OpenAI spent $3B on model training against $3.7B in revenue. Anthropic spent $1.5B against $2.55B. Profitability under centralized training economics is nearly impossible.

Blockchain enables a different model: AI developers issue tokens tied to model development, letting funding come from the market rather than burning capital. Miners earn tokens by contributing verified gradient updates. The critical unsolved problem — without a trusted aggregator, how do you verify miners actually trained on real data?

The Solution

Zero-knowledge proofs make gradient fraud cryptographically impossible. Each submission is paired with a Halo2 ZK proof confirming training occurred correctly — without revealing training data or model weights. Pixels and logits are public. Parameters stay private.

polchain — block #1,204
TaskManager.postTask() · 100 POL escrowed · deadline T+60s
4 miners: Alpha(shard0) Beta(shard1) Gamma(shard2) Delta(shard3)

autoMiner POST /prove/async · Flask EZKL pipeline starts
gen_witness → prove → hex_proof + instances[]

Delta.submitWithProof(gradHash, 94.7, proofBytes, instances)
Halo2Verifier.verifyProof() → true · ZKWorkSubmitted event

finalizeTask() · Winner: Delta · 100 $POL transferred
spawn aggregate.py --task_id 1204 --winner_score 94.7
FedAvg 50/50 (shard_acc > global_acc) → global_model.pth updated

System Architecture

Three layers interact in every 60-second block cycle:

◈ ML / ZK Layer
Flask Prove Server
:5001 · /train · /prove/async · /prove/status · gen_witness → ezkl.prove → proof.json
MNISTNet (model.py)
784→128→64→10 MLP · 4 shards × 10k · focus-digit oversampling · augmentation per shard
aggregate.py
FedAvg post-finalize · 50/50 or 80/20 merge · writes global_model.pth + accuracy_log.json
↑↓ POST /train · /prove/async · spawn aggregate.py · model.pth reads
◈ Off-Chain Node Services
Admin Server
Express :3001 · spawns miningLoop + autoMiner + Flask · SSE log fan-out at /logs
miningLoop.js
60s cadence · postTask → wait → finalizeTask → spawn aggregate.py · reads mode.json
autoMiner.js
Alpha/Beta/Gamma/Delta · one-block-delayed ZK · submitWithProof while next proof generates
↑↓ submitWithProof · finalizeTask · transferFrom · verifyProof · POL reward
◈ Base Sepolia L2 (chainId 84532)
POLToken.sol
ERC-20 · 1,000,000 supply · escrow via transferFrom · reward via transfer
TaskManager.sol
Core protocol · postTask · submitWork · submitWithProof · finalizeTask · calls Verifier
Halo2Verifier.sol
EZKL-generated · verifyProof(bytes, uint256[]) → bool · pairing + polynomial checks in EVM

ZK Proof Pipeline

Converting floating-point ML training into on-chain verifiable cryptography requires a precise pipeline. EZKL quantizes the neural network's operations into integer arithmetic over a finite field — enabling ZK verification while preserving computational integrity.

  • MNISTNet exported to ONNX (opset 11) with dummy input shape [1, 784]
  • ezkl.gen_settings() — public/public/private visibility model; pixels and logits on-chain, weights secret
  • ezkl.calibrate_settings() — adjusts quantization scales for resources target
  • ezkl.compile_circuit() → network.compiled (arithmetic circuit over finite field)
  • ezkl.get_srs() → kzg.srs (KZG structured reference string, one-time download)
  • ezkl.setup() → pk.key + vk.key (Halo2 proving and verifying keys)
  • ezkl.create_evm_verifier() → Verifier.sol → deployed to Base Sepolia
  • Per block: gen_witness → prove → hex_proof + instances → submitWithProof → Halo2Verifier on-chain

The Gradient-Gap Problem

The central research contribution and limitation: EZKL's quantization converts floating-point operations to integer arithmetic over a finite field. This creates a small discrepancy between what the ZK circuit proves and what the neural network actually computed. For MNISTNet this error is inconsequential — the model still converges correctly. For larger production models, this quantization error compounds across layers and could meaningfully degrade proof validity. Solving the gradient-gap problem is the prerequisite for applying PoLChain to frontier models.

Applied Proof-of-Learning: ZK-Verified Federated Learning on a Live Blockchain
Jackson Geiger · UC Berkeley · April 2026
Training AI models at scale requires substantial capital. We propose that blockchain-based federated learning with gradient verification enables a new economic model — one in which AI developers issue tokens that enable decentralized markets to fund model training. PoLChain demonstrates a Proof-of-Learning protocol on a live blockchain, where miners compete to submit gradients verified by Halo2 ZK proofs generated via EZKL.
PREPRINT

April 2026

Autonomous
Trading

A self-evolving algorithmic trading system with a 7-layer signal stack, closed reinforcement learning loop, and evolutionary engine running 50 parallel strategy experiments simultaneously.

Signal Architecture

Every trade consideration passes through seven independent signal layers before execution. Each layer produces a modifier that is geometrically aggregated using learned weights from live trade forensics — reloaded every 100 cycles.

L1
Base Rule Fire
RSI < 45 · MACD cross · Bollinger squeeze · EMA crossover · momentum_buy
base_conf [0.45–0.75]
L2
Multi-Timeframe Gate
5m → resample 15m/1h · alignment check · HARD REJECT if BUY into 1h downtrend
modifier 1.2× or 0.7×
L3
VWAP Position + Pivot Levels
2σ standard deviation bands · S1/S2/R1/R2 pivot proximity · dynamic TP targets resistance
modifier 0.85–1.15
L4
Funding Rate
Binance perp BTC/ETH/SOL · every 2h · EXTREME_LONG avoids crowded longs
modifier 0.6–1.4
L5
Candlestick Pattern Modifier
7 patterns: hammer, engulfing, doji, shooting star, morning star, dark cloud, harami
geometric mean ∈ [0.88–1.15]
L6
Volume Profile + ADX Bucket
High/low volume hour-of-day · ADX 15-20 best bucket for mean reversion (+1.15×)
modifier 0.85–1.15
L7
Signal Aggregation
final = base × geomean(modifier_i ^ weight_i) · weights from signal_weights.json (100-cycle reload)
final_conf [0.10–0.95]
↓ if conf > 0.58
EXECUTE
CooldownManager (30m) → check_correlation_limits (4 max) → open_position() → calculate_dynamic_take_profit()

Evolution Engine

50 parallel sandbox experiments run at all times. The system continuously searches for better strategies through mutation, evaluation, and promotion — closing the loop from live trading back to strategy generation.

Experimenter
8 mutation types · SHA-256 dedupe · refills every 6h
every 6h
Tier 3 Sandbox
50 portfolios × $1k · 10 threads · hourly score recompute
30s cycle
Evaluator
3-gate: perf → backtest → regime diversity · kills or graduates
every 6h +3h
Evolution Engine
24h A/B test · challenger vs champion · 1.15× threshold to promote
24h A/B
Champion
Hot reload · generation tracked · history archived · forensics logged
live trading

Reinforcement Learning Loop

Every closed trade feeds back into the system — adjusting signal weights, improving position sizing, and directing the next generation of experiments toward what's working in live conditions.

Trade Close → Forensics

Every closed trade records 15+ signal fields: mtf_alignment, vwap_position, funding_signal, near_support_at_entry, indicator_regime, hour_utc, signal_source, take_profit_type, and more. Forensics capped at last 500 trades.

Self-Tuner → Signal Weights

Groups trades by signal field value, calculates win rate per group. Updates signal_weights.json after 20 live trades. Weights reload into SignalAggregator every 100 cycles via _maybe_reload_weights(). Effective modifier = 1 + (raw - 1) × weight.

Adaptive Position Sizer

6 multipliers combined: confidence (0.7–1.3×), volatility (ATR%), pair quality score, adaptive (from 20-trade streak window), global consecutive loss protection (0.6× after 5 losses), pair adaptation override. Persisted to position_sizing_state.json.

Experimenter ← Signal Performance

Self-tuner writes signal_performance.json tracking win rate by source (champion, support_bounce, supertrend_breakout, vwap_reversion). Experimenter reads this when computing spawn weights — templates with >58% win rate get +35% spawn weight.

Mac → Oracle Pipeline

A safe learning pipeline validates all data before pushing to the live bot. Signal weight changes require 50+ live trades, ≤30% change magnitude, data <14 days old, and RSI threshold matching the live champion.

learning_pipeline.py --review
Training complete: 50 experiments · 7 periods · 100 Monte Carlo sims/exp
Fee model: 0.26% taker + 0.10% altcoin slippage applied to all results
Fingerprint cache: 12 duplicate mutations skipped instantly

Regime scoring complete:
ROBUST: 2 · SOLID: 5 · TRENDING: 8 · REVERSION: 6 · KILL: 29

Pushing to Oracle:
✅ 50 experiment classifications applied
✅ 12 high-confidence lessons pushed
✅ Oracle sync complete — evaluator will spawn new generation

New experiment generation deploying · next cycle in 6h

Building trustless
AI infrastructure

Merkavian was founded on the belief that the next breakthrough in AI economics is decentralization. The current model — where a handful of companies bear the entire cost of model training — is fundamentally unsustainable. OpenAI spent $3B training models in 2024. That's not a business, it's a subsidy.

We're building the infrastructure layer that makes distributed AI training cryptographically verifiable. PoLChain demonstrates that zero-knowledge proofs can solve the honesty problem in federated learning — making it possible for anyone to contribute compute and earn tokens, without requiring a trusted central aggregator.

Our second system proves we can apply the same rigor to production AI applications. The autonomous trading bot runs 50 parallel strategy experiments simultaneously, continuously learning from live market data through a closed reinforcement learning loop. The same principles — trustless verification, decentralized execution, cryptographic guarantees — applied to real capital.

Merkavian is based in Berkeley, California, founded by an undergraduate researcher at UC Berkeley working at the intersection of zero-knowledge cryptography, federated machine learning, and decentralized systems.

UC Berkeley ZK Cryptography Federated Learning Base Sepolia EZKL / Halo2
Jackson Geiger
Founder · Researcher
UC Berkeley undergraduate researcher focused on the intersection of zero-knowledge cryptography, federated learning, and decentralized AI infrastructure. Author of the PoLChain proof-of-learning protocol. Building systems where cryptographic guarantees replace institutional trust.
Founded2026
LocationBerkeley, CA
StagePre-seed / R&D
FocusAI + Blockchain Infra
ResearchUC Berkeley
Emailjackson@merkavian.com