MitraSETI

Rust-Accelerated Radio Signal Analysis with ML Classification

v0.2.0 — Released

MitraSETI in Action

Real-time signal processing, ML classification, and interactive visualization.

MitraSETI desktop application demo

Intelligent Signal Processing Pipeline

From raw Breakthrough Listen filterbank data to ML-classified candidates — fully automated, self-correcting, and 45x faster than the standard.

MitraSETI signal processing pipeline

Performance: MitraSETI vs turboSETI

Measured on real Breakthrough Listen observation data. No synthetic files, no cherry-picked runs.

MitraSETI vs turboSETI performance comparison

45x speedup on Voyager-1 data. MitraSETI processes a million-channel HDF5 observation in 0.06 seconds — turboSETI takes 2.53 seconds for the same file. The Rust de-Doppler engine distributes drift-rate trials across all CPU cores using rayon parallelism, replacing turboSETI's single-threaded Python loop.

v0.2.0 introduces the Taylor tree algorithm — an O(N log N) de-Doppler search that achieves 4.3x additional speedup over brute-force on Voyager-1 data. Combined with HDBSCAN density-based clustering and adaptive spectral kurtosis RFI excision, the pipeline now processes real Breakthrough Listen data with publication-grade sensitivity.

turboSETI reports raw hits. MitraSETI reports classified candidates — signals that survived rule-based filtering, CNN+Transformer inference, and out-of-distribution analysis. One high-confidence candidate is more actionable than three unscreened hits.

What MitraSETI Found

Autonomous analysis across 2,200 Breakthrough Listen observations with full ML pipeline.

2,200
BL Files Processed
Breakthrough Listen HDF5 + filterbank
16.8M
Raw Signals
De-Doppler detections across all files
275
ML-Verified Candidates
Survived CNN+Transformer + OOD scoring
45x
Faster Than turboSETI
Rust de-Doppler with rayon parallelism
11
Verified Candidates
Across 9 target categories
21.6h
Total Runtime
Full pipeline end-to-end
2,175
Post-Filter Signals
After rule-based + ML rejection
Aggressive
Pipeline Mode
Lower SNR thresholds, maximum coverage

See It in Action

MitraSETI ships with a full-featured PyQt5 desktop app and a FastAPI web interface for real-time signal monitoring.

MitraSETI Waterfall Viewer

Waterfall Viewer — spectrogram display with drift-rate overlay and zoom controls

MitraSETI Signal Gallery

Signal Gallery — ML-classified detections with confidence scores

MitraSETI Sky Radar

Sky Radar — interactive celestial map with radio + optical overlay

How It Works

A hybrid Rust+Python architecture. Rust handles the compute-intensive DSP. Python handles ML inference and orchestration.

Load BL Data
Rust De-Doppler
Feature Extraction
CNN+Transformer
OOD Detection
RFI Rejection
Candidate Verification
Rust Python PyTorch PyO3 rayon FastAPI PyQt5

Hybrid Rust+Python architecture. Rust handles DSP, Python handles ML inference.

v0.2.0 — Publication-Ready Pipeline

13 major features for scientifically rigorous radio signal analysis, built for reproducibility and interoperability.

Taylor Tree De-Doppler

O(N log N) algorithm — 4.3x faster than brute-force on Voyager-1. Based on Taylor (1974).

HDBSCAN Clustering

Density-based hit deduplication replaces greedy method for robust false-positive reduction.

Spectral Kurtosis RFI

Adaptive thresholds (median + MAD) for RFI excision. Nita & Gary (2010) method.

FITS Catalog Export

Standard astronomical format via astropy. Compatible with TOPCAT, Aladin, and VO tools.

Known RFI Database

27 cataloged terrestrial interference sources for automated labeling (GPS, Iridium, WiFi, etc.).

Cross-Epoch Persistence

Track recurring signals across multiple observations of the same target for credibility scoring.

Astropy Cross-Matching

SkyCoord KD-tree matching between MitraSETI radio detections and AstroLens optical anomalies.

Click CLI

mitraseti search, stream, benchmark, export, crossmatch, report.

ON/OFF Cadence Filter

Standard RFI rejection using observation cadence patterns (ABACAD).

Cloud Architecture for MitraSETI

Production deployment on AWS — from Breakthrough Listen data ingestion to ML-classified candidates at scale.

MitraSETI cloud architecture on AWS
Amazon S3 — Data Lake
Stores Breakthrough Listen filterbank (.fil) and HDF5 (.h5) files. Object lifecycle policies tier cold data to S3 Glacier.
Amazon SQS — Streaming
Event-driven file arrival notifications. Decouples data ingestion from processing workers for back-pressure handling.
ECS / EKS — Rust Workers
Containerised Rust de-Doppler workers with rayon parallelism. Auto-scales based on queue depth and CPU utilisation.
AWS Step Functions
Orchestrates the full pipeline: ingest → de-Doppler → feature extraction → ML inference → verification.
Amazon SageMaker — ML
Hosts CNN+Transformer classifier and Spectral OOD detector on GPU inference endpoints. Batch transform for large runs.
Amazon DynamoDB
Low-latency metadata store for candidate signals, processing status, and cross-reference results.
Amazon S3 — Spectrograms
Stores generated spectrogram images for candidate signals. Served via CloudFront for web UI access.
RDS / Aurora
Relational store for verified candidates, observation metadata, and historical analysis records.
CloudWatch + SNS
Pipeline health metrics, processing throughput dashboards, and real-time SNS alerts for high-confidence detections.

Data Flow: Inputs, Processing, Outputs

Inputs
  • Breakthrough Listen filterbank / HDF5 archives
  • Observation metadata (RA, Dec, source name)
  • Pre-trained model artifacts (CNN+Transformer, OOD)
  • Catalog data (SIMBAD, NVSS, FIRST, Pulsar)
Processing
  • Step Functions orchestrates multi-stage pipeline
  • ECS Rust workers run de-Doppler with rayon
  • SageMaker runs CNN+Transformer inference
  • Spectral OOD detector flags anomalies
  • Catalog cross-reference and RFI rejection
Outputs
  • Ranked ML-classified candidates with confidence
  • 9-class signal taxonomy with OOD scores
  • Spectrogram visualisations per candidate
  • SNS alerts for high-confidence detections
  • CloudWatch dashboards and throughput metrics

MitraSETI Cloud — Try It Free

MitraSETI Cloud is a fully managed SaaS platform for radio astronomy signal analysis. Upload your observation data, run automated detection and ML classification pipelines, and visualise results — all in your browser. No infrastructure setup required.

Upload & Process

Upload .h5, .fil, .fits, and .hdf5 files up to 1 GB. Automated pipelines handle the rest.

📊

ML Classification

AstroLens-powered CNN + Transformer models classify signals and flag anomalies automatically.

🔗

Interactive Visualisation

Waterfall plots, signal galleries, and pipeline monitoring — real-time in your browser.

Try MitraSETI Cloud — Free Explorer Tier

Free tier available • No credit card required • Researcher and Institution tiers for larger workloads

Explore the Code

MitraSETI is open-source under the MIT License. Contributions, feedback, and collaboration welcome.

View MitraSETI on GitHub