AI-Powered Video Enhancement

AI Video Processing
at Massive Scale

GPUNet leverages cutting-edge AI models like Real-ESRGAN, RIFE, and DAIN to upscale, interpolate, and enhance video across distributed GPU clusters at unprecedented speed.

1k+
Concurrent Workers
15 FPS
4K Upscaling
<100ms
Task Latency
5
Platforms

Built for Scale.
Designed for Performance.

Every component of GPUNet has been engineered to maximise throughput and minimise latency across distributed GPU workloads.

Push-Based Task Distribution

Bidirectional gRPC streaming pushes tasks directly to idle workers with sub-millisecond overhead. No polling, no wasted cycles.

AI Video Upscaling

Real-ESRGAN and custom ONNX models transform SD content to 4K with stunning clarity. Supports 2x and 4x upscaling modes.

Frame Interpolation

Generate intermediate frames using RIFE, DAIN, or SVP methods. Boost video frame rates for smoother, more cinematic playback.

GPU Auto-Tuning

Automatic detection of VRAM capacity, compute capability, and optimal batch sizes. Every worker runs at peak efficiency — zero configuration.

Master-Master HA

Raft-inspired cluster replication with leader election, log synchronisation, and vector clocks. No single point of failure.

Token Economy

Workers earn tokens for each processed frame with performance bonuses. Built-in incentive system for distributed contributor networks.

Transform Your Video Content

From pixelated archives to crystal-clear 4K — powered by distributed GPU computing.

AI Video Upscaling - before and after comparison

AI Video Upscaling

Transform SD content to stunning 4K with Real-ESRGAN.

Frame Interpolation visualization

Frame Interpolation

Generate smooth motion with RIFE and DAIN algorithms.

Distributed GPU Processing visualization

Distributed Processing

Harness multiple GPUs across your infrastructure.

From Upload to Download
in Six Steps

A fully automated pipeline handles every stage of distributed video processing.

1

Video Upload

User uploads a video through the web portal, selects an upscaling model (e.g. Real-ESRGAN 4x), and optionally enables frame interpolation.

2

Frame Extraction

The master server uses FFmpeg to split the video into individual PNG frames. Each frame becomes an independent task, queued in Redis by priority.

3

Task Distribution

The push-based distributor identifies idle GPU workers via their bidirectional gRPC stream and sends frame data directly — least-loaded worker first.

4

GPU Processing

Each worker runs the ONNX model on its GPU (CUDA, CoreML, or CPU fallback) with auto-tuned tile sizes and batch parameters. Post-processing filters are applied automatically.

5

Result Collection

Processed frames stream back to the server via chunked gRPC uploads. The server tracks progress in real time and awards tokens to the contributing worker.

6

Video Assembly

Once all frames are collected, FFmpeg merges them into the final video. The user receives a notification and can download the enhanced result.

Designed for
Massive Parallelism

A layered architecture separating user interfaces, orchestration, storage, and compute.

GPUNet Cluster Architecture

Bidirectional gRPC streaming with hybrid task distribution

🌐
User Portal
Leptos / WASM
📊
Admin Dashboard
Real-time Monitoring
🔗
REST / gRPC API
External Integrations
HTTPS + gRPC-Web
Master Server
Tonic gRPC · Axum REST · Task Distributor
SQL
Sync
Cache
🗄️
PostgreSQL
Jobs · Tasks · Tokens
🔄
Replica
Master-Master HA
Redis
Queue · Sessions
gRPC Stream · TLS
RTX 4090
CUDA · Processing
RTX 3090
CUDA · Idle
M2 Pro
CoreML · Processing
CPU
Fallback · Idle

Real-World Performance

Measured with Real-ESRGAN 4x model, upscaling 1080p frames to 4K resolution.

GPU VRAM Throughput Time per Frame
NVIDIA RTX 4090 0 GB ~0 FPS 0 ms
NVIDIA RTX 4080 0 GB ~0 FPS 0 ms
NVIDIA RTX 3090 0 GB ~0 FPS 0 ms
NVIDIA RTX 4070 Ti 0 GB ~0 FPS 0 ms
Apple M2 Pro 0 GB ~0 FPS 0 ms
CPU (16 cores) ~0 FPS 0 ms

Run Workers Anywhere

GPUNet workers run natively across every major operating system and architecture.

🐧
Linux x86_64
Primary platform
NVIDIA CUDA 12.x
🐧
Linux ARM64
Raspberry Pi
Edge compute
🪟
Windows
WSL2 with CUDA
GPU passthrough
🍎
macOS
Apple Silicon
CoreML / Metal
😈
FreeBSD
Server workloads
CPU fallback

Zero-Trust by Design

Every connection is encrypted, every request is signed, every worker is verified.

Ed25519 Cryptographic Auth

Workers authenticate using elliptic-curve keypairs with challenge-response verification. No passwords, no shared secrets.

TLS Encryption

All gRPC and REST connections are encrypted end-to-end. Frame data in transit remains confidential across the network.

Request Signing

Every worker request is signed with a private key. The server verifies signatures before processing — tamper-proof communication.

Rate Limiting & Validation

Built-in rate limiting protects against abuse. Strict input validation on all endpoints prevents injection and overflow attacks.

State-of-the-Art Neural Networks

GPUNet supports leading AI models for video enhancement, all running via optimised ONNX Runtime.

🎨
Real-ESRGAN
4x upscaling with detail recovery
🎬
RIFE
Real-time frame interpolation
🔮
DAIN
Depth-aware interpolation
SwinIR
Image restoration transformer
🖼️
ESPCN
Lightweight super-resolution
🎯
EDSR
Enhanced deep residual SR
🧩
Custom ONNX
Bring your own models
TensorRT
NVIDIA optimised inference

Built on Proven Foundations

A modern Rust-native stack optimised for performance, safety, and reliability.

🦀
Rust
Memory-safe systems language
Tokio
Async runtime
📡
Tonic gRPC
High-performance RPC
🧠
ONNX Runtime
AI model inference
🗄️
PostgreSQL
Persistent storage
Redis
Queue & caching
🎬
FFmpeg
Video processing
🌐
Leptos
WASM web framework