Orcan VisionTrace GPU Service

GPU-accelerated face recognition and FAISS indexing service for high-performance reverse image search.

Features

  • Batch Face Embedding Extraction: Process multiple images simultaneously using InsightFace on GPU
  • GPU-Accelerated FAISS Indexing: Create high-performance vector indexes for similarity search
  • Image Enhancement: Automatic quality improvement for poor quality inputs (CCTV, low-light images)
  • High-Performance Search: Fast similarity search with adaptive thresholds
  • Scalable Architecture: Optimized for production workloads with automatic scaling

API Endpoints

POST /extract_embeddings_batch

Extract face embeddings from multiple images in parallel.

Request:

{
  "images": ["base64_encoded_image1", "base64_encoded_image2"],
  "enhance_quality": true,
  "aggressive_enhancement": false
}

Response:

{
  "embeddings": [[embedding_vector1], [embedding_vector2]],
  "extraction_info": [{"face_count": 1, "confidence": 0.95}, ...],
  "total_processed": 2,
  "successful": 2
}

POST /create_faiss_index

Create optimized FAISS index on GPU for fast similarity search.

Request:

{
  "embeddings": [[embedding1], [embedding2], ...],
  "dataset_size": 10000,
  "dimension": 512
}

POST /search_faiss

Perform similarity search on FAISS index.

GET /health

Health check endpoint.

Hardware Requirements

  • GPU: NVIDIA A10G or A100 recommended
  • Memory: Minimum 8GB GPU memory
  • CUDA: Compatible with CUDA 11.8+

Performance

  • Face Extraction: 10-20x faster than CPU (0.05-0.1s per image)
  • Index Creation: 5-10x faster than CPU
  • Search Latency: <50ms for most queries
  • Throughput: 50+ images per batch

Use Cases

  • Reverse image search systems
  • Identity verification systems
  • Photo organization and management
  • Security and surveillance applications
  • Digital asset management

Model Details

  • Face Detection: InsightFace RetinaFace
  • Face Recognition: ArcFace embeddings (512-dimensional)
  • Enhancement: Multi-strategy image quality improvement
  • Indexing: Adaptive FAISS index selection based on dataset size

Limitations

  • Requires high-quality face images for best results
  • GPU memory limits batch size for very large images
  • Cold start latency of ~30 seconds on first request

License

MIT License - See LICENSE file for details.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support