Edge Infer

EdgeInfer enables efficient edge intelligence by running small AI models, such as embedding models and other ONNX models, on resource-constrained devices (Android, iOS, or MCUs) for real-time decision-making.

Architecture:

EdgeMind Architecture

Platform support (by design):

  • Android, iOS
  • Linux, Windows, Mac
  • Raspberry Pi, MCU

Todos

  • [x] Inference wrapper
    • [x] ONNX Runtime
    • [x] Tokenizer
  • [x] UniFFI, a toolkit for building cross-platform software components in Rust.
  • [ ] gRPC server with tonic
  • [ ] Multiple OS support:
    • Desktop: Windows, Mac, Linux (x86, x64)
    • Mobile: Android, iOS, Linux (ARM)
    • Embedded Linux (ARM).
  • [ ] Flexible Configuration: Easily configurable via command-line parameters, including listening port, batch size, thread count, and others.
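The flexible-configuration item is still open, so the following is only a minimal sketch of how such command-line parameters could be parsed with the Rust standard library. The `ServerConfig` struct, the flag names (`--port`, `--batch-size`, `--threads`), and the default values are all hypothetical and are not part of EdgeInfer's actual interface.

```rust
use std::str::FromStr;

/// Hypothetical runtime settings; field names and defaults are assumptions,
/// not values taken from EdgeInfer.
#[derive(Debug, Clone, PartialEq)]
pub struct ServerConfig {
    pub port: u16,
    pub batch_size: usize,
    pub threads: usize,
}

impl Default for ServerConfig {
    fn default() -> Self {
        Self { port: 50051, batch_size: 8, threads: 4 }
    }
}

/// Parse the value that follows a flag, reporting which flag was malformed.
fn parse_value<T: FromStr>(flag: &str, value: Option<String>) -> Result<T, String> {
    let raw = value.ok_or_else(|| format!("missing value for {flag}"))?;
    raw.parse::<T>()
        .map_err(|_| format!("invalid value for {flag}: {raw}"))
}

/// Build a config from `--port N --batch-size N --threads N` style arguments.
/// Flags that are not given keep their default values.
pub fn parse_args<I>(args: I) -> Result<ServerConfig, String>
where
    I: IntoIterator<Item = String>,
{
    let mut config = ServerConfig::default();
    let mut iter = args.into_iter();
    while let Some(flag) = iter.next() {
        match flag.as_str() {
            "--port" => config.port = parse_value(&flag, iter.next())?,
            "--batch-size" => config.batch_size = parse_value(&flag, iter.next())?,
            "--threads" => config.threads = parse_value(&flag, iter.next())?,
            other => return Err(format!("unknown flag: {other}")),
        }
    }
    Ok(config)
}
```

In a binary this could be called as `parse_args(std::env::args().skip(1))`. A dedicated argument-parsing crate would likely be a better fit once the set of options grows; the hand-rolled version is used here only to keep the sketch dependency-free.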

Use cases

  • [ ] SearchEverywhere: Search for anything, anywhere, anytime.
  • [ ] Visualization
  • [ ] AutoComplete
    • Model: Embedding with ??
  • [ ] Summarization
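As an illustration of how an embedding-based use case such as SearchEverywhere might rank results, here is a small sketch that scores stored embeddings against a query embedding by cosine similarity. It assumes the embeddings have already been produced by some model (the README leaves the model unspecified); the function names `cosine_similarity` and `top_k` are hypothetical and not part of EdgeInfer's API.

```rust
/// Cosine similarity between two vectors.
/// Returns 0.0 when the lengths differ or either vector has zero norm.
pub fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    if a.len() != b.len() {
        return 0.0;
    }
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        return 0.0;
    }
    dot / (norm_a * norm_b)
}

/// Rank `(id, embedding)` pairs against a query embedding and return
/// the `k` best matches, highest similarity first.
pub fn top_k<'a>(
    query: &[f32],
    docs: &'a [(String, Vec<f32>)],
    k: usize,
) -> Vec<(&'a str, f32)> {
    let mut scored: Vec<(&str, f32)> = docs
        .iter()
        .map(|(id, emb)| (id.as_str(), cosine_similarity(query, emb)))
        .collect();
    // Sort descending by score; total_cmp gives a total order over f32.
    scored.sort_by(|l, r| r.1.total_cmp(&l.1));
    scored.truncate(k);
    scored
}
```

A linear scan like this is only reasonable for small collections; larger indexes on constrained devices would probably need an approximate nearest-neighbour structure instead.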

Resources

Examples:

  • Modern cross-platform telemetry: Glean

MCU

ToSpike

Not working:

License

This project is licensed under the MIT License. See LICENSE for the full license text.