Aya Offline

Offline multilingual LLM chat app. Runs a 3.5B parameter model (Aya) entirely on-device with no internet connection required.

Built with Flutter + a custom C inference engine with SIMD optimizations (AVX2 on x86, NEON on ARM64).

Platforms

Platform	Status	Engine
Android	✅	Native C (NDK/CMake)
iOS	✅	Native C (CocoaPods)
macOS	✅	Native C (CocoaPods)
Windows	✅	Native C (DLL)
Web	✅	WASM + SSE fallback

Prerequisites

Flutter SDK 3.11+
Model file: tiny-aya-global-q4_k_m.gguf (~2 GB)

Quick Start

git clone https://github.com/Complexity-ML/aya-offline.git
cd aya-offline
flutter pub get

Android

Push the model to the device:

adb push tiny-aya-global-q4_k_m.gguf /sdcard/Download/

Build and install:

flutter build apk --debug
adb install build/app/outputs/flutter-apk/app-debug.apk

Launch the app and grant "All files access" when prompted.

iOS

Requires a Mac with Xcode installed.

Transfer the model to the device (via Finder/iTunes shared files, or use a file manager app to place it in /var/mobile/Documents/).
Install CocoaPods dependencies and run:

cd ios && pod install && cd ..
flutter run -d <iphone-device-id>

macOS

Place the model file in the project root directory.
Install CocoaPods dependencies and run:

cd macos && pod install && cd ..
flutter run -d macos

Windows

Place the model file in the project root directory.
Build the C engine as DLL (requires MSYS2/MinGW or Visual Studio):

cd engine-c
gcc -shared -O2 -mavx2 -mfma -o aya_engine.dll src/gguf.c src/model.c src/aya_api.c -DAYA_BUILD_DLL -lm

Copy aya_engine.dll next to the Flutter executable and run:

flutter run -d windows

Web

Start the local inference server:

cd engine-c
./aya_server.exe tiny-aya-global-q4_k_m.gguf

In another terminal:

flutter run -d chrome

The web version connects to the local server via SSE (Server-Sent Events).

Architecture

aya-offline/
├── engine-c/          # C inference engine
│   ├── src/
│   │   ├── aya_api.c  # Public API (init, generate, free)
│   │   ├── model.c    # Transformer forward pass
│   │   ├── gguf.c     # GGUF file parser
│   │   └── quant.h    # Q4_K/Q6_K quantization + SIMD
│   └── CMakeLists.txt # Android NDK build
├── lib/
│   ├── engine/
│   │   ├── native_engine.dart  # FFI bindings (Android/iOS/desktop)
│   │   ├── sse_engine.dart     # SSE client (web)
│   │   └── engine.dart         # Platform abstraction
│   └── chat/
│       └── chat_screen.dart    # Chat UI
├── ios/Podfile         # iOS CocoaPods config
├── macos/Podfile       # macOS CocoaPods config
└── android/app/build.gradle.kts  # Android CMake/NDK config

License

INL 2025

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
android		android
engine-c		engine-c
ios		ios
lib		lib
macos		macos
test		test
web		web
windows		windows
.gitignore		.gitignore
.metadata		.metadata
README.md		README.md
analysis_options.yaml		analysis_options.yaml
pubspec.lock		pubspec.lock
pubspec.yaml		pubspec.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Aya Offline

Platforms

Prerequisites

Quick Start

Android

iOS

macOS

Windows

Web

Architecture

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Aya Offline

Platforms

Prerequisites

Quick Start

Android

iOS

macOS

Windows

Web

Architecture

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages