Version 0.3.2 | A beautiful TUI for managing MLX models on Apple Silicon
Browse, install, and run MLX-optimized models from Hugging Face with an intuitive terminal interface built with Bubbletea and Lipgloss.
- Browse Models — Access 3000+ MLX models from Hugging Face with pagination
- Unified Search — Search models directly via Hugging Face API with live filtering
- Model Details — View downloads, likes, size, and metadata before installing
- Install Models — Download and set up models with one click
- Run Models — Launch models with MLX OpenAI Server
- Uninstall — Clean removal of models and cache
- Multiple Sources — Browse mlx-community, lmstudio-community, or all models
- Run Multiple Servers — Launch several models on different ports simultaneously
- Real-time Logs — View server output with scrollable viewport
- Server Switching — Switch between running servers to view their logs
- Graceful Shutdown — Stop individual servers or all at once
- Built-in Chat — Converse with running models directly in the TUI
- Conversation History — Messages persist across sessions in JSON files
- Markdown Rendering — AI responses with syntax-highlighted code blocks
- Opencode Support — Launch opencode inline or copy command to clipboard
- Pagination Mode — Browse conversation exchanges one at a time
- 6 Model Type Presets:
  `lm`, `multimodal`, `image-generation`, `image-edit`, `embeddings`, `whisper`
- Interactive Configuration — Single-page settings with live preview
- Command Preview — Review full command before execution
- Flexible Storage — Configure external or local model storage paths
```bash
curl -fsSL https://raw.githubusercontent.com/electroheadfx/efx-face-manager/main/install.sh | bash
```

This will:
- Detect your OS and architecture
- Download the correct binary
- Install to `/usr/local/bin`
- Create a default config with a model directory
Download from Releases:
| Platform | Architecture | Binary |
|---|---|---|
| macOS | Apple Silicon (M1/M2/M3/M4) | efx-face-darwin-arm64 |
| macOS | Intel | efx-face-darwin-amd64 |
| Linux | x86_64 | efx-face-linux-amd64 |
| Linux | ARM64 | efx-face-linux-arm64 |
| Windows | x86_64 | efx-face-windows-amd64.exe |
| Windows | ARM64 | efx-face-windows-arm64.exe |
```bash
# macOS/Linux
chmod +x efx-face-darwin-arm64
sudo mv efx-face-darwin-arm64 /usr/local/bin/efx-face
efx-face
```

Build from source:

```bash
git clone https://github.com/electroheadfx/efx-face-manager.git
cd efx-face-manager
make build
./bin/efx-face
```

Or install with Go:

```bash
go install github.com/electroheadfx/efx-face-manager/cmd/efx-face@latest
```

mlx-openai-server is required to run models. Install it using uv (recommended):
```bash
git clone https://github.com/cubist38/mlx-openai-server.git
cd mlx-openai-server
uv venv && source .venv/bin/activate
uv pip install -e .
```

Add to `.zshrc`:
```zsh
mlx-openai-server() {
  local original_dir="$PWD"
  cd ~/path/to/mlx-openai-server || return
  source .venv/bin/activate
  command mlx-openai-server "$@"
  deactivate
  cd "$original_dir"
}
```

Or use pipx: `pipx install mlx-openai-server`
Install the Hugging Face CLI:

```bash
brew install huggingface-cli
# Optional: hf auth login
```

Required for the Chat Mode and Opencode Integration features in Server Manager:
```bash
# Install opencode CLI
npm install -g opencode
# or: npx opencode
```

Opencode provides AI-powered coding assistance that can connect to your running MLX servers.
The Home Screen is your central hub for all operations:
- Run on Template — Quick-launch popular models with pre-configured optimal settings
- Run an Installed Model — Select any model you've downloaded and configure how to run it
- Install a New Model — Browse and download models from Hugging Face
- Uninstall a Model — Remove models you no longer need
- Settings — Configure storage paths and preferences
Use ↑/↓ to navigate, Enter to select, and q to quit.
Templates are pre-configured model setups optimized for specific use cases:
- GLM-4.7-Flash-8bit — Fast Chinese/English language model with reasoning
- Qwen3-Coder-30B — Optimized for coding tasks
- Qwen3-VL-8B-Thinking — Vision-language multimodal model
- PaddleOCR-VL-1.5 — OCR-focused vision models (8-bit and bf16 variants)
- NVIDIA-Nemotron — General purpose language model
All Templates Configurable: All templates are loaded from ~/.config/efx-face-manager/templates.yaml. You can modify existing templates or add new ones. See templates.example.yaml for the format.
Select a template and press Enter to proceed to configuration.
The Configuration Panel shows everything at a glance:
Left Column — Configure Options:
- Context length, tool parsers, reasoning parsers
- Debug mode, trust remote code settings
- Port, host, and log level
Right Column — Option Setup:
- Shows available choices for the selected option
- Press `Enter` to edit a value
- Changes update the command preview in real time
Action Bar (top):
- `Run` — Launch the server with current settings
- `Trust: off/on` — Quick toggle for trust_remote_code
- `Cancel` — Return to previous screen
Use Tab to switch between panels, ↑/↓ to navigate options.
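As an illustration, a command preview for a text model might resemble the sketch below. The flag names are assumptions about mlx-openai-server's `launch` subcommand, not output copied from the tool — verify against `mlx-openai-server launch --help` for your version:

```bash
# Illustrative sketch only — the TUI assembles the real command for you.
# Flag names are assumptions about mlx-openai-server's `launch` subcommand.
CMD="mlx-openai-server launch \
  --model-path mlx-community/Qwen3-0.6B-4bit \
  --model-type lm \
  --port 8000"
echo "$CMD"
```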
Browse all models installed in your storage directory:
- Models are displayed with their full names
- Use `↑/↓` to scroll through the list
- Press `Enter` to select a model
Choose the model type that matches your use case:
| Type | Description |
|---|---|
| `lm` | Text-only language models (default) |
| `multimodal` | Vision, audio, and text processing |
| `image-generation` | Image generation (Qwen, Flux) |
| `image-edit` | Image editing capabilities |
| `embeddings` | Text embeddings for RAG |
| `whisper` | Audio transcription |
Each type shows different configuration options in the next screen.
Browse and download models from Hugging Face with the integrated search:
Search Features:
- Live Search — Type to filter models in real-time
- Multiple Sources — Switch between mlx-community, lmstudio-community, or all models using `Tab`
- Model Info — View download counts and model details before installing
- Pagination — Navigate through thousands of models with `←/→`
How to Install:
- Select "Install a New Model" from the home screen
- Browse or search for models using the search bar
- Press `Enter` on a model to view details
- Press `i` to start the download
- The model is automatically symlinked to your storage directory
Shortcuts:
- `Tab` — Switch between model sources (mlx-community, lmstudio, all)
- `/` — Focus search input
- `Enter` — View model details
- `i` — Install selected model
- `o` — Open model page on Hugging Face
- `←/→` — Navigate pages
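Under the hood, "symlinked to your storage directory" means the downloaded snapshot stays in the Hugging Face cache and only a link lands in your model folder. A rough sketch of the idea — the helper name `link_model` and the paths are illustrative, not the manager's actual code:

```bash
# Illustrative sketch of a symlink-based install; not the manager's real code.
link_model() {
  local cache_path="$1"   # e.g. a snapshot inside ~/.cache/huggingface/hub
  local storage_dir="$2"  # e.g. ~/mlx-server
  local name
  name="$(basename "$cache_path")"
  mkdir -p "$storage_dir"
  # The model stays in the cache; only a link appears in storage.
  ln -sfn "$cache_path" "$storage_dir/$name"
  echo "$storage_dir/$name"
}
```

This keeps a single copy of the weights on disk, which is why uninstalling later removes both the link and the cached snapshot.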
When a server is running, the Server Manager displays:
Left Panel — Server Controls:
- List of running servers with model name, port, and type
- Server details (host, uptime)
- Actions: Stop server, Start new, Back to menu
Right Panel — Server Output:
- Real-time scrollable log output
- Shows model loading progress, requests, and responses
- Use `↑/↓` or `g/G` to scroll through logs
Shortcuts:
- `s` — Stop selected server
- `n` — Start a new server
- `c` — Clear logs
- `m` — Return to main menu
Run multiple models simultaneously on different ports:
- Each server runs independently on its own port (8000, 8001, 8002...)
- Select a server from the list to view its logs
- Press `1-9` for quick server selection
- `S` (shift+s) stops ALL running servers
This is perfect for:
- Running a coding model and a general model side by side
- Testing different model configurations
- Serving multiple model types (LLM + embeddings + whisper)
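Each server needs its own free port. As a sketch of how the next available port from 8000 upward could be picked — an illustration using bash's `/dev/tcp`, not the manager's internal logic:

```bash
# Find the first TCP port >= 8000 with no listener (illustrative sketch).
next_free_port() {
  local port="${1:-8000}"
  # bash's /dev/tcp redirection succeeds only if something is listening
  while (exec 3<>"/dev/tcp/127.0.0.1/$port") 2>/dev/null; do
    port=$((port + 1))
  done
  echo "$port"
}
```

Usage: `next_free_port 8000` prints 8000 if it is free, otherwise the first free port above it.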
Chat directly with your running MLX models without leaving the TUI:
The built-in chat interface uses opencode run in non-interactive mode, providing a lightweight way to converse with your models while maintaining conversation history.
Features:
- Conversation History — Messages are saved to JSON files and persist across sessions
- Markdown Rendering — AI responses are rendered with syntax highlighting (Dracula theme)
- Code Blocks — Automatic language detection for code snippets
- Pagination Mode — Browse through conversation exchanges with `Ctrl+P`
- Multiple Conversations — Access chat history with `Ctrl+L`
Chat Panel Layout:
- Top — Chat viewport showing conversation with scrollable history
- Bottom — Input field for typing messages
- Focus Toggle — `Tab` switches focus between viewport and input
Shortcuts:
- `Enter` — Send message
- `Tab` — Toggle focus between viewport and input
- `Ctrl+P` — Toggle pagination mode (browse exchanges one at a time)
- `←/→` — Navigate between exchanges (in pagination mode)
- `Ctrl+L` — Open chat history
- `Ctrl+N` — Start new conversation
- `Ctrl+D` — Delete current conversation
- `↑/↓` — Scroll through messages
- `ESC` — Return to server manager
Seamlessly integrate with opencode for AI-powered coding assistance:
Three Ways to Use Opencode:
1. Copy Command (`o`) — Copies the opencode command with server configuration to your clipboard. Paste it in any terminal to start an opencode session connected to your running MLX server.
2. Run Inline (`O`) — Suspends the TUI and launches opencode directly in your terminal. When you exit opencode, the TUI resumes automatically.
3. Web Interface (`w`) — Launches the opencode web server with your MLX model pre-configured. Opens a browser-based coding assistant interface.
Opencode Web Features:
- Browser-based interface for AI-powered coding
- Automatic model and provider configuration
- Real-time session management
- Select your project folder directly in the browser
How It Works:
The integration automatically configures opencode to connect to your running MLX server by:
- Setting the `OPENCODE_CONFIG_CONTENT` environment variable
- Configuring the correct host, port, and model name
- Using the OpenAI-compatible API endpoint
Example Command:
```bash
OPENCODE_CONFIG_CONTENT='{"model":"mlx-community/Qwen3-0.6B-4bit","provider":{"mlx-community":{...}}}' opencode --model "mlx-community/Qwen3-0.6B-4bit"
```

Shortcuts:
- `o` — Copy opencode command to clipboard
- `O` — Run opencode inline (suspends TUI)
- `w` — Launch opencode web interface
Use Cases:
- Code generation and completion with local models
- Debugging assistance without cloud APIs
- Private and offline AI coding assistance
- Browser-based interface when terminal is not preferred
Configure your model storage location:
Available Paths:
- External — `/Volumes/T7/mlx-server` (for external drives)
- Local — `~/mlx-server` (always available)
- Auto-detect — Automatically uses external if mounted, falls back to local
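The auto-detect behavior can be pictured as a simple mount check. A sketch using the paths listed above — the helper name `pick_storage` is illustrative, not the tool's actual implementation:

```bash
# Use external storage when mounted, otherwise fall back to local (sketch).
pick_storage() {
  local external="${1:-/Volumes/T7/mlx-server}"
  local local_dir="${2:-$HOME/mlx-server}"
  if [ -d "$external" ]; then
    echo "$external"          # external drive is mounted
  else
    mkdir -p "$local_dir"     # local path is always available
    echo "$local_dir"
  fi
}
```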
The status shows:
- ✓ Active with model count
- ✓ Available (empty directory)
- ✗ Not mounted (external drive disconnected)
Settings are saved to ~/.efx-face-manager.conf and persist across sessions.
Remove models you no longer need:
- Select the model to uninstall
- Confirm the deletion
- Both the symlink AND cached files are removed
This frees up disk space by cleaning the Hugging Face cache.
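Conceptually, uninstalling follows the symlink back to the cache and removes both ends. A hedged sketch — the helper name `uninstall_model` and paths are illustrative:

```bash
# Remove a model's storage symlink and the cached files it points to (sketch).
uninstall_model() {
  local link="$1"               # e.g. ~/mlx-server/Qwen3-0.6B-4bit
  local target
  target="$(readlink "$link")" || return 1
  rm -f "$link"                 # drop the symlink in storage
  rm -rf "$target"              # drop the cached snapshot it pointed to
}
```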
| Key | Action |
|---|---|
| `↑/↓` | Navigate lists and options |
| `Enter` | Select / Confirm |
| `Tab` | Switch between panels |
| `←/→` | Switch pages / columns |
| `ESC` | Go back one screen |
| `q` | Quit / Return to home |
| Key | Action |
|---|---|
| `s` | Stop selected server |
| `S` | Stop ALL servers |
| `n` | Start new server |
| `c` | Open chat with server |
| `x` | Clear server logs |
| `o` | Copy opencode command |
| `O` | Run opencode inline |
| `w` | Launch opencode web |
| `g/G` | Jump to top/bottom of logs |
| `1-9` | Quick select server |
| Key | Action |
|---|---|
| `Enter` | Send message |
| `Tab` | Toggle focus (viewport/input) |
| `Ctrl+P` | Toggle pagination mode |
| `←/→` | Navigate exchanges (paginated) |
| `Ctrl+L` | Open chat history |
| `Ctrl+N` | New conversation |
| `Ctrl+D` | Delete conversation |
| Key | Action |
|---|---|
| `/` | Focus search input |
| `Tab` | Switch model source |
| `i` | Install selected model |
| `o` | Open on Hugging Face |
```bash
make build          # Current platform
make build-all      # All 6 platforms
make build-mac      # macOS ARM64 + AMD64
make build-linux    # Linux ARM64 + AMD64
make build-windows  # Windows ARM64 + AMD64
make release        # Build + create archives for GitHub
make help           # Show all options
```

"mlx-openai-server: command not found"
→ Install mlx-openai-server (see Prerequisites)

"Model type not supported"
→ Update mlx-lm: `uv pip install git+https://github.com/ml-explore/mlx-lm.git`

"trust_remote_code" error
→ Enable "Trust remote code" in the config panel or add the `--trust-remote-code` flag
MIT License — Feel free to use, modify, and distribute.
- Hugging Face — Model hosting
- mlx-openai-server — OpenAI-compatible server
- Bubbletea & Lipgloss — TUI framework
- Coded with Qoder
by Laurent Marques