Model Hub
Echo Fleet — 26 models across 5 nodes
Total Models: 26
Online: 7
Avg Latency: 3.7s
Total VRAM: 295.5GB
Speed (<7s) (7)
axe-blade-4b · 2.5GB · Q4_K_M · Online · JL2 Forge · 16GB · code, quick-fix · Avg Response 3.9s · VRAM 2.5GB
axe-phantom-8b · 4.9GB · Q4_K_M · Online · JL2 Forge · 16GB · code, reasoning · Avg Response 5.2s · VRAM 4.9GB
axe-spark-3b · 1.8GB · Q4_K_M · Online · JL2 Forge · 16GB · chat, quick-fix · Avg Response 2.1s · VRAM 1.8GB
axe-razor-7b · 4.1GB · Q4_K_M · Offline · JL3 Vigil · 16GB · code, tool-use · Avg Response 4.7s · VRAM 4.1GB
axe-whisper-3b · 1.9GB · Q4_K_M · Offline · JLa Ghost · 8GB · chat, fast-chat · Avg Response 2.8s · VRAM 1.9GB
axe-pulse-4b · 2.3GB · Q4_K_M · Offline · JLa Ghost · 8GB · quick-fix, tool-use · Avg Response 3.5s · VRAM 2.3GB
axe-echo-7b · 4.2GB · Q4_K_M · Offline · JLa Ghost · 8GB · chat, creative · Avg Response 5.8s · VRAM 4.2GB
Workhorse (7-15s) (9)
axe-sage-14b · 8.4GB · Q4_K_M · Offline · JL1 Nova · 64GB · analysis, research · Avg Response 8.1s · VRAM 8.4GB
axe-oracle-27b · 16.2GB · Q4_K_M · Offline · JL1 Nova · 64GB · reasoning, strategy · Avg Response 12.3s · VRAM 16.2GB
axe-vision-12b · 7.1GB · Q4_K_M · Offline · JL1 Nova · 64GB · vision, multimodal · Avg Response 9.5s · VRAM 7.1GB
axe-scribe-20b · 12.1GB · Q4_K_M · Offline · JL1 Nova · 64GB · creative, long-context · Avg Response 10.8s · VRAM 12.1GB
axe-forge-13b · 7.8GB · Q4_K_M · Offline · JL3 Vigil · 16GB · code, reasoning · Avg Response 8.9s · VRAM 7.8GB
axe-sentinel-11b · 6.5GB · Q4_K_M · Offline · JL3 Vigil · 16GB · chat, analysis · Avg Response 7.2s · VRAM 6.5GB
axe-nova-22b · 13.2GB · Q4_K_M · Offline · JL3 Vigil · 16GB · reasoning, long-context · Avg Response 11.5s · VRAM 13.2GB
axe-muse-15b · 9.1GB · Q4_K_M · Offline · JL1 Nova · 64GB · creative, chat · Avg Response 9.8s · VRAM 9.1GB
axe-prism-18b · 10.8GB · Q4_K_M · Offline · JL3 Vigil · 16GB · multimodal, vision · Avg Response 10.2s · VRAM 10.8GB
Heavy (15s+) (6)
axe-titan-57b · 34.1GB · Q4_K_M · Offline · JL1 Nova · 64GB · complex-reasoning, architecture · Avg Response 18.5s · VRAM 34.1GB
axe-nexus-70b · 41.4GB · Q4_K_M · Offline · JL1 Nova · 64GB · everything · Avg Response 25.0s · VRAM 41.4GB
axe-cortex-72b · 43.2GB · Q4_K_M · Offline · JL1 Nova · 64GB · reasoning, research, analysis · Avg Response 26.3s · VRAM 43.2GB
axe-coder-34b · 20.5GB · Q4_K_M · Offline · JL1 Nova · 64GB · code, architecture · Avg Response 15.2s · VRAM 20.5GB
axe-guardian-32b · 19.3GB · Q4_K_M · Offline · JL1 Nova · 64GB · analysis, strategy · Avg Response 14.7s · VRAM 19.3GB
axe-matrix-40b · 24.1GB · Q4_K_M · Offline · JL1 Nova · 64GB · reasoning, research · Avg Response 17.1s · VRAM 24.1GB
Cloud (4)
Claude Sonnet 4.5 · Anthropic · Online · Cloud · ∞ · code, reasoning, creative · Avg Response 2.1s
Gemini Pro 1.5 · Google · Online · Cloud · ∞ · multimodal, chat, long-context · Avg Response 1.8s
Grok Beta · xAI · Online · Cloud · ∞ · analysis, research · Avg Response 2.5s
Groq Llama 3.3 · Groq · Online · Cloud · ∞ · fast-chat, reasoning · Avg Response 0.8s
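
The tier labels in the listing (Speed <7s, Workhorse 7-15s, Heavy 15s+, Cloud) read like a latency-based bucketing rule. A minimal sketch of such a rule follows, assuming the thresholds apply to the displayed Avg Response and that cloud-hosted models always go to the Cloud tier. The function name `tier_for`, the constants, and the handling of the exact 7s and 15s boundaries are hypothetical and not taken from the dashboard.

```python
# Thresholds copied from the tier labels: Speed (<7s), Workhorse (7-15s), Heavy (15s+).
# Boundary handling (7.0 -> Workhorse, 15.0 -> Heavy) is an assumption.
SPEED_MAX_S = 7.0
WORKHORSE_MAX_S = 15.0


def tier_for(avg_response_s: float, is_cloud: bool = False) -> str:
    """Return a tier name for a model given its average response time in seconds."""
    if is_cloud:
        # Assumption: cloud models are grouped by hosting, not by latency.
        return "Cloud"
    if avg_response_s < SPEED_MAX_S:
        return "Speed"
    if avg_response_s < WORKHORSE_MAX_S:
        return "Workhorse"
    return "Heavy"


if __name__ == "__main__":
    # A few values taken from the listing.
    print(tier_for(3.9))                  # axe-blade-4b
    print(tier_for(12.3))                 # axe-oracle-27b
    print(tier_for(18.5))                 # axe-titan-57b
    print(tier_for(0.8, is_cloud=True))   # Groq Llama 3.3
```

Under this rule a 14.7s model would land in Workhorse, whereas the listing places axe-guardian-32b (14.7s) under Heavy, so the dashboard's tier assignment may be set by hand or by some criterion other than the displayed average.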