417 Comments - Last post 31 minutes ago by Kagrayz
361 Comments - Last post 32 minutes ago by Kagrayz
5 Comments - Last post 50 minutes ago by Kratomista
47,369 Comments - Last post 51 minutes ago by misterhaan
9 Comments - Last post 1 hour ago by tungmapu
1,173 Comments - Last post 2 hours ago by Formidolosus
2,653 Comments - Last post 3 hours ago by ToatsMcGoats
43 Comments - Last post 1 minute ago by DrPower
36 Comments - Last post 3 minutes ago by Swordoffury
223 Comments - Last post 41 minutes ago by Hawk189
11,237 Comments - Last post 1 hour ago by JMM72
141 Comments - Last post 1 hour ago by PonBaron
65 Comments - Last post 2 hours ago by nonamebg
10,107 Comments - Last post 3 hours ago by CurryKingWurst
So, a mate of mine starts rambling about this project idea of his — keeping it vague for now 'cause he's gonna try and market it or something — but basically it’s gonna involve AI, voice, the works. Since I’ve been off work recovering from some delightful dental surgery (10/10 don’t recommend), he asked if I could whip up a basic offline AI to help with his prototype.
One week later, in between games and wrangling the kids, I’ve somehow ended up knee-deep in a full-on desktop AI assistant. I’m calling it Version 0.8 for now, with my “MVP” version being 1.0.
Right now it uses FFmpeg, Whisper, LLaMA3, and Coqui TTS. It handles both text and voice input/output, caches WAVs, convos, user settings, and has a few colour themes 'cause who doesn’t love a bit of flair. Currently working on per-conversation caching and trying to make convos reference each other — which is as fun as it sounds.
Also, the AI voice? Sounds like a half-baked call centre operator. Absolutely cooked. I’m adding more voice options soon so it stops sounding like a robo-Karen trying to upsell me internet plans.
Performance-wise, I’ve managed to take voice response from "go make a cuppa" times down to about 6–8 seconds, thanks to streaming chunked WAVs and throwing the GPU at it. Still not lightning, but hey, it’s no longer yelling into the void and waiting for enlightenment.
Anyway, point is — since I was putting together a train anyway, thought I’d ask: anyone got feature ideas? Already blown past what my mate expected, so I’ve got a pretty hefty roadmap going. But I’m all ears for wild suggestions, practical or ridiculous.
Here is your entry to a progressive train. Good Luck and Enjoy ^^
Just finalised the addition of allowing the creation of different conversations, user defined conversation titles, conversational tabbing, persistent / cached conversations and deleting conversations ^^ Currently the entire App is 755 Megabytes. Let's watch that expand >.<
Comment has been collapsed.