The big question is whether Apple can keep shipping new models constantly.
AFAIK the current model is on par with Qwen-3-4B, which is from a year ago [0]. There's a big leap going from last year's Qwen-3-4B to Qwen-3.5-4B or to Gemma 4.
Apple's model is nice since you don't need to download anything else, but I'd rather use the latest model than a model from a year ago.
I'm not sure why that's a question; it's just a downloaded file. You can even watch it download separately when you enable Apple Intelligence (it's not tied to OS updates, from what I can tell).
Of course, I imagine Apple is not going to be the fastest mover here. I'm not even sure they still believe the product will be widely impactful; they may keep it relegated to a short list of popular use cases like photo touch-ups and quick questions to Siri. For me, the most useful parts of Apple's AI don't even require enabling Apple Intelligence.
I'm curious about the multimodal capabilities of the E2B and E4B, and how fast they are.
In ChatGPT right now, you can give the AI an audio and video feed, and it can respond in real time.
Now I wonder if the E2B or E4B is capable enough for this and fast enough to run on an iPhone: basically replicating that experience, but with all the computation (STT, LLM, and TTS) done locally on the phone.
I just made this [0] last week, so I know you can run a real-time voice conversation with an AI on an iPhone, but it would be a totally different experience if it could also process a live camera feed.
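For anyone curious what that local loop could look like, here is a minimal sketch of a single conversational turn using Apple's on-device Speech (STT) and AVFoundation (TTS) APIs. The `LocalLLM` protocol and `VoiceTurn` class are hypothetical names, and the LLM is a stand-in for whatever runtime actually hosts the model (llama.cpp, MLX, etc.); permission prompts and error handling are omitted:

```swift
import Speech        // on-device STT (SFSpeechRecognizer)
import AVFoundation  // on-device TTS (AVSpeechSynthesizer)

// Hypothetical stand-in for the local model runtime (llama.cpp, MLX, ...).
protocol LocalLLM {
    func reply(to prompt: String) async throws -> String
}

final class VoiceTurn {
    private let recognizer = SFSpeechRecognizer(locale: Locale(identifier: "en-US"))
    private let synthesizer = AVSpeechSynthesizer()
    private let llm: LocalLLM

    init(llm: LocalLLM) { self.llm = llm }

    // One turn: recorded audio -> text -> LLM -> spoken reply, all on device.
    func respond(to audioURL: URL) async throws {
        guard let recognizer else { return }  // locale unsupported

        // 1. STT, forced on-device so no audio leaves the phone.
        let request = SFSpeechURLRecognitionRequest(url: audioURL)
        request.requiresOnDeviceRecognition = true
        request.shouldReportPartialResults = false  // single final callback
        let transcript: String = try await withCheckedThrowingContinuation { cont in
            _ = recognizer.recognitionTask(with: request) { result, error in
                if let error {
                    cont.resume(throwing: error)
                } else if let result, result.isFinal {
                    cont.resume(returning: result.bestTranscription.formattedString)
                }
            }
        }

        // 2. The LLM generates the reply text, also locally.
        let reply = try await llm.reply(to: transcript)

        // 3. TTS speaks it back.
        synthesizer.speak(AVSpeechUtterance(string: reply))
    }
}
```

Streaming each stage (partial transcripts into the LLM, tokens into TTS) rather than running them sequentially is what makes it feel real-time, but the sequential version above is the easier place to start.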
I just want to say thanks. Finding out about these kinds of projects that people are working on is what I come to HN for, and what excites me about software engineering!
The E2B and E4B models support 128k context, not 256k, and even with the 128k... it could take a long time to process that much context on most phones, even with the processor running full tilt. At, say, an optimistic 500 tokens/s of prefill, a full 128k prompt means over four minutes before the first output token. It's hard to say without benchmarks, but 128k supported isn't the same as 128k practical. It will be interesting to see.
Forget developing countries: the iPhone is a luxury even in some European countries, where rent is 500+ euros and take-home pay is ~1000. After all the other bills, you're not left with iPhone money, which is why the 100-200 euro models from Chinese brands are doing so well.
It's easier to name the countries where iPhone ISN'T a luxury, as you can count them on very few hands.
Many countries would develop much faster if they weren't bombed or run by puppet dictators propped up by (economically) developed nations (the USA and France keep doing this intensively, while countries like Germany don't mind supporting fascist states). (PS: I'm not woke, not even Marxist.)
Wow it's true. Anthropic actually had me fooled. I saw the GitHub repository and just assumed it was open source. Didn't look at the actual files too closely. There's pretty much nothing there.
So glad I took the time to firejail this thing before running it.
Not really, except that they have a bunch of weird things in the source code and people like to make fun of it. OpenCode/Codex generally don't have this, since they were open-source projects from the get-go.
That's awesome! I've got a similar project for macOS/iOS using the Apple Intelligence models and the on-device STT Transcriber APIs. Do you think the models you're using could be quantized further so that they could be downloaded on first run using Background Assets? Maybe we're not there yet, but I'm interested in a better, local Siri like this with some sort of "agentic lite" capabilities.
> Do you think the models you're using could be quantized further so that they could be downloaded on first run using Background Assets?
I first tried Qwen 3.5 0.8B at Q4_K_S and the model couldn't hold a basic conversation, though I haven't tried lower quants of the 2B.
I'm also interested in the Apple Foundation models; it's something I plan to try next. AFAIK the model is on par with Qwen-3-4B [0]. The biggest upside, as you alluded to, is that you don't need to download it, which is huge for user onboarding.
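If it helps, the API surface for the Foundation models is small. Here's a minimal sketch, assuming the FoundationModels framework on iOS 26+ / macOS 26+; `askOnDeviceModel` is just a hypothetical helper name, and the availability check matters because the model only exists once Apple Intelligence is enabled and the assets are downloaded:

```swift
import FoundationModels  // Apple's on-device foundation model framework

// One prompt/response round trip with the system language model.
func askOnDeviceModel(_ prompt: String) async throws -> String? {
    // Bail out if Apple Intelligence is off or the model assets
    // haven't been downloaded to this device yet.
    guard case .available = SystemLanguageModel.default.availability else {
        return nil
    }
    let session = LanguageModelSession(instructions: "Be concise.")
    let response = try await session.respond(to: prompt)
    return response.content
}
```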
Subjectively, AFM isn’t even close to Qwen. It’s one of the weakest models I’ve used. I’m not even sure how many people have Apple Intelligence enabled. But I agree, there must be a huge onboarding win long-term using (and adapting) a model that’s already optimized for your machine. I’ve learned how to navigate most of its shortcomings, but it’s not the most pleasant to work with.
Totally agree. There are significantly more new apps being released. I've been visiting the /r/macapps subreddit, and they're having trouble filtering new submissions. I generally like the direction they're taking: https://www.reddit.com/r/macapps/comments/1ryaeex/rmacapps_m...
Even though it's more troublesome to submit apps to the App Store, it's one signal that an app isn't malware.
Wow, this subreddit looks like the apocalypse of vibe-coded projects/apps, kind of similar to what happened to "Show HN": too many ideas, not enough problems to solve, and likely bad implementations. The result is that nobody uses any of the apps.
In AI conversations, people often forget that at the end of the day, an actual human needs to use your stuff.
Since it's much easier to port source code to other languages now, I'd love to see more projects like this written in Swift or C#.