The “Android Bench” for ranking AI models used in Android app development has been updated, with OpenAI’s latest model now tied with Gemini for the top spot.
First released in March, the “Android Bench” is Google’s resource for measuring the best AI models to use for coding Android apps. Google’s methodology includes looking at how the models can work with Jetpack Compose for UI, Coroutines and Flows for asynchronous programming, room for persistence, and hilt for dependency injection, among other factors.
In the first update to this list, Google has added two new models in OpenAI’s GPT 5.4 and GPT 5.3 Codex, and they quickly jump towards the top of the list.
The rest of the list didn’t change this time around, with the results used still from late February in that initial run. OpenAI’s latest models were tested in mid-March ahead of this week’s release of those results.
Of course, these results shouldn’t be treated as an absolute fact. As with any benchmark, reality often differs from controlled tests. There are a ton of variables for why one model might work better for you than another, including workflow, value, and more.
Google originally said that its goal in publishing these results was to help developers be “more productive” and, ultimately, deliver “higher quality apps across the Android ecosystem.”
Follow Ben: Twitter/X, Threads, Bluesky, and Instagram
FTC: We use income earning auto affiliate links. More.
Check out 9to5Google on YouTube for more news:
Breaking news for Android. Get the latest on app…
Ben is a Senior Editor for 9to5Google.
Find him on Twitter @NexusBen. Send tips to schoon@9to5g.com or encrypted to benschoon@protonmail.com.
AI Search


