Calling Gemma with Ollama, TestContainers, and LangChain4j

April 12, 2024
3 min read

Likes ...

Comments ...

Table of Contents

How to run GemmaContainerizationTime to implement this approach!And voila!

Lately, for my Generative AI powered Java apps, I've used the Gemini multimodal large language model from Google. But there's also Gemma, its little sister model.

Gemma is a family of lightweight, state-of-the-art open models built from the same research and technology used to create the Gemini models.

Gemma is available in two sizes: 2B and 7B. Its weights are freely available, and its small size means you can run it on your own, even on your laptop. So I was curious to give it a run with LangChain4j.

How to run Gemma

There are many ways to run Gemma: in the cloud, via Vertex AI with a click of a button, or GKE with some GPUs, but you can also run it locally with Jlama or Gemma.cpp.

Another good option is to run Gemma with Ollama, a tool that you install on your machine, and which lets you run small models, like Llama 2, Mistral, and many others. They quickly added support for Gemma as well.

Once installed locally, you can run:

ollama run gemma:2b
ollama run gemma:7b

Cherry on the cake, the [LangChain4j]() library provides an Ollama module, so you can plug Ollama supported models in your Java applications easily.

Containerization

After a great discussion with my colleague Dan Dobrin who had worked with Ollama and TestContainers (#1 and #2) in his serverless production readiness workshop, I decided to try the approach below.

Which brings us to the last piece of the puzzle: Instead of having to install and run Ollama on my computer, I decided to use Ollama within a container, handled by TestContainers.

TestContainers is not only useful for testing, but you can also use it for driving containers. There's even a specific OllamaContainer you can take advantage of!

So here's the whole picture:

Time to implement this approach!

You'll find the code in the Github repository accompanying my recent Gemini workshop

Let's start with the easy part, interacting with an Ollama supported model with LangChain4j:

OllamaContainer ollama = createGemmaOllamaContainer();
ollama.start();

ChatLanguageModel model = OllamaChatModel.builder()
    .baseUrl(String.format("http://%s:%d", ollama.getHost(), ollama.getFirstMappedPort()))
    .modelName("gemma:2b")
    .build();

String response = model.generate("Why is the sky blue?");

System.out.println(response);

You run an Ollama test container.
You create an Ollama chat model, by pointing at the address and port of the container.
You specify the model you want to use.
Then, you just need to call model.generate(yourPrompt) as usual.

Easy? Now let's have a look at the trickier part, my local method that creates the Ollama container:

// check if the custom Gemma Ollama image exists already
List<Image> listImagesCmd = DockerClientFactory.lazyClient()
    .listImagesCmd()
    .withImageNameFilter(TC_OLLAMA_GEMMA_2_B)
    .exec();

if (listImagesCmd.isEmpty()) {
    System.out.println("Creating a new Ollama container with Gemma 2B image...");
    OllamaContainer ollama = new OllamaContainer("ollama/ollama:0.1.26");
    ollama.start();
    ollama.execInContainer("ollama", "pull", "gemma:2b");
    ollama.commitToImage(TC_OLLAMA_GEMMA_2_B);
    return ollama;
} else {
    System.out.println("Using existing Ollama container with Gemma 2B image...");
    // Substitute the default Ollama image with our Gemma variant
    return new OllamaContainer(
        DockerImageName.parse(TC_OLLAMA_GEMMA_2_B)
            .asCompatibleSubstituteFor("ollama/ollama"));
}

You need to create a derived Ollama container that pulls in the Gemma model. Either this image was already created beforehand, or if it doesn't exist yet, you create it.

Use the Docker Java client to check if the custom Gemma image exists. If it doesn't exist, notice how TestContainers let you create an image derived from the base Ollama image, pull the Gemma model, and then commit that image to your local Docker registry.

Otherwise, if the image already exists (ie. you created it in a previous run of the application), you're just going to tell TestContainers that you want to substitute the default Ollama image with your Gemma-powered variant.

And voila!

You can call Gemma locally on your laptop, in your Java apps, using LangChain4j, without having to install and run Ollama locally (but of course, you need to have a Docker daemon running).

Big thanks to Dan Dobrin for the approach, and to Sergei, Eddú and Oleg from TestContainers for the help and useful pointers.

AI4J - The Intelligent Java Conference

This exclusive virtual event brings together leading AI innovators and renowned Java Champions to unpack what’s changing today, what’s coming next, and how enterprise Java teams can stay ahead. April 14, 2026 @ 9am PDT | 12pm PDT.

April 12, 2024
3 min read

Likes ...

Comments ...

Guillaume Laforge

Author

Guillaume Laforge is a Java Champion, the co-founder of the Apache Groovy programming language, and is also a developer advocate for Google Cloud, focusing on generative AI, serverless technologies, and API orchestration.

How to Develop AI Agents Using BoxLang AI: A Practical Guide

Code. Check. Commit. 🚀 Never Leave the Terminal with Claude Code + SonarQube MCP

Disco API: Helping You To Find Any OpenJDK Distribution

Monitoring Across Frameworks: Spring Boot, Micronaut, Quarkus, and Helidon

Spring Boot Debugging with Aspect-Oriented Programming (AOP)

Service Layer Pattern in Java With Spring Boot

Semantic Caching with SpringBoot & Redis

Liquid Glass, Material 3, And A Lot Of Plumbing

A New Chapter for the Payara Community

Which Java Runtime Should You Use in Production? Comparing OpenJDK Distributions

foojay: A Place for Friends of OpenJDK

Dashboard for OpenJDK Update Release Details

JDK14: New Features and Enhancements

Fun with Flags: My Top 10 Resources for JVM Flags

Performance of Modern Java on Data-Heavy Workloads: Real-Time Streaming

Performance of Modern Java on Data-Heavy Workloads: Batch Processing

How does Java handle different Images and ColorSpaces – Part 1

How does Java handle different Images and ColorSpaces – Part 2

How does Java handle different Images and ColorSpaces – Part 3

How does Java handle different Images and ColorSpaces – Part 4

Indexing all of Wikipedia, on a laptop

Working with Multiple Carets in IntelliJ IDEA

Clean Shutdown of Spring Boot Applications

Project Panama for Newbies (Part 1)

Java 17 on the Raspberry Pi

How to Create Mobile Apps with JavaFX (Part 1)

Beginning JavaFX Applications with IntelliJ IDE

SpringBoot 3.2 + CRaC

Foojay Slack: bit.ly/join-foojay-slack

Preparing for Spring Framework 7 and Spring Boot 4

Apache Kafka Performance on Azul Platform Prime vs Vanilla OpenJDK

Learn about a number of experiments that have been conducted with Apache Kafka performance on Azul Platform Prime, compared to vanilla OpenJDK. Roughly 40% improvements in performance, both throughput and latency, are achieved.

Stable, Secure, and Affordable Java

Azul Platform Core is the #1 Oracle Java alternative, offering OpenJDK support for more versions (including Java 6 & 7) and more configurations for the greatest business value and lowest TCO.

Quick Start with Machine Learning in Java

So you’re a Java developer and you want to do some machine learning.

Oct 07 3,5K

Zoran Sevarac

Deep Netts

Machine Learning

Busting Myths, Building Futures: A Conversation with Cay Horstmann on Java and Machine Learning

Cay Horstmann shares his experiences with Java, his writing process for technical books, the challenges of teaching Java, and discusses its role in education.

Jul 24 7,2K

A N M Bazlur Rahman

Interviews

Fabiane Bizinella Nardon Talks about Machine Learning and Disruptive Data Science

I atttended sessions and spoke with Java Champion Fabiane Bizinella Nardon at many JavaOne conferences.

I remember, in our conversations in the hallways, discussing various entrepreneurial ventures she was working on.

One of the ideas was Tail Target. Fast forward almost a decade, and Tail Target has truly come to fruition.

Mar 26 3,2K

Kevin Farnham

Interviews

Machine Learning

Faster Integration Tests with Reusable Testcontainers

Learn how to improve your test performance against container-based resources by magnitudes in a couple of easy steps!

Aug 17 16,2K

Michael Simons

Databases

Testing Testcontainers Performance Neo4J

Jakarta EE 11: Beyond the Era of Java EE

Step up your coding with the Continuous Feedback Udemy Course: Additional coupons are available

Stable, Secure, and Affordable Java

Calling Gemma with Ollama, TestContainers, and LangChain4j

How to run Gemma

Containerization

Time to implement this approach!

And voila!

AI4J - The Intelligent Java Conference

Guillaume Laforge

Guillaume Laforge

Thanks to our Sponsors!

Azul

Redis

CodeRabbit

Reo

Zencoder

Payara

Digma

adesso

Trending

Apache Kafka Performance on Azul Platform Prime vs Vanilla OpenJDK

Stable, Secure, and Affordable Java

Stable, Secure, and Affordable Java

Step up your coding with the Continuous Feedback Udemy Course: Additional coupons are available

Jakarta EE 11: Beyond the Era of Java EE

Comments (1)

Java Weekly, Issue 538 | Baeldung

Jakarta EE 11: Beyond the Era of Java EE

Step up your coding with the Continuous Feedback Udemy Course: Additional coupons are available

Stable, Secure, and Affordable Java

Do you want your ad here?

Calling Gemma with Ollama, TestContainers, and LangChain4j

How to run Gemma

Containerization

Time to implement this approach!

And voila!

AI4J - The Intelligent Java Conference

Guillaume Laforge

Guillaume Laforge

Thanks to our Sponsors!

Azul

Redis

CodeRabbit

Reo

Zencoder

Payara

Digma

adesso

Trending

Apache Kafka Performance on Azul Platform Prime vs Vanilla OpenJDK

Stable, Secure, and Affordable Java

All 0 Likes

Stable, Secure, and Affordable Java

Step up your coding with the Continuous Feedback Udemy Course: Additional coupons are available

Jakarta EE 11: Beyond the Era of Java EE

Do you want your ad here?

Related Articles

Comments (1)

Java Weekly, Issue 538 | Baeldung

Set Event Reminder

Subscribe to foojay updates:

Share with