Fine-tuning Mistral for an enhanced content search experience (part I). A comprehensive look

Blog post

At Empathy.co, we continuously strive to push the boundaries of innovation, particularly in enhancing search experiences. Our recent project involved fine-tuning the Mistral model to improve content search experiences. Despite our focus applies to many content scenarios, these post series show how we improved our developer portal search functionality to provide a great example of what’s possible.

This post is the first in a four-part series, focusing specifically on the overall journey. The second post will provide a backend perspective, and subsequent posts will explore the infrastructure and user experience perspectives.

HolonSearch Privacy-First Generative Content Search experience in EPDocs
Fig. HolonSearch Privacy-First Generative Content Search experience in Empathy Platform Docs

The horizons of Generative AI

From the very beginning of AI irrupting in our daily lives, we can outline two horizons of generative AI experiences:

  • Interactive query & retrieval experiences that involve engaging in a conversation where you ask questions and receive answers. The interaction is based on natural language input.
  • Data-driven automation experiences that involve providing richer file datasets or data from emails, meeting notes, calendars, etc. So, AI-like tools can process this data to generate responses and automate tasks.

Following these approaches without caring about the importance of privacy awareness can lead to significant risks, including data misuse, erosion of user trust, and potential breaches of sensitive information. Our goal to elicit trust on our way to leveraging AI in search is to start with the aspects of an emotionally engaging experience, focusing on creating an intuitive and private user experience.

So, we’ve created a seamless AI content search experience that maintains people’s trust and privacy while interacting with it, encapsulated by our ethos of “We love AI. We love privacy.” By integrating a privacy-minded approach, we ensure a holistic and responsible development of AI technologies.

A unified vision

In developing this project, we aimed to create a seamless and intuitive search experience powered by generative AI while ensuring people’s trust and privacy. This integral vision required a coordinated effort across three main areas:

Backend development

Our backend teams focused on generating and fine-tuning datasets for the Mistral AI model. By creating diverse data sets and using advanced machine learning techniques, we ensured the model could provide accurate and contextually relevant responses.

Get to know more about the backend process in the second part of the post series.

Infrastructure

To support the enhanced AI capabilities, we designed a flexible and secure infrastructure. This involved using open standards to avoid vendor lock-in and implementing AI gateways to manage interactions between clients and multiple AI model providers. Our setup included both self-hosted and cloud-based solutions, ensuring scalability and reliability.

For more details on the infrastructure, check out the third part of this series.

UX and frontend development

The frontend team worked to translate the backend capabilities and the robust infrastructure into a user-friendly interface, upgrading people’s interaction and search experience. By integrating natural language processing into the existing contextual search function yet maintaining traditional keyword search options, we created an intuitive and familiar search experience, ensuring it met the needs of both beginners and experienced users.

Discover more on the user experience approach in the last post of the series.

The result: a unified search experience

The fine-tuning of Mistral for our developer portal exemplifies our commitment to innovation and user privacy. By combining diverse data sources, leveraging advanced AI models, and ensuring flexible deployment, we’ve created a powerful and private search experience.

Try it out!

Love AI. Love privacy. Type away! on our search bar above and try out our new HolonSearch Privacy-First Generative Content Search right now!

Keep reading!

Read more about our journey to our HolonSearch Privacy-First Generative Content Search experience in the following posts. Explore now the backend, infrastructure, and UX perspectives in more detail.