Skip to content

What is Okahu

Overview

AI Observability is proactive monitoring of your AI apps and cloud infra they run on to understand how to make them work better

Problem

GenAI application landscape

Today's GenAI applications are complex. They comprises of multiple technology components like workflow code, models, model hosting services, vector databases etc. Monitoring each components behavior, tracking the coorelations between components and understanding big picture is very hard.

Solution

Okahu observability cloud

Okahu captures traces and metrics from all the GenAI application component. We automatically discover the relationships between components and build the big picture. Okahu lets you can define your goals like performance and reliability, track your applications behavior, help you identify and resolve the problems.

Getting Started

Explore Okahu playground

Login to Okahu portal with your Github or Linkedin account. We'll create a tenant for you with sample data.

Okahu playground app

You can explore the Okahu playground to see what capabilties Okahu offers

Application insights

Application Insights

Okahu uses the traces and metrics data to measure your applications behavior against the your pre-defined goals.

Component map

Application component map

Okahu automatically discovers the GenAI components used by your application and their dependenecies to build the big picture.

Inference and Prompts

Application inference

Okahu offers full range of RestAPI to programatically use Okahu's capabilities. Clone Okahu's Postman collection to explore the Rest APIs. You'll need the API key that can be retrieved from settings tab in Okahu portal. The example above shows the details of prompt used in a inference request by the demo application.

Infrastructure metrics

Okahu collects various metrics like GPU or CPU utilization from infrastructure services like Azure OpenAI or NVDIA Triton. You can view these metrics using Okahu's rest APIs.

Monitor your app in Okahu

Once you explore the playground and understand the value Okahu provides, it's time to monitor your application.

1. Local Development

Okahu started a community driven open source project called Monocle under Linux Foundation Data & AI. Monocle provides a Python library that enables a GenAI developer to generate traces from their application runs. You and review these traces locally to help build and tune the application. Please refer to Monocle user guide for enabling your application.

Open

We invite you to get involved in the Monocle. Your suggestions and contribution are very welcome!

2. Okahu Cloud

Okahu extendeds Monocle's technology to graduate from local development environment to pre-prod and prod where you want to collaborate with other stakeholder like developers, DevOps and leadership. You can now define your application in Okahu and track your goals. Please refer to Okahu application guide to learn how to enable your application.

3. On-prem Okahu

Okahu collects the metrics and traces from your infrastructure components like Azure OpenAI, AWS, Sagemaker or NVDIA Triton. You need to deploy and configure the Okahu agent in your on-prem environment where these services are running. For more details, please refer to Okahu's infra agent guide.

Provide feedback

We welcome your feedback. Please email us at cx@okahu.ai.

References