Aparna Dhinakaran – Medium

Aparna Dhinakaran

Aparna Dhinakaran
in
Towards Data Science

Large Language Model Performance in Time Series Analysis

How do major LLMs stack up at detecting anomalies or movements in the data when given a large set of time series data within the context…

May 1

Large Language Model Performance in Time Series Analysis

May 1

Aparna Dhinakaran
in
Towards Data Science

Tips for Getting the Generation Part Right in Retrieval Augmented Generation

Results from experiments to evaluate and compare GPT-4, Claude 2.1, and Claude 3.0 Opus

Apr 6

Tips for Getting the Generation Part Right in Retrieval Augmented Generation

Apr 6

Aparna Dhinakaran
in
Towards Data Science

Model Evaluations Versus Task Evaluations

Understanding the difference for LLM applications

Mar 26

Model Evaluations Versus Task Evaluations

Mar 26

Aparna Dhinakaran
in
Towards Data Science

Why You Should Not Use Numeric Evals For LLM As a Judge

Testing major LLMs on how well they conduct numeric evaluations

Mar 8

Why You Should Not Use Numeric Evals For LLM As a Judge

Mar 8

Aparna Dhinakaran
in
Towards Data Science

The Needle In a Haystack Test

Evaluating the performance of RAG systems

Feb 15

The Needle In a Haystack Test

Feb 15

Aparna Dhinakaran
in
Towards Data Science

LLM Evals: Setup and the Metrics That Matter

How to build and run LLM evals — and why you should use precision and recall when benchmarking your LLM prompt template

Oct 13, 2023

LLM Evals: Setup and the Metrics That Matter

Oct 13, 2023

Aparna Dhinakaran
in
Towards Data Science

Safeguarding LLMs with Guardrails

A pragmatic guide to implementing guardrails, covering both Guardrails AI and NVIDIA’s NeMo Guardrails

Sep 1, 2023

Safeguarding LLMs with Guardrails

Sep 1, 2023

Aparna Dhinakaran
in
Towards Data Science

How To Best Leverage OpenAI’s Evals Framework

Keys for evaluating LLMs using OpenAI Evals

May 23, 2023

How To Best Leverage OpenAI’s Evals Framework

May 23, 2023

Aparna Dhinakaran
in
Towards Data Science

Applying Large Language Models to Tabular Data to Identify Drift

Can LLMs reduce the effort involved in anomaly detection, sidestepping the need for parameterization or dedicated model training?

Apr 25, 2023

Applying Large Language Models to Tabular Data to Identify Drift

Apr 25, 2023

Aparna Dhinakaran
in
Towards Data Science

Boosting Tabular Data Predictions with Large Language Models

What happens when you unleash GPT-4 on a tabular Kaggle competition to predict home prices?

Apr 6, 2023

Boosting Tabular Data Predictions with Large Language Models

Apr 6, 2023

Aparna Dhinakaran

Aparna Dhinakaran

Co-Founder and CPO of Arize AI. Formerly Computer Vision PhD at Cornell, Uber Machine Learning, UC Berkeley AI Research.

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams