Aparna Dhinakaran – Medium

Aparna Dhinakaran in Towards Data Science

Large Language Model Performance in Time Series Analysis

How do major LLMs stack up at detecting anomalies or movements in the data when given a large set of time series data within the context…

5 min read · May 1, 2024

Aparna Dhinakaran in Towards Data Science

Tips for Getting the Generation Part Right in Retrieval Augmented Generation

Results from experiments to evaluate and compare GPT-4, Claude 2.1, and Claude 3.0 Opus

6 min read · Apr 6, 2024

Aparna Dhinakaran in Towards Data Science

Model Evaluations Versus Task Evaluations

Understanding the difference for LLM applications

9 min read · Mar 26, 2024

Aparna Dhinakaran in Towards Data Science

Why You Should Not Use Numeric Evals For LLM As a Judge

Testing major LLMs on how well they conduct numeric evaluations

8 min read · Mar 8, 2024

Aparna Dhinakaran in Towards Data Science

The Needle In a Haystack Test

Evaluating the performance of RAG systems

9 min read · Feb 15, 2024

Aparna Dhinakaran in Towards Data Science

LLM Evals: Setup and the Metrics That Matter

How to build and run LLM evals — and why you should use precision and recall when benchmarking your LLM prompt template

12 min read · Oct 13, 2023

Aparna Dhinakaran in Towards Data Science

Safeguarding LLMs with Guardrails

A pragmatic guide to implementing guardrails, covering both Guardrails AI and NVIDIA’s NeMo Guardrails

10 min read · Sep 1, 2023

Aparna Dhinakaran in Towards Data Science

How To Best Leverage OpenAI’s Evals Framework

Keys for evaluating LLMs using OpenAI Evals

6 min read · May 23, 2023

Aparna Dhinakaran in Towards Data Science

Applying Large Language Models to Tabular Data to Identify Drift

Can LLMs reduce the effort involved in anomaly detection, sidestepping the need for parameterization or dedicated model training?

9 min read · Apr 25, 2023

Aparna Dhinakaran in Towards Data Science

Boosting Tabular Data Predictions with Large Language Models

What happens when you unleash GPT-4 on a tabular Kaggle competition to predict home prices?

9 min read · Apr 6, 2023

Aparna Dhinakaran

Co-Founder and CPO of Arize AI. Formerly Computer Vision PhD at Cornell, Uber Machine Learning, UC Berkeley AI Research.
