Aparna DhinakaraninTowards Data ScienceLarge Language Model Performance in Time Series AnalysisHow do major LLMs stack up at detecting anomalies or movements in the data when given a large set of time series data within the context…5 min read·May 1, 2024--2--2
Aparna DhinakaraninTowards Data ScienceTips for Getting the Generation Part Right in Retrieval Augmented GenerationResults from experiments to evaluate and compare GPT-4, Claude 2.1, and Claude 3.0 Opus6 min read·Apr 6, 2024----
Aparna DhinakaraninTowards Data ScienceModel Evaluations Versus Task EvaluationsUnderstanding the difference for LLM applications9 min read·Mar 26, 2024----
Aparna DhinakaraninTowards Data ScienceWhy You Should Not Use Numeric Evals For LLM As a JudgeTesting major LLMs on how well they conduct numeric evaluations8 min read·Mar 8, 2024----
Aparna DhinakaraninTowards Data ScienceThe Needle In a Haystack TestEvaluating the performance of RAG systems9 min read·Feb 15, 2024----
Aparna DhinakaraninTowards Data ScienceLLM Evals: Setup and the Metrics That MatterHow to build and run LLM evals — and why you should use precision and recall when benchmarking your LLM prompt template12 min read·Oct 13, 2023--4--4
Aparna DhinakaraninTowards Data ScienceSafeguarding LLMs with GuardrailsA pragmatic guide to implementing guardrails, covering both Guardrails AI and NVIDIA’s NeMo Guardrails10 min read·Sep 1, 2023--2--2
Aparna DhinakaraninTowards Data ScienceHow To Best Leverage OpenAI’s Evals FrameworkKeys for evaluating LLMs using OpenAI Evals6 min read·May 23, 2023--2--2
Aparna DhinakaraninTowards Data ScienceApplying Large Language Models to Tabular Data to Identify DriftCan LLMs reduce the effort involved in anomaly detection, sidestepping the need for parameterization or dedicated model training?9 min read·Apr 25, 2023--5--5
Aparna DhinakaraninTowards Data ScienceBoosting Tabular Data Predictions with Large Language ModelsWhat happens when you unleash GPT-4 on a tabular Kaggle competition to predict home prices?9 min read·Apr 6, 2023--11--11