{ "cells": [ { "cell_type": "markdown", "id": "15e64bc9-bdf7-4fe5-9839-5d160c425c61", "metadata": { "tags": [] }, "source": [ "# Typewriter: 26 Tools\n", "\n", "This is a variation of the typewriter task in which the agent has access to 26 parameterless tools.\n", "\n", "Each tool represents a letter of the alphabet (e.g., 'a', 'b', 'c').\n", "\n", "The agent can use each tool to \"print\" the corresponding letter on a piece of virtual paper.\n", "\n", "The objective for the agent is to \"print\" the user's input on the paper exactly.\n", "\n", "---------\n", "\n", "For this code to work, please configure LangSmith environment variables with your credentials.\n", "\n", "```python\n", "import os\n", "\n", "os.environ[\"LANGCHAIN_API_KEY\"] = \"sk-...\" # Your api key.\n", "```" ] }, { "cell_type": "code", "execution_count": 1, "id": "b39159d0-9ea1-414f-a9d8-4a7b22b3d2cc", "metadata": { "tags": [] }, "outputs": [], "source": [ "from langchain_benchmarks import registry" ] }, { "cell_type": "code", "execution_count": 2, "id": "1aef2b32-a5df-421f-8be3-a2ef27372ece", "metadata": { "tags": [] }, "outputs": [ { "data": { "text/html": [ "
Name | Tool Usage - Typewriter (26 tools) |
Type | ToolUsageTask |
Dataset ID | 128af05e-aa00-4e3b-a958-d166dd450581 |
Description | Environment with 26 tools each tool represents a letter of the alphabet.\n", "\n", "The objective of this task is to evaluate the model's ability the use tools\n", "for a simple repetition task.\n", "\n", "For example, if the string is 'abc', the tools 'a', 'b', and 'c' must be invoked in that order.\n", "\n", "The dataset includes examples of varying difficulty. The difficulty is measured by the length of the string.\n", "\n", "This is a variation of the typer writer task, where 26 parameterless tools are\n", "given instead of a single tool that takes a letter as an argument. |