Prep 0.13.2 (#33 )

* fix issue with client removal * client type not editable after creation (keeps things simple) * fixes issue with openai client bugging out (api_url not set) * fix issues with edit client not reflecting changes to UX * 0.13.2
Prep 0.13.1 (#29 )
2025-12-16 19:57:47 +01:00 · 2023-11-19 20:43:15 +02:00 · 2023-11-19 18:58:40 +02:00 · 2023-11-19 14:05:09 +02:00 · 2023-11-18 12:20:02 +02:00 · 2023-11-18 12:16:29 +02:00
159 changed files with 6899 additions and 3535 deletions
--- a/.gitignore
+++ b/.gitignore
@@ -1,10 +1,13 @@
-.lmer
 *.pyc
-problems
 *.swp
 *.swo
 *.egg-info
-tales/
 *-internal*
 *.internal*
 *_internal*
+talemate_env
+chroma
+scenes
+config.yaml
+!scenes/infinity-quest/assets
+!scenes/infinity-quest/infinity-quest.json
--- a/README.md
+++ b/README.md
@@ -4,29 +4,35 @@ Allows you to play roleplay scenarios with large language models.

 It does not run any large language models itself but relies on existing APIs. Currently supports **text-generation-webui** and **openai**.

-This means you need to either have an openai api key or know how to setup [oobabooga/text-generation-webui](https://github.com/oobabooga/text-generation-webui) (locally or remotely via gpu renting.)
+This means you need to either have an openai api key or know how to setup [oobabooga/text-generation-webui](https://github.com/oobabooga/text-generation-webui) (locally or remotely via gpu renting. `--extension openai` flag needs to be set)

-![Screenshot 1](docs/img/Screenshot_8.png)
+As of version 0.13.0 the legacy text-generator-webui API `--extension api` is no longer supported, please use their new `--extension openai` api implementation instead. 
+
+![Screenshot 1](docs/img/Screenshot_9.png)
 ![Screenshot 2](docs/img/Screenshot_2.png)

 ## Current features

 - responive modern ui
 - agents
-    - conversation
-    - narration
-    - summarization
-    - director
-    - creative
- multi-client (agents can be connected to separate LLMs)
- long term memory (very experimental at this point)
+    - conversation: handles character dialogue
+    - narration: handles narrative exposition
+    - summarization: handles summarization to compress context while maintain history
+    - director: can be used to direct the story / characters
+    - editor: improves AI responses (very hit and miss at the moment)
+    - world state: generates world snapshot and handles passage of time (objects and characters)
+    - creator: character / scenario creator
+- multi-client (agents can be connected to separate APIs)
+- long term memory
+    - chromadb integration
+    - passage of time
 - narrative world state
 - narrative tools
 - creative tools 
    - AI backed character creation with template support (jinja2)
    - AI backed scenario creation
 - runpod integration
- overridable templates foe all LLM prompts. (jinja2)
+- overridable templates for all prompts. (jinja2)

 ## Planned features

@@ -34,20 +40,27 @@ Kinda making it up as i go along, but i want to lean more into gameplay through

 In no particular order:

- Gameplay loop governed by AI
+- Extension support
+    - modular agents and clients
 - Improved world state
 - Dynamic player choice generation
 - Better creative tools
    - node based scenario / character creation
- Improved long term memory (base is there, but its very rough at the moment)
+- Improved and consistent long term memory
 - Improved director agent
    - Right now this doesn't really work well on anything but GPT-4 (and even there it's debatable). It tends to steer the story in a way that introduces pacing issues. It needs a model that is creative but also reasons really well i think.
- Automatic1111 client
+- Gameplay loop governed by AI
+    - objectives
+    - quests
+    - win / lose conditions
+- Automatic1111 client for in place visual generation

 # Quickstart

 ## Installation

+Post [here](https://github.com/final-wombat/talemate/issues/17) if you run into problems during installation.
+
 ### Windows

 1. Download and install Python 3.10 or higher from the [official Python website](https://www.python.org/downloads/windows/).
@@ -64,7 +77,7 @@ In no particular order:
 1. `git clone git@github.com:final-wombat/talemate`
 1. `cd talemate`
 1. `source install.sh`
-1. Start the backend: `python src/talemate/server/run.py runserver --host 0.0.0.0 --port 5001`.
+1. Start the backend: `python src/talemate/server/run.py runserver --host 0.0.0.0 --port 5050`.
 1. Open a new terminal, navigate to the `talemate_frontend` directory, and start the frontend server by running `npm run serve`.

 ## Configuration
@@ -94,31 +107,18 @@ Once the api key is set Pods loaded from text-generation-webui templates (or the

 **ATTENTION**: Talemate is not a suitable for way for you to determine whether your pod is currently running or not. **Always** check the runpod dashboard to see if your pod is running or not.

-## Recommended Models
+## Recommended Models 
+(as of2023.10.25)

-Note: this is my personal opinion while using talemate. If you find a model that works better for you, let me know about it.
-
-Will be updated as i test more models and over time.
-
-| Model Name                    | Type            | Notes                                                                                                             |
-|-------------------------------|-----------------|-------------------------------------------------------------------------------------------------------------------|
-| [GPT-4](https://platform.openai.com/) | Remote         | Still the best for consistency and reasoning, but is heavily censored, while talemate will send a general "decensor" system prompt, depending on the type of content you want to roleplay, there is a chance your key will be banned. **If you do use this make sure to monitor your api usage, talemate tends to send a lot more requests than other roleplaying applications.** |
-| [Nous Hermes LLama2](https://huggingface.co/TheBloke/Nous-Hermes-Llama2-GPTQ) | 13B model | My go-to model for 13B parameters. It's good at roleplay and also smart enough to handle the world state and narrative tools. A 13B model loaded via exllama also allows you run chromadb with the xl instructor embeddings off of a single 4090. |
-| [MythoMax](https://huggingface.co/TheBloke/MythoMax-L2-13B-GPTQ) | 13B model | Similar quality to Hermes LLama2, but a bit more creative. Rarely fails on JSON responses. |
-| [Synthia v1.2 34B](https://huggingface.co/TheBloke/Synthia-34B-v1.2-GPTQ) | 34B model | Cannot be run at full context together with chromadb instructor models on a single 4090. But a great choice if you're running chromadb with the default embeddings (or on cpu). |
-| [Xwin-LM-70B](https://huggingface.co/Xwin-LM/Xwin-LM-70B-V0.1) | 70B model | Great choice if you have the hardware to run it (or can rent it). |
-| [Synthia v1.2 70B](https://huggingface.co/TheBloke/Synthia-70B-v1.2-GPTQ) | 70B model | Great choice if you have the hardware to run it (or can rent it). |
-
-I have not included OpenAI's gpt-3.5-turbo in this list, since it is really inconsistent with JSON responses, plus its probably still just as heavily censored as GPT-4.
-
-I have not tested with Llama 1 models in a while, Lazarus was really good at roleplay, but started failing on JSON requirements.
-
-I have not tested with anything below 13B parameters.
+Any of the top models in any of the size classes here should work well:
+https://www.reddit.com/r/LocalLLaMA/comments/17fhp9k/huge_llm_comparisontest_39_models_tested_7b70b/

 ## Connecting to an LLM

 On the right hand side click the "Add Client" button. If there is no button, you may need to toggle the client options by clicking this button:

+As of version 0.13.0 the legacy text-generator-webui API `--extension api` is no longer supported, please use their new `--extension openai` api implementation instead. 
+
 ![Client options](docs/img/client-options-toggle.png)

 ### Text-generation-webui
--- a/config.example.yaml
+++ b/config.example.yaml
@@ -1,3 +1,5 @@
+agents: {}
+clients: {}
 creator:
  content_context:
  - a fun and engaging slice of life story aimed at an adult audience.
--- a/docs/chromadb.md
+++ b/docs/chromadb.md
@@ -1,7 +1,15 @@
-## ChromaDB
+# ChromaDB
+
+Talemate uses ChromaDB to maintain long-term memory. The default embeddings used are really fast but also not incredibly accurate. If you want to use more accurate embeddings you can use the instructor embeddings or the openai embeddings. See below for instructions on how to enable these.
+
+In my testing so far, instructor-xl has proved to be the most accurate (even more-so than openai)
+
+## Local instructor embeddings

 If you want chromaDB to use the more accurate (but much slower) instructor embeddings add the following to `config.yaml`:

+**Note**: The `xl` model takes a while to load even with cuda. Expect a minute of loading time on the first scene you load.
+
 ```yaml
 chromadb:
    embeddings: instructor
@@ -9,17 +17,23 @@ chromadb:
    instructor_model: hkunlp/instructor-xl
 ```

+### Instructor embedding models
+
+- `hkunlp/instructor-base` (smallest / fastest)
+- `hkunlp/instructor-large` 
+- `hkunlp/instructor-xl` (largest / slowest) - requires about 5GB of memory
+
 You will need to restart the backend for this change to take effect.

 **NOTE** - The first time you do this it will need to download the instructor model you selected. This may take a while, and the talemate backend will be un-responsive during that time.

-Once the download is finished, if talemate is still un-responsive, try reloading the front-end to reconnect. When all fails just restart the backend as well.
+Once the download is finished, if talemate is still un-responsive, try reloading the front-end to reconnect. When all fails just restart the backend as well. I'll try to make this more robust in the future.

 ### GPU support

 If you want to use the instructor embeddings with GPU support, you will need to install pytorch with CUDA support. 

-To do this on windows, run `install-pytorch-cuda.bat` from the project root. Then change your device in the config to `cuda`:
+To do this on windows, run `install-pytorch-cuda.bat` from the project directory. Then change your device in the config to `cuda`:

 ```yaml
 chromadb:
@@ -28,8 +42,20 @@ chromadb:
    instructor_model: hkunlp/instructor-xl
 ```

-Instructor embedding models:
+## OpenAI embeddings

- `hkunlp/instructor-base` (smallest / fastest)
- `hkunlp/instructor-large` 
- `hkunlp/instructor-xl` (largest / slowest) - requires about 5GB of GPU memory
+First make sure your openai key is specified in the `config.yaml` file
+
+```yaml
+openai:
+  api_key: <your-key-here>
+```
+
+Then add the following to `config.yaml` for chromadb:
+
+```yaml
+chromadb:
+    embeddings: openai
+```
+
+**Note**: As with everything openai, using this isn't free. It's way cheaper than their text completion though. ALSO - if you send super explicit content they may flag / ban your key, so keep that in mind (i hear they usually send warnings first though), and always monitor your usage on their dashboard.
--- a/docs/img/Screenshot_9.png
+++ b/docs/img/Screenshot_9.png
--- a/docs/linux-install.md
+++ b/docs/linux-install.md
@@ -14,7 +14,7 @@

 1. With the virtual environment activated and dependencies installed, you can start the backend server.
 2. Navigate to the `src/talemate/server` directory.
-3. Run the server with `python run.py runserver --host 0.0.0.0 --port 5001`.
+3. Run the server with `python run.py runserver --host 0.0.0.0 --port 5050`.

 ### Running the Frontend

@@ -22,4 +22,4 @@
 2. If you haven't already, install npm dependencies by running `npm install`.
 3. Start the server with `npm run serve`.

-Please note that you may need to set environment variables or modify the host and port as per your setup. You can refer to the `runserver.sh` and `frontend.sh` files for more details.
+Please note that you may need to set environment variables or modify the host and port as per your setup. You can refer to the `runserver.sh` and `frontend.sh` files for more details.
--- a/docs/talemate-scene-schema.json
+++ b/docs/talemate-scene-schema.json
@@ -0,0 +1,187 @@
+{
+  "$schema": "http://json-schema.org/draft-07/schema#",
+  "type": "object",
+  "properties": {
+    "description": {
+      "type": "string"
+    },
+    "intro": {
+      "type": "string"
+    },
+    "name": {
+      "type": "string"
+    },
+    "history": {
+      "type": "array",
+      "items": {
+        "type": "object",
+        "properties": {
+          "message": {
+            "type": "string"
+          },
+          "id": {
+            "type": "integer"
+          },
+          "typ": {
+            "type": "string"
+          },
+          "source": {
+            "type": "string"
+          }
+        },
+        "required": ["message", "id", "typ", "source"]
+      }
+    },
+    "environment": {
+      "type": "string"
+    },
+    "archived_history": {
+      "type": "array",
+      "items": {
+        "type": "object",
+        "properties": {
+          "text": {
+            "type": "string"
+          },
+          "ts": {
+            "type": "string"
+          }
+        },
+        "required": ["text", "ts"]
+      }
+    },
+    "character_states": {
+      "type": "object"
+    },
+    "characters": {
+      "type": "array",
+      "items": {
+        "type": "object",
+        "properties": {
+          "name": {
+            "type": "string"
+          },
+          "description": {
+            "type": "string"
+          },
+          "greeting_text": {
+            "type": "string"
+          },
+          "base_attributes": {
+            "type": "object",
+            "additionalProperties": {
+                "type": "string"
+            }
+          },
+          "details": {
+            "type": "object"
+          },
+          "gender": {
+            "type": "string"
+          },
+          "color": {
+            "type": "string"
+          },
+          "example_dialogue": {
+            "type": "array",
+            "items": {
+              "type": "string"
+            }
+          },
+          "history_events": {
+            "type": "array",
+            "items": {
+              "type": "object"
+            }
+          },
+          "is_player": {
+            "type": "boolean"
+          },
+          "cover_image": {
+            "type": ["string", "null"]
+          }
+        },
+        "required": ["name", "description", "greeting_text", "base_attributes", "details", "gender", "color", "example_dialogue", "history_events", "is_player", "cover_image"]
+      }
+    },
+    "goal": {
+      "type": ["string", "null"]
+    },
+    "goals": {
+      "type": "array",
+      "items": {
+        "type": "object"
+      }
+    },
+    "context": {
+      "type": "string"
+    },
+    "world_state": {
+      "type": "object",
+      "properties": {
+        "characters": {
+          "type": "object",
+          "additionalProperties": {
+            "type": "object",
+            "properties": {
+              "snapshot": {
+                "type": ["string", "null"]
+              },
+              "emotion": {
+                "type": "string"
+              }
+            },
+            "required": ["snapshot", "emotion"]
+          }
+        },
+        "items": {
+          "type": "object",
+          "additionalProperties": {
+            "type": "object",
+            "properties": {
+              "snapshot": {
+                "type": ["string", "null"]
+              }
+            },
+            "required": ["snapshot"]
+          }
+        },
+        "location": {
+          "type": ["string", "null"]
+        }
+      },
+      "required": ["characters", "items", "location"]
+    },
+    "assets": {
+      "type": "object",
+      "properties": {
+        "cover_image": {
+          "type": "string"
+        },
+        "assets": {
+          "type": "object",
+          "additionalProperties": {
+            "type": "object",
+            "properties": {
+              "id": {
+                "type": "string"
+              },
+              "file_type": {
+                "type": "string"
+              },
+              "media_type": {
+                "type": "string"
+              }
+            },
+            "required": ["id", "file_type", "media_type"]
+          }
+        }
+      },
+      "required": ["cover_image", "assets"]
+    },
+    "ts": {
+      "type": "string"
+    }
+  },
+  "required": ["description", "intro", "name", "history", "environment", "archived_history", "character_states", "characters", "context", "world_state", "assets", "ts"]
+}
--- a/install.bat
+++ b/install.bat
@@ -7,10 +7,10 @@ REM activate the virtual environment
 call talemate_env\Scripts\activate

 REM install poetry
-pip install poetry
+python -m pip install "poetry==1.7.1" "rapidfuzz>=3" -U

 REM use poetry to install dependencies
-poetry install
+python -m poetry install

 REM copy config.example.yaml to config.yaml only if config.yaml doesn't exist
 IF NOT EXIST config.yaml copy config.example.yaml config.yaml
--- a/poetry.lock
+++ b/poetry.lock
--- a/pyproject.toml
+++ b/pyproject.toml
@@ -4,7 +4,7 @@ build-backend = "poetry.masonry.api"

 [tool.poetry]
 name = "talemate"
-version = "0.9.0"
+version = "0.13.2"
 description = "AI-backed roleplay and narrative tools"
 authors = ["FinalWombat"]
 license = "GNU Affero General Public License v3.0"
@@ -17,7 +17,7 @@ black = "*"
 rope = "^0.22"
 isort = "^5.10"
 jinja2 = "^3.0"
-openai = "*"
+openai = ">=1"
 requests = "^2.26"
 colorama = ">=0.4.6"
 Pillow = "^9.5"
@@ -27,19 +27,21 @@ typing-inspect = "0.8.0"
 typing_extensions = "^4.5.0"
 uvicorn = "^0.23"
 blinker = "^1.6.2"
-pydantic = "<2"
-langchain = "0.0.213"
+pydantic = "<3"
 beautifulsoup4 = "^4.12.2"
 python-dotenv = "^1.0.0"
 websockets = "^11.0.3"
 structlog = "^23.1.0"
 runpod = "==1.2.0"
 nest_asyncio = "^1.5.7"
+isodate = ">=0.6.1"
+thefuzz = ">=0.20.0"
+tiktoken = ">=0.5.1"

 # ChromaDB
-chromadb = ">=0.4,<1"
+chromadb = ">=0.4.17,<1"
 InstructorEmbedding = "^1.0.1"
-torch = ">=2.0.0, !=2.0.1"
+torch = ">=2.1.0"
 sentence-transformers="^2.2.2"

 [tool.poetry.dev-dependencies]
--- a/reinstall.bat
+++ b/reinstall.bat
@@ -0,0 +1,18 @@
+@echo off
+
+IF EXIST talemate_env rmdir /s /q "talemate_env"
+
+REM create a virtual environment
+python -m venv talemate_env
+
+REM activate the virtual environment
+call talemate_env\Scripts\activate
+
+REM install poetry
+python -m pip install "poetry==1.7.1" "rapidfuzz>=3" -U
+
+REM use poetry to install dependencies
+python -m poetry install
+
+echo Virtual environment re-created.
+pause
--- a/scenes/infinity-quest/infinity-quest.json
+++ b/scenes/infinity-quest/infinity-quest.json
@@ -4,7 +4,29 @@
  "name": "Infinity Quest",
  "history": [],
  "environment": "scene",
-  "archived_history": [],
+  "ts": "P1Y",
+  "archived_history": [
+    {
+      "text": "Captain Elmer and Kaira first met during their rigorous training for the Infinity Quest mission. Their initial interactions were marked by a sense of mutual respect and curiosity.",
+      "ts": "PT1S"
+    },
+    {
+      "text": "Over the course of several months, as they trained together, Elmer and Kaira developed a strong bond. They often spent their free time discussing their dreams of exploring the cosmos.",
+      "ts": "P3M"
+    },
+    {
+      "text": "During a simulated mission, the Starlight Nomad encountered a sudden system malfunction. Elmer and Kaira worked tirelessly together to resolve the issue and avert a potential disaster. This incident strengthened their trust in each other's abilities.",
+      "ts": "P6M"
+    },
+    {
+      "text": "As they ventured further into uncharted space, the crew faced a perilous encounter with a hostile alien species. Elmer and Kaira's coordinated efforts were instrumental in negotiating a peaceful resolution and avoiding conflict.",
+      "ts": "P8M"
+    },
+    {
+      "text": "One memorable evening, while gazing at the stars through the ship's observation deck, Elmer and Kaira shared personal stories from their past. This intimate conversation deepened their connection and understanding of each other.",
+      "ts": "P11M"
+    }
+  ],
  "character_states": {},
  "characters": [
    {
--- a/src/talemate/init.py
+++ b/src/talemate/init.py
@@ -2,4 +2,4 @@ from .agents import Agent
 from .client import TextGeneratorWebuiClient
 from .tale_mate import *

-VERSION = "0.9.0"
+VERSION = "0.13.2"
--- a/src/talemate/agents/init.py
+++ b/src/talemate/agents/init.py
@@ -6,4 +6,6 @@ from .director import DirectorAgent
 from .memory import ChromaDBMemoryAgent, MemoryAgent
 from .narrator import NarratorAgent
 from .registry import AGENT_CLASSES, get_agent_class, register
-from .summarize import SummarizeAgent
+from .summarize import SummarizeAgent
+from .editor import EditorAgent
+from .world_state import WorldStateAgent
--- a/src/talemate/agents/base.py
+++ b/src/talemate/agents/base.py
@@ -10,13 +10,37 @@ from blinker import signal
 import talemate.instance as instance
 import talemate.util as util
 from talemate.emit import emit
-
+from talemate.events import GameLoopStartEvent
+import talemate.emit.async_signals
+import dataclasses
+import pydantic
+import structlog

 __all__ = [
    "Agent",
    "set_processing",
 ]

+log = structlog.get_logger("talemate.agents.base")
+
+class AgentActionConfig(pydantic.BaseModel):
+    type: str
+    label: str
+    description: str = ""
+    value: Union[int, float, str, bool]
+    default_value: Union[int, float, str, bool] = None
+    max: Union[int, float, None] = None
+    min: Union[int, float, None] = None
+    step: Union[int, float, None] = None
+    scope: str = "global"
+
+class AgentAction(pydantic.BaseModel):
+    enabled: bool = True
+    label: str
+    description: str = ""
+    config: Union[dict[str, AgentActionConfig], None] = None
+    
+
 def set_processing(fn):
    """
    decorator that emits the agent status as processing while the function
@@ -45,7 +69,6 @@ class Agent(ABC):

    agent_type = "agent"
    verbose_name = None
-    
    set_processing = set_processing

    @property
@@ -59,19 +82,14 @@ class Agent(ABC):
    def verbose_name(self):
        return self.agent_type.capitalize()

-    @classmethod
-    def config_options(cls):
-        return {
-            "client": [name for name, _ in instance.client_instances()],
-        }
+

    @property
    def ready(self):
        if not getattr(self.client, "enabled", True):
            return False
        
-        
-        if self.client.current_status in ["error", "warning"]:
+        if self.client and self.client.current_status in ["error", "warning"]:
            return False
        
        return self.client is not None
@@ -79,10 +97,104 @@ class Agent(ABC):
    @property
    def status(self):
        if self.ready:
+            if not self.enabled:
+                return "disabled"
            return "idle" if getattr(self, "processing", 0) == 0 else "busy"
        else:
            return "uninitialized"

+    @property
+    def enabled(self):
+        # by default, agents are enabled, an agent class that
+        # is disableable should override this property
+        return True
+    
+    @property
+    def disable(self):
+        # by default, agents are enabled, an agent class that
+        # is disableable should override this property to 
+        # disable the agent
+        pass
+    
+    @property
+    def has_toggle(self):
+        # by default, agents do not have toggles to enable / disable
+        # an agent class that is disableable should override this property
+        return False
+    
+    @property
+    def experimental(self):
+        # by default, agents are not experimental, an agent class that
+        # is experimental should override this property
+        return False
+
+    @classmethod
+    def config_options(cls, agent=None):
+        config_options = {
+            "client": [name for name, _ in instance.client_instances()],
+            "enabled": agent.enabled if agent else True,
+            "has_toggle": agent.has_toggle if agent else False,
+            "experimental": agent.experimental if agent else False,
+        }
+        actions = getattr(agent, "actions", None)
+        
+        if actions:
+            config_options["actions"] = {k: v.model_dump() for k, v in actions.items()}
+        else:
+            config_options["actions"] = {}
+            
+        return config_options
+
+    def apply_config(self, *args, **kwargs):
+        if self.has_toggle and "enabled" in kwargs:
+            self.is_enabled = kwargs.get("enabled", False)
+            
+        if not getattr(self, "actions", None):
+            return
+            
+        for action_key, action in self.actions.items():
+            
+            if not kwargs.get("actions"):
+                continue
+            
+            action.enabled = kwargs.get("actions", {}).get(action_key, {}).get("enabled", False)
+            
+            if not action.config:
+                continue
+            
+            for config_key, config in action.config.items():
+                try:
+                    config.value = kwargs.get("actions", {}).get(action_key, {}).get("config", {}).get(config_key, {}).get("value", config.value)
+                except AttributeError:
+                    pass
+    
+    async def on_game_loop_start(self, event:GameLoopStartEvent):
+        
+        """
+        Finds all ActionConfigs that have a scope of "scene" and resets them to their default values
+        """
+        
+        if not getattr(self, "actions", None):
+            return
+        
+        for _, action in self.actions.items():
+            if not action.config:
+                continue
+            
+            for _, config in action.config.items():
+                if config.scope == "scene":
+                    # if default_value is None, just use the `type` of the current 
+                    # value
+                    if config.default_value is None:
+                        default_value = type(config.value)()
+                    else:
+                        default_value = config.default_value
+                    
+                    log.debug("resetting config", config=config, default_value=default_value)
+                    config.value = default_value
+        
+        await self.emit_status()
+              
    async def emit_status(self, processing: bool = None):
        
        # should keep a count of processing requests, and when the
@@ -101,6 +213,8 @@ class Agent(ABC):
            self.processing += 1
            
        status = "busy" if self.processing > 0 else "idle"
+        if not self.enabled:
+            status = "disabled"
        
        emit(
            "agent_status",
@@ -108,13 +222,15 @@ class Agent(ABC):
            id=self.agent_type,
            status=status,
            details=self.agent_details,
-            data=self.config_options(),
+            data=self.config_options(agent=self),
        )

        await asyncio.sleep(0.01)

    def connect(self, scene):
        self.scene = scene
+        talemate.emit.async_signals.get("game_loop_start").connect(self.on_game_loop_start)
+

    def clean_result(self, result):
        if "#" in result:
@@ -159,3 +275,7 @@ class Agent(ABC):

                current_memory_context.append(memory)
        return current_memory_context
+
+@dataclasses.dataclass
+class AgentEmission:
+    agent: Agent
--- a/src/talemate/agents/conversation.py
+++ b/src/talemate/agents/conversation.py
@@ -1,24 +1,40 @@
 from __future__ import annotations
-
+import dataclasses
 import re
+import random
 from datetime import datetime
-from typing import TYPE_CHECKING, Optional
+from typing import TYPE_CHECKING, Optional, Union

 import talemate.client as client
+import talemate.instance as instance
 import talemate.util as util
 import structlog
 from talemate.emit import emit
+import talemate.emit.async_signals
 from talemate.scene_message import CharacterMessage, DirectorMessage
 from talemate.prompts import Prompt
+from talemate.events import GameLoopEvent
+from talemate.client.context import set_conversation_context_attribute, client_context_attribute, set_client_context_attribute

-from .base import Agent, set_processing
+from .base import Agent, AgentEmission, set_processing, AgentAction, AgentActionConfig
 from .registry import register

 if TYPE_CHECKING:
-    from talemate.tale_mate import Character, Scene
+    from talemate.tale_mate import Character, Scene, Actor
    
 log = structlog.get_logger("talemate.agents.conversation")

+@dataclasses.dataclass
+class ConversationAgentEmission(AgentEmission):
+    actor: Actor
+    character: Character
+    generation: list[str]
+    
+talemate.emit.async_signals.register(
+    "agent.conversation.before_generate",
+    "agent.conversation.generated"
+)
+
@register()
 class ConversationAgent(Agent):
    """
@@ -44,7 +60,242 @@ class ConversationAgent(Agent):
        self.logging_enabled = logging_enabled
        self.logging_date = datetime.now().strftime("%Y-%m-%d_%H-%M-%S")
        self.current_memory_context = None
+        
+        # several agents extend this class, but we only want to initialize
+        # these actions for the conversation agent
+        
+        if self.agent_type != "conversation":
+            return
+        
+        self.actions = {
+            "generation_override": AgentAction(
+                enabled = True,
+                label = "Generation Override",
+                description = "Override generation parameters",
+                config = {
+                    "length": AgentActionConfig(
+                        type="number",
+                        label="Generation Length (tokens)",
+                        description="Maximum number of tokens to generate for a conversation response.",
+                        value=96, 
+                        min=32,
+                        max=512,
+                        step=32,
+                    ),#
+                    "instructions": AgentActionConfig(
+                        type="text",
+                        label="Instructions",
+                        value="1-3 sentences.",
+                        description="Extra instructions to give the AI for dialog generatrion.",
+                    ),
+                    "jiggle": AgentActionConfig(
+                        type="number",
+                        label="Jiggle",
+                        description="If > 0.0 will cause certain generation parameters to have a slight random offset applied to them. The bigger the number, the higher the potential offset.",
+                        value=0.0,
+                        min=0.0,
+                        max=1.0,
+                        step=0.1,
+                    ),
+                }
+            ),
+            "natural_flow": AgentAction(
+                enabled = True,
+                label = "Natural Flow",
+                description = "Will attempt to generate a more natural flow of conversation between multiple characters.",
+                config = {
+                    "max_auto_turns": AgentActionConfig(
+                        type="number",
+                        label="Max. Auto Turns",
+                        description="The maximum number of turns the AI is allowed to generate before it stops and waits for the player to respond.",
+                        value=4, 
+                        min=1,
+                        max=100,
+                        step=1,
+                    ),
+                    "max_idle_turns": AgentActionConfig(
+                        type="number",
+                        label="Max. Idle Turns",
+                        description="The maximum number of turns a character can go without speaking before they are considered overdue to speak.",
+                        value=8, 
+                        min=1,
+                        max=100,
+                        step=1,
+                    ),
+                }
+            ),
+            "use_long_term_memory": AgentAction(
+                enabled = True,
+                label = "Long Term Memory",
+                description = "Will augment the conversation prompt with long term memory.",
+                config = {
+                    "ai_selected": AgentActionConfig(
+                        type="bool",
+                        label="AI Selected",
+                        description="If enabled, the AI will select the long term memory to use. (will increase how long it takes to generate a response)",
+                        value=False,
+                    ),
+                }
+            ),  
+        }

+    def connect(self, scene):
+        super().connect(scene)
+        talemate.emit.async_signals.get("game_loop").connect(self.on_game_loop)
+
+    def last_spoken(self):
+        
+        """
+        Returns the last time each character spoke
+        """
+        
+        last_turn = {}
+        turns = 0
+        character_names = self.scene.character_names
+        max_idle_turns = self.actions["natural_flow"].config["max_idle_turns"].value
+        
+        for idx in range(len(self.scene.history) - 1, -1, -1):
+            
+            if isinstance(self.scene.history[idx], CharacterMessage):
+                
+                if turns >= max_idle_turns:
+                    break
+                
+                character = self.scene.history[idx].character_name
+                
+                if character in character_names:
+                    last_turn[character] = turns
+                    character_names.remove(character)
+                    
+                if not character_names:
+                    break
+                
+                turns += 1
+        
+        if character_names and turns >= max_idle_turns:
+            for character in character_names:
+                last_turn[character] = max_idle_turns        
+
+        return last_turn
+        
+    def repeated_speaker(self):
+        """
+        Counts the amount of times the most recent speaker has spoken in a row
+        """
+        character_name = None
+        count = 0
+        for idx in range(len(self.scene.history) - 1, -1, -1):
+            if isinstance(self.scene.history[idx], CharacterMessage):
+                if character_name is None:
+                    character_name = self.scene.history[idx].character_name
+                if self.scene.history[idx].character_name == character_name:
+                    count += 1
+                else:
+                    break
+        return count
+                
+    async def on_game_loop(self, event:GameLoopEvent):
+        await self.apply_natural_flow()
+    
+    async def apply_natural_flow(self):
+        """
+        If the natural flow action is enabled, this will attempt to determine
+        the ideal character to talk next.
+        
+        This will let the AI pick a character to talk to, but if the AI can't figure
+        it out it will apply rules based on max_idle_turns and max_auto_turns.
+        
+        If all fails it will just pick a random character.
+        
+        Repetition is also taken into account, so if a character has spoken twice in a row
+        they will not be picked again until someone else has spoken.
+        """
+        
+        scene = self.scene
+        if self.actions["natural_flow"].enabled and len(scene.character_names) > 2:
+            
+            # last time each character spoke (turns ago)
+            max_idle_turns = self.actions["natural_flow"].config["max_idle_turns"].value
+            max_auto_turns = self.actions["natural_flow"].config["max_auto_turns"].value
+            last_turn = self.last_spoken()
+            last_turn_player = last_turn.get(scene.get_player_character().name, 0)
+            
+            if last_turn_player >= max_auto_turns:
+                self.scene.next_actor = scene.get_player_character().name
+                log.debug("conversation_agent.natural_flow", next_actor="player", overdue=True, player_character=scene.get_player_character().name)
+                return
+            
+            log.debug("conversation_agent.natural_flow", last_turn=last_turn)
+            
+            # determine random character to talk, this will be the fallback in case
+            # the AI can't figure out who should talk next
+            
+            if scene.prev_actor:
+                
+                # we dont want to talk to the same person twice in a row
+                character_names = scene.character_names
+                character_names.remove(scene.prev_actor)
+                random_character_name = random.choice(character_names)
+            else:
+                character_names = scene.character_names                
+                # no one has talked yet, so we just pick a random character
+                
+                random_character_name = random.choice(scene.character_names)
+            
+            overdue_characters = [character for character, turn in last_turn.items() if turn >= max_idle_turns]
+            
+            if overdue_characters and self.scene.history:
+                # Pick a random character from the overdue characters
+                scene.next_actor = random.choice(overdue_characters)
+            elif scene.history:
+                scene.next_actor = None
+                
+                # AI will attempt to figure out who should talk next
+                next_actor = await self.select_talking_actor(character_names)
+                next_actor = next_actor.strip().strip('"').strip(".")
+                
+                for character_name in scene.character_names:
+                    if next_actor.lower() in character_name.lower() or character_name.lower() in next_actor.lower():
+                        scene.next_actor = character_name
+                        break
+                
+                if not scene.next_actor:
+                    # AI couldn't figure out who should talk next, so we just pick a random character
+                    log.debug("conversation_agent.natural_flow", next_actor="random", random_character_name=random_character_name)
+                    scene.next_actor = random_character_name
+                else:
+                    log.debug("conversation_agent.natural_flow", next_actor="picked", ai_next_actor=scene.next_actor)
+            else:
+                # always start with main character (TODO: configurable?)
+                player_character = scene.get_player_character()
+                log.debug("conversation_agent.natural_flow", next_actor="main_character", main_character=player_character)
+                scene.next_actor = player_character.name if player_character else random_character_name
+                
+            scene.log.debug("conversation_agent.natural_flow", next_actor=scene.next_actor)
+
+            
+            # same character cannot go thrice in a row, if this is happening, pick a random character that
+            # isnt the same as the last character
+            
+            if self.repeated_speaker() >= 2 and self.scene.prev_actor == self.scene.next_actor:
+                scene.next_actor = random.choice([c for c in scene.character_names if c != scene.prev_actor])
+                scene.log.debug("conversation_agent.natural_flow", next_actor="random (repeated safeguard)", random_character_name=scene.next_actor)
+            
+        else:
+            scene.next_actor = None
+
+    
+    @set_processing
+    async def select_talking_actor(self, character_names: list[str]=None):
+        result = await Prompt.request("conversation.select-talking-actor", self.client, "conversation_select_talking_actor", vars={
+            "scene": self.scene,
+            "max_tokens": self.client.max_token_length,
+            "character_names": character_names or self.scene.character_names,
+            "character_names_formatted": ", ".join(character_names or self.scene.character_names),
+        })
+        
+        return result
+        

    async def build_prompt_default(
        self,
@@ -71,10 +322,7 @@ class ConversationAgent(Agent):
            insert_bot_token=10
        )
        
-        memory = await self.build_prompt_default_memory(
-            scene, long_term_memory_budget, 
-            scene_and_dialogue + [f"{character.name}: {character.description}" for character in scene.get_characters()]
-        )
+        memory = await self.build_prompt_default_memory(character)

        main_character = scene.main_character.character

@@ -96,6 +344,10 @@ class ConversationAgent(Agent):
            director_message = isinstance(scene_and_dialogue[-1], DirectorMessage)
        except IndexError:
            director_message = False
+            
+        extra_instructions = ""
+        if self.actions["generation_override"].enabled:
+            extra_instructions = self.actions["generation_override"].config["instructions"].value

        prompt = Prompt.get("conversation.dialogue", vars={
            "scene": scene,
@@ -109,12 +361,13 @@ class ConversationAgent(Agent):
            "talking_character": character,
            "partial_message": char_message,
            "director_message": director_message,
+            "extra_instructions": extra_instructions,
        })
        
        return str(prompt)
    
    async def build_prompt_default_memory(
-        self, scene: Scene, budget: int, existing_context: list
+        self, character: Character
    ):
        """
        Builds long term memory for the conversation prompt
@@ -127,29 +380,35 @@ class ConversationAgent(Agent):
        Also it will only add information that is not already in the existing context.
        """

-        memory = scene.get_helper("memory").agent
-
-        if not memory:
+        if not self.actions["use_long_term_memory"].enabled:
            return []

+
        if self.current_memory_context:
            return self.current_memory_context

-        self.current_memory_context = []
+        self.current_memory_context = ""
+        

-        # feed the last 3 history message into multi_query
-        history_length = len(scene.history)
-        i = history_length - 1
-        while i >= 0 and i >= len(scene.history) - 3:
-            self.current_memory_context += await memory.multi_query(
-                [scene.history[i]],
-                filter=lambda x: x
-                not in self.current_memory_context + existing_context,
+        if self.actions["use_long_term_memory"].config["ai_selected"].value:
+            history = self.scene.context_history(min_dialogue=3, max_dialogue=15, keep_director=False, sections=False, add_archieved_history=False)
+            text = "\n".join(history)
+            world_state = instance.get_agent("world_state")
+            log.debug("conversation_agent.build_prompt_default_memory", direct=False)
+            self.current_memory_context = await world_state.analyze_text_and_extract_context(
+                text, f"continue the conversation as {character.name}"
            )
-            i -= 1

+        else:
+            history = self.scene.context_history(min_dialogue=3, max_dialogue=3, keep_director=False, sections=False, add_archieved_history=False)
+            log.debug("conversation_agent.build_prompt_default_memory", history=history, direct=True)
+            memory = instance.get_agent("memory")
+            
+            context = await memory.multi_query(history, max_tokens=500, iterate=5)
+             
+            self.current_memory_context = "\n".join(context)
+        
        return self.current_memory_context
-    

    async def build_prompt(self, character, char_message: str = ""):
        fn = self.build_prompt_default
@@ -158,31 +417,30 @@ class ConversationAgent(Agent):

    def clean_result(self, result, character):
        
-        
-        log.debug("clean result", result=result)
-        
        if "#" in result:
            result = result.split("#")[0]
        
-        result = result.replace("\n", "__LINEBREAK__").strip()
-
-        # Removes partial sentence at the end
-        result = re.sub(r"[^\.\?\!\*]+(\n|$)", "", result)
-        
        result = result.replace(" :", ":")
-        result = result.strip().strip('"').strip()
        result = result.replace("[", "*").replace("]", "*")
+        result = result.replace("(", "*").replace(")", "*")
        result = result.replace("**", "*")
-        
-        result = result.replace("__LINEBREAK__", "\n")
-
-        # if there is an uneven number of '*' add one to the end
-
-        if result.count("*") % 2 == 1:
-            result += "*"

        return result

+    def set_generation_overrides(self):
+        if not self.actions["generation_override"].enabled:
+            return
+        
+        set_conversation_context_attribute("length", self.actions["generation_override"].config["length"].value)
+            
+        if self.actions["generation_override"].config["jiggle"].value > 0.0:
+            nuke_repetition = client_context_attribute("nuke_repetition")
+            if nuke_repetition == 0.0:
+                # we only apply the agent override if some other mechanism isn't already
+                # setting the nuke_repetition value
+                nuke_repetition = self.actions["generation_override"].config["jiggle"].value
+                set_client_context_attribute("nuke_repetition", nuke_repetition)
+
    @set_processing
    async def converse(self, actor, editor=None):
        """
@@ -193,6 +451,11 @@ class ConversationAgent(Agent):
        self.current_memory_context = None

        character = actor.character
+        
+        emission = ConversationAgentEmission(agent=self, generation="", actor=actor, character=character)
+        await talemate.emit.async_signals.get("agent.conversation.before_generate").send(emission)
+        
+        self.set_generation_overrides()

        result = await self.client.send_prompt(await self.build_prompt(character))

@@ -237,10 +500,10 @@ class ConversationAgent(Agent):
        total_result = total_result.split("#")[0]

        # Removes partial sentence at the end
-        total_result = re.sub(r"[^\.\?\!\*]+(\n|$)", "", total_result)
-
-        if total_result.count("*") % 2 == 1:
-            total_result += "*"
+        total_result = util.clean_dialogue(total_result, main_name=character.name)
+        
+        # Remove "{character.name}:" - all occurences
+        total_result = total_result.replace(f"{character.name}:", "")

        # Check if total_result starts with character name, if not, prepend it
        if not total_result.startswith(character.name):
@@ -257,13 +520,15 @@ class ConversationAgent(Agent):
        )

        response_message = util.parse_messages_from_str(total_result, [character.name])
+        
+        log.info("conversation agent", result=response_message)
+            
+        emission = ConversationAgentEmission(agent=self, generation=response_message, actor=actor, character=character)
+        await talemate.emit.async_signals.get("agent.conversation.generated").send(emission)

-        if editor:
-            response_message = [
-                editor.help_edit(character, message) for message in response_message
-            ]
+        #log.info("conversation agent", generation=emission.generation)

-        messages = [CharacterMessage(message) for message in response_message]
+        messages = [CharacterMessage(message) for message in emission.generation]

        # Add message and response to conversation history
        actor.scene.push_history(messages)
--- a/src/talemate/agents/creator/init.py
+++ b/src/talemate/agents/creator/init.py
@@ -3,15 +3,16 @@ from __future__ import annotations
 import json
 import os

-from talemate.agents.conversation import ConversationAgent
+from talemate.agents.base import Agent
 from talemate.agents.registry import register
 from talemate.emit import emit
+import talemate.client as client

 from .character import CharacterCreatorMixin
 from .scenario import ScenarioCreatorMixin

@register()
-class CreatorAgent(CharacterCreatorMixin, ScenarioCreatorMixin, ConversationAgent):
+class CreatorAgent(CharacterCreatorMixin, ScenarioCreatorMixin, Agent):
    
    """
    Creates characters and scenarios and other fun stuff!
@@ -20,6 +21,13 @@ class CreatorAgent(CharacterCreatorMixin, ScenarioCreatorMixin, ConversationAgen
    agent_type = "creator"
    verbose_name = "Creator"

+    def __init__(
+        self,
+        client: client.TaleMateClient,
+        **kwargs,
+    ):
+        self.client = client
+
    def clean_result(self, result):
        if "#" in result:
            result = result.split("#")[0]
--- a/src/talemate/agents/creator/character.py
+++ b/src/talemate/agents/creator/character.py
@@ -9,6 +9,8 @@ from typing import TYPE_CHECKING, Callable
 import talemate.util as util
 from talemate.emit import emit
 from talemate.prompts import Prompt, LoopedPrompt
+from talemate.exceptions import LLMAccuracyError
+from talemate.agents.base import set_processing

 if TYPE_CHECKING:
    from talemate.tale_mate import Character
@@ -19,7 +21,11 @@ def validate(k,v):
    if k and k.lower() == "gender":
        return v.lower().strip()
    if k and k.lower() == "age":
-        return int(v.strip())
+        try:
+            return int(v.split("\n")[0].strip())
+        except (ValueError, TypeError):
+            raise LLMAccuracyError("Was unable to get a valid age from the response", model_name=None)
+            
    return v.strip().strip("\n")

 DEFAULT_CONTENT_CONTEXT="a fun and engaging adventure aimed at an adult audience."
@@ -31,6 +37,7 @@ class CharacterCreatorMixin:
    
    ## NEW
    
+    @set_processing
    async def create_character_attributes(
        self,
        character_prompt: str,
@@ -42,60 +49,55 @@ class CharacterCreatorMixin:
        predefined_attributes: dict[str, str] = dict(),
    ):
        
-        try:
-            await self.emit_status(processing=True)
-            
-            def spice(prompt, spices):
-                # generate number from 0 to 1 and if its smaller than use_spice
-                # select a random spice from the list and return it formatted
-                # in the prompt
-                if random.random() < use_spice:
-                    spice = random.choice(spices)
-                    return prompt.format(spice=spice)
-                return ""    
-            
-            # drop any empty attributes from predefined_attributes
-            
-            predefined_attributes = {k:v for k,v in predefined_attributes.items() if v}
-            
-            prompt = Prompt.get(f"creator.character-attributes-{template}", vars={
-                "character_prompt": character_prompt,
-                "template": template,
-                "spice": spice,
-                "content_context": content_context,
-                "custom_attributes": custom_attributes,
-                "character_sheet": LoopedPrompt(
-                    validate_value=validate,
-                    on_update=attribute_callback,
-                    generated=predefined_attributes,
-                ),
-            })
-            await prompt.loop(self.client, "character_sheet", kind="create_concise")
-            
-            return prompt.vars["character_sheet"].generated
        
-        finally:
-            await self.emit_status(processing=False)
+        def spice(prompt, spices):
+            # generate number from 0 to 1 and if its smaller than use_spice
+            # select a random spice from the list and return it formatted
+            # in the prompt
+            if random.random() < use_spice:
+                spice = random.choice(spices)
+                return prompt.format(spice=spice)
+            return ""    
+        
+        # drop any empty attributes from predefined_attributes
+        
+        predefined_attributes = {k:v for k,v in predefined_attributes.items() if v}
+        
+        prompt = Prompt.get(f"creator.character-attributes-{template}", vars={
+            "character_prompt": character_prompt,
+            "template": template,
+            "spice": spice,
+            "content_context": content_context,
+            "custom_attributes": custom_attributes,
+            "character_sheet": LoopedPrompt(
+                validate_value=validate,
+                on_update=attribute_callback,
+                generated=predefined_attributes,
+            ),
+        })
+        await prompt.loop(self.client, "character_sheet", kind="create_concise")
+        
+        return prompt.vars["character_sheet"].generated
+        
    
    
+    @set_processing
    async def create_character_description(
        self, 
        character:Character, 
        content_context: str = DEFAULT_CONTENT_CONTEXT,
    ):
        
-        try:
-            await self.emit_status(processing=True)
-            description = await Prompt.request(f"creator.character-description", self.client, "create", vars={
-                "character": character,
-                "content_context": content_context,
-            })
-            
-            return description.strip()
-        finally:
-            await self.emit_status(processing=False)
+        description = await Prompt.request(f"creator.character-description", self.client, "create", vars={
+            "character": character,
+            "content_context": content_context,
+        })
+        
+        return description.strip()
+
        
    
+    @set_processing
    async def create_character_details(
        self, 
        character: Character,
@@ -104,23 +106,21 @@ class CharacterCreatorMixin:
        questions: list[str] = None,
        content_context: str = DEFAULT_CONTENT_CONTEXT,
    ):
-        try:
-            await self.emit_status(processing=True)
-            prompt = Prompt.get(f"creator.character-details-{template}", vars={
-                "character_details": LoopedPrompt(
-                    validate_value=validate,
-                    on_update=detail_callback,
-                ),
-                "template": template,
-                "content_context": content_context,
-                "character": character,
-                "custom_questions": questions or [],
-            })
-            await prompt.loop(self.client, "character_details", kind="create_concise")
-            return prompt.vars["character_details"].generated
-        finally:
-            await self.emit_status(processing=False)
+        prompt = Prompt.get(f"creator.character-details-{template}", vars={
+            "character_details": LoopedPrompt(
+                validate_value=validate,
+                on_update=detail_callback,
+            ),
+            "template": template,
+            "content_context": content_context,
+            "character": character,
+            "custom_questions": questions or [],
+        })
+        await prompt.loop(self.client, "character_details", kind="create_concise")
+        return prompt.vars["character_details"].generated
+
    
+    @set_processing
    async def create_character_example_dialogue(
        self,
        character: Character,
@@ -132,64 +132,86 @@ class CharacterCreatorMixin:
        rules_callback: Callable = lambda rules: None,
    ):
        
-        try:
-            await self.emit_status(processing=True)
+        dialogue_rules = await Prompt.request(f"creator.character-dialogue-rules", self.client, "create", vars={
+            "guide": guide,
+            "character": character,
+            "examples": examples or [],
+            "content_context": content_context,
+        })
        
-            dialogue_rules = await Prompt.request(f"creator.character-dialogue-rules", self.client, "create", vars={
-                "guide": guide,
-                "character": character,
-                "examples": examples or [],
-                "content_context": content_context,
-            })
+        log.info("dialogue_rules", dialogue_rules=dialogue_rules)
+        
+        if rules_callback:
+            rules_callback(dialogue_rules)
            
-            log.info("dialogue_rules", dialogue_rules=dialogue_rules)
-            
-            if rules_callback:
-                rules_callback(dialogue_rules)
-                
-            example_dialogue_prompt = Prompt.get(f"creator.character-example-dialogue-{template}", vars={
-                "guide": guide,
-                "character": character,
-                "examples": examples or [],
-                "content_context": content_context,
-                "dialogue_rules": dialogue_rules,
-                "generated_examples": LoopedPrompt(
-                    validate_value=validate,
-                    on_update=example_callback,
-                ),
-            })
-            
-            await example_dialogue_prompt.loop(self.client, "generated_examples", kind="create")
-            
-            return example_dialogue_prompt.vars["generated_examples"].generated
-        finally:
-            await self.emit_status(processing=False)
+        example_dialogue_prompt = Prompt.get(f"creator.character-example-dialogue-{template}", vars={
+            "guide": guide,
+            "character": character,
+            "examples": examples or [],
+            "content_context": content_context,
+            "dialogue_rules": dialogue_rules,
+            "generated_examples": LoopedPrompt(
+                validate_value=validate,
+                on_update=example_callback,
+            ),
+        })
+        
+        await example_dialogue_prompt.loop(self.client, "generated_examples", kind="create")
+        
+        return example_dialogue_prompt.vars["generated_examples"].generated
+
            
            
+    @set_processing
    async def determine_content_context_for_character(
        self,
        character: Character,
    ):
        
-        try:
-            await self.emit_status(processing=True)
-            content_context = await Prompt.request(f"creator.determine-content-context", self.client, "create", vars={
-                "character": character,
-            })
-            return content_context.strip()
-        finally:
-            await self.emit_status(processing=False)
+        content_context = await Prompt.request(f"creator.determine-content-context", self.client, "create", vars={
+            "character": character,
+        })
+        return content_context.strip()
+
            
+    @set_processing
    async def determine_character_attributes(
        self,
        character: Character,
    ):
        
-        try:
-            await self.emit_status(processing=True)
-            attributes = await Prompt.request(f"creator.determine-character-attributes", self.client, "analyze_long", vars={
-                "character": character,
-            })
-            return attributes
-        finally:
-            await self.emit_status(processing=False)
+        attributes = await Prompt.request(f"creator.determine-character-attributes", self.client, "analyze_long", vars={
+            "character": character,
+        })
+        return attributes
+    
+    @set_processing
+    async def determine_character_description(
+        self,
+        character: Character,
+        text:str=""
+    ):
+        
+        description = await Prompt.request(f"creator.determine-character-description", self.client, "create", vars={
+            "character": character,
+            "scene": self.scene,
+            "text": text,
+            "max_tokens": self.client.max_token_length,
+        })
+        return description.strip()
+    
+    @set_processing
+    async def generate_character_from_text(
+        self,
+        text: str,
+        template: str,
+        content_context: str = DEFAULT_CONTENT_CONTEXT,
+    ):
+        
+        base_attributes = await self.create_character_attributes(
+            character_prompt=text,
+            template=template,
+            content_context=content_context,
+        )
+        
+        
--- a/src/talemate/agents/creator/scenario.py
+++ b/src/talemate/agents/creator/scenario.py
@@ -3,6 +3,7 @@ import re
 import random

 from talemate.prompts import Prompt
+from talemate.agents.base import set_processing

 class ScenarioCreatorMixin:
    
@@ -10,8 +11,7 @@ class ScenarioCreatorMixin:
    Adds scenario creation functionality to the creator agent
    """
    
-    ### NEW
-    
+    @set_processing
    async def create_scene_description(
        self,
        prompt:str,
@@ -29,27 +29,23 @@ class ScenarioCreatorMixin:
            
            callback (callable): A callback to call when the scene has been created.
        """
-        try:
-            await self.emit_status(processing=True)
-            scene = self.scene
+        scene = self.scene
+        
+        description = await Prompt.request(
+            "creator.scenario-description",
+            self.client,
+            "create",
+            vars={
+                "prompt": prompt,
+                "content_context": content_context,
+                "max_tokens": self.client.max_token_length,
+                "scene": scene,
+            }
+        )
+        description = description.strip()
+        
+        return description
            
-            description = await Prompt.request(
-                "creator.scenario-description",
-                self.client,
-                "create",
-                vars={
-                    "prompt": prompt,
-                    "content_context": content_context,
-                    "max_tokens": self.client.max_token_length,
-                    "scene": scene,
-                }
-            )
-            description = description.strip()
-            
-            return description
-            
-        finally:
-            await self.emit_status(processing=False)
    
    
    async def create_scene_name(
@@ -70,27 +66,21 @@ class ScenarioCreatorMixin:
                
                description (str): The description of the scene.
            """
-            try:
-                await self.emit_status(processing=True)
-                
-                scene = self.scene
-                
-                name = await Prompt.request(
-                    "creator.scenario-name",
-                    self.client,
-                    "create",
-                    vars={
-                        "prompt": prompt,
-                        "content_context": content_context,
-                        "description": description,
-                        "scene": scene,
-                    }
-                )
-                name = name.strip().strip('.!').replace('"','')
-                return name
-                
-            finally:
-                await self.emit_status(processing=False)
+            scene = self.scene
+            
+            name = await Prompt.request(
+                "creator.scenario-name",
+                self.client,
+                "create",
+                vars={
+                    "prompt": prompt,
+                    "content_context": content_context,
+                    "description": description,
+                    "scene": scene,
+                }
+            )
+            name = name.strip().strip('.!').replace('"','')
+            return name
    
    
    async def create_scene_intro(
@@ -114,25 +104,30 @@ class ScenarioCreatorMixin:
            
            name (str): The name of the scene.
        """
-        try:
-            await self.emit_status(processing=True)
-            
-            scene = self.scene
-            
-            intro = await Prompt.request(
-                "creator.scenario-intro",
-                self.client,
-                "create",
-                vars={
-                    "prompt": prompt,
-                    "content_context": content_context,
-                    "description": description,
-                    "name": name,
-                    "scene": scene,
-                }
-            )
-            intro = intro.strip()
-            return intro
-            
-        finally:
-            await self.emit_status(processing=False)
+        
+        scene = self.scene
+        
+        intro = await Prompt.request(
+            "creator.scenario-intro",
+            self.client,
+            "create",
+            vars={
+                "prompt": prompt,
+                "content_context": content_context,
+                "description": description,
+                "name": name,
+                "scene": scene,
+            }
+        )
+        intro = intro.strip()
+        return intro
+    
+    @set_processing
+    async def determine_scenario_description(
+        self,
+        text:str
+    ):
+        description = await Prompt.request(f"creator.determine-scenario-description", self.client, "analyze_long", vars={
+            "text": text,
+        })
+        return description
--- a/src/talemate/agents/director.py
+++ b/src/talemate/agents/director.py
@@ -8,13 +8,14 @@ from typing import TYPE_CHECKING, Callable, List, Optional, Union

 import talemate.util as util
 from talemate.emit import wait_for_input, emit
+import talemate.emit.async_signals
 from talemate.prompts import Prompt
 from talemate.scene_message import NarratorMessage, DirectorMessage
 from talemate.automated_action import AutomatedAction
 import talemate.automated_action as automated_action
-from .conversation import ConversationAgent
+from talemate.agents.conversation import ConversationAgentEmission
 from .registry import register
-from .base import set_processing
+from .base import set_processing, AgentAction, AgentActionConfig, Agent

 if TYPE_CHECKING:
    from talemate import Actor, Character, Player, Scene
@@ -22,350 +23,84 @@ if TYPE_CHECKING:
 log = structlog.get_logger("talemate")

@register()
-class DirectorAgent(ConversationAgent):
+class DirectorAgent(Agent):
    agent_type = "director"
    verbose_name = "Director"
    
-    def get_base_prompt(self, character: Character, budget:int):
-        return [character.description, character.base_attributes.get("scenario_context", "")] + self.scene.context_history(budget=budget, keep_director=False)
-
+    def __init__(self, client, **kwargs):
+        self.is_enabled = False
+        self.client = client
+        self.next_direct = 0
+        self.actions = {
+            "direct": AgentAction(enabled=True, label="Direct", description="Will attempt to direct the scene. Runs automatically after AI dialogue (n turns).", config={
+                "turns": AgentActionConfig(type="number", label="Turns", description="Number of turns to wait before directing the sceen", value=5, min=1, max=100, step=1),
+                "prompt": AgentActionConfig(type="text", label="Instructions", description="Instructions to the director", value="", scope="scene")
+            }),
+        }
+        
+    @property
+    def enabled(self):
+        return self.is_enabled
    
-    async def decide_action(self, character: Character, goal_override:str=None):
+    @property
+    def has_toggle(self):
+        return True
        
-        """
-        Pick an action to perform to move the story towards the current story goal
-        """
+    @property
+    def experimental(self):
+        return True
+    
+    def connect(self, scene):
+        super().connect(scene)
+        talemate.emit.async_signals.get("agent.conversation.before_generate").connect(self.on_conversation_before_generate)
        
-        current_goal = goal_override or await self.select_goal(self.scene)
-        current_goal = f"Current story goal: {current_goal}" if current_goal else current_goal
+    async def on_conversation_before_generate(self, event:ConversationAgentEmission):
+        log.info("on_conversation_before_generate", director_enabled=self.enabled)
+        if not self.enabled:
+            return
        
-        response, action_eval, prompt = await self.decide_action_analyze(character, current_goal)
-        # action_eval will hold {'narrate': N, 'direct': N, 'watch': N, ...}
-        # where N is a number, action with the highest number wins, default action is watch
-        # if there is no clear winner
+        await self.direct_scene(event.character)
        
-        watch_action = action_eval.get("watch", 0)
-        action = max(action_eval, key=action_eval.get)
+    async def direct_scene(self, character: Character):
        
-        if action_eval[action] <= watch_action:
-            action = "watch"
+        if not self.actions["direct"].enabled:
+            log.info("direct_scene", skip=True, enabled=self.actions["direct"].enabled)
+            return
        
-        log.info("decide_action", action=action, action_eval=action_eval)
+        prompt = self.actions["direct"].config["prompt"].value
        
-        return response, current_goal, action
+        if not prompt:
+            log.info("direct_scene", skip=True, prompt=prompt)
+            return
        
+        if self.next_direct % self.actions["direct"].config["turns"].value != 0 or self.next_direct == 0:
+            
+            log.info("direct_scene", skip=True, next_direct=self.next_direct)
+            self.next_direct += 1
+            return
        
-    async def decide_action_analyze(self, character: Character, goal:str):
-
-        prompt = Prompt.get("director.decide-action-analyze", vars={
+        self.next_direct = 0
+        
+        await self.direct_character(character, prompt)
+        
+    @set_processing
+    async def direct_character(self, character: Character, prompt:str):
+        
+        response = await Prompt.request("director.direct-scene", self.client, "director", vars={
            "max_tokens": self.client.max_token_length,
            "scene": self.scene,
-            "current_goal": goal,
+            "prompt": prompt,
            "character": character,
        })
-
-        response, evaluation = await prompt.send(self.client, kind="director")
        
-        log.info("question_direction", response=response)
-        return response, evaluation, prompt
-        
-    @set_processing
-    async def direct(self, character: Character, goal_override:str=None):
-        
-        analysis, current_goal, action = await self.decide_action(character, goal_override=goal_override)
-        
-        if action == "watch":
-            return None
-        
-        if action == "direct":
-            return await self.direct_character_with_self_reflection(character, analysis, goal_override=current_goal)
-        
-        if action.startswith("narrate"):
-            
-            narration_type = action.split(":")[1] 
-            
-            direct_narrative = await self.direct_narrative(analysis, narration_type=narration_type, goal=current_goal)
-            if direct_narrative:
-                narrator = self.scene.get_helper("narrator").agent
-                narrator_response = await narrator.progress_story(direct_narrative)
-                if not narrator_response:
-                    return None
-                narrator_message = NarratorMessage(narrator_response, source="progress_story")
-                self.scene.push_history(narrator_message)
-                emit("narrator", narrator_message)
-                return True
-            
-            
-    @set_processing
-    async def direct_narrative(self, analysis:str, narration_type:str="progress", goal:str=None):
-        
-        if goal is None:
-            goal = await self.select_goal(self.scene)
-        
-        prompt = Prompt.get("director.direct-narrative", vars={
-            "max_tokens": self.client.max_token_length,
-            "scene": self.scene,
-            "narration_type": narration_type,
-            "analysis": analysis,
-            "current_goal": goal,
-        })
-        
-        response = await prompt.send(self.client, kind="director")
-        response = response.strip().split("\n")[0].strip()
-                
-        if not response:
-            return None
-            
-        return response
-    
-    @set_processing
-    async def direct_character_with_self_reflection(self, character: Character, analysis:str, goal_override:str=None):
-        
-        max_retries = 3
-        num_retries = 0
-        keep_direction = False
-        response = None
-        self_reflection = None
-        
-        while num_retries < max_retries:
-            
-            response, direction_prompt = await self.direct_character(
-                character, 
-                analysis, 
-                goal_override=goal_override, 
-                previous_direction=response,
-                previous_direction_feedback=self_reflection
-            )
-            
-            keep_direction, self_reflection = await self.direct_character_self_reflect(
-                response, character, goal_override, direction_prompt
-            )
-            
-            if keep_direction:
-                break
-            
-            num_retries += 1
-            
-        log.info("direct_character_with_self_reflection", response=response, keep_direction=keep_direction)
-            
-        if not keep_direction:
-            return None
-        
-        #character_agreement = f" *{character.name} agrees with the director and progresses the story accordingly*"
-        #
-        #if "accordingly" not in response:
-        #   response += character_agreement
-        #
-        
-        #response = await self.transform_character_direction_to_inner_monologue(character, response)
-        
-        return response
-        
-    @set_processing
-    async def transform_character_direction_to_inner_monologue(self, character:Character, direction:str):
-        
-        inner_monologue = await Prompt.request(
-            "conversation.direction-to-inner-monologue",
-            self.client,
-            "conversation_long",
-            vars={
-                "max_tokens": self.client.max_token_length,
-                "scene": self.scene,
-                "character": character,
-                "director_instructions": direction,
-            }
-        )
-        
-        return inner_monologue
-        
-        
-    @set_processing
-    async def direct_character(
-        self, 
-        character: Character, 
-        analysis:str, 
-        goal_override:str=None,
-        previous_direction:str=None,
-        previous_direction_feedback:str=None,
-    ):
-        """
-        Direct the scene
-        """
-        
-        if goal_override:
-            current_goal = goal_override
-        else:
-            current_goal = await self.select_goal(self.scene)
-            
-        if current_goal and not current_goal.startswith("Current story goal: "):
-            current_goal = f"Current story goal: {current_goal}"
-        
-        prompt = Prompt.get("director.direct-character", vars={
-            "max_tokens": self.client.max_token_length,
-            "scene": self.scene,
-            "character": character,
-            "current_goal": current_goal,
-            "previous_direction": previous_direction,
-            "previous_direction_feedback": previous_direction_feedback,
-            "analysis": analysis,
-        })
-        
-        response = await prompt.send(self.client, kind="director")
        response = response.strip().split("\n")[0].strip()
        
-        log.info(
-            "direct_character", 
-            direction=response, 
-            previous_direction=previous_direction, 
-            previous_direction_feedback=previous_direction_feedback
-        )
-                
-        if not response:
-            return None
-
-        if not response.startswith(prompt.prepared_response):
-            response = prompt.prepared_response + response
-
-        return response, "\n".join(prompt.as_list[:-1])
-    
-    
-    
-    @set_processing
-    async def direct_character_self_reflect(self, direction:str, character: Character, goal:str, direction_prompt:Prompt) -> (bool, str):
+        response += f" (current story goal: {prompt})"
        
-        change_matches = ["change", "retry", "alter", "reconsider"]
+        log.info("direct_scene", response=response)
        
-        prompt = Prompt.get("director.direct-character-self-reflect", vars={
-            "direction_prompt": str(direction_prompt),
-            "direction": direction,
-            "analysis": await self.direct_character_analyze(direction, character, goal, direction_prompt),
-            "character": character,
-            "scene": self.scene,
-            "max_tokens": self.client.max_token_length,
-        })
        
-        response = await prompt.send(self.client, kind="director")
+        message = DirectorMessage(response, source=character.name)
+        emit("director", message, character=character)
        
-        parse_choice = response[len(prompt.prepared_response):].lower().split(" ")[0]
-        
-        keep = not parse_choice in change_matches
-        
-        log.info("direct_character_self_reflect", keep=keep, response=response, parsed=parse_choice)
-        
-        return keep, response
-    
-    
-    @set_processing
-    async def direct_character_analyze(self, direction:str, character: Character, goal:str, direction_prompt:Prompt):
-        
-        prompt = Prompt.get("director.direct-character-analyze", vars={
-            "direction_prompt": str(direction_prompt),
-            "direction": direction,
-            "scene": self.scene,
-            "max_tokens": self.client.max_token_length,
-            "character": character,
-        })
-        
-        analysis = await prompt.send(self.client, kind="director")
-        
-        log.info("direct_character_analyze", analysis=analysis)
-        
-        return analysis
-
-    async def select_goal(self, scene: Scene):
-        
-        if not scene.goals:
-            return ""
-        
-        if isinstance(self.scene.goal, int):
-            # fixes legacy goal format
-            self.scene.goal = self.scene.goals[self.scene.goal]
-
-        while True:
-
-            # get current goal position in goals
-        
-            current_goal = scene.goal
-            current_goal_positon = None
-            if current_goal:
-                try:
-                    current_goal_positon = self.scene.goals.index(current_goal)
-                except ValueError:
-                    pass
-            elif self.scene.goals:
-                current_goal = self.scene.goals[0]
-                current_goal_positon = 0
-            else:
-                return ""
-                
-                
-            # if current goal is set but not found, its a custom goal override
-            
-            custom_goal = (current_goal and current_goal_positon is None)
-        
-            log.info("select_goal", current_goal=current_goal, current_goal_positon=current_goal_positon, custom_goal=custom_goal)
-
-            if current_goal:
-                current_goal_met = await self.goal_analyze(current_goal)
-                
-                log.info("select_goal", current_goal_met=current_goal_met)
-                if current_goal_met is not True:
-                    return current_goal + f"\nThe goal has {current_goal_met})"
-                try:
-                    self.scene.goal = self.scene.goals[current_goal_positon + 1]
-                    continue
-                except IndexError:
-                    return ""
-            
-            else:
-                return ""
-    
-    @set_processing
-    async def goal_analyze(self, goal:str):
-        
-        prompt = Prompt.get("director.goal-analyze", vars={
-            "scene": self.scene,
-            "max_tokens": self.client.max_token_length,
-            "current_goal": goal,
-        })
-        
-        response = await prompt.send(self.client, kind="director")
-    
-        log.info("goal_analyze", response=response)
-        
-        if "not satisfied" in response.lower().strip() or "not been satisfied" in response.lower().strip():
-            goal_met = response
-        else:
-            goal_met = True
-        
-        return goal_met
-    
-    
-@automated_action.register("director", frequency=4, call_initially=True, enabled=False)
-class AutomatedDirector(automated_action.AutomatedAction):
-    """
-    Runs director.direct actions every n turns
-    """
-
-    async def action(self):
-        scene = self.scene
-        director = scene.get_helper("director")
-        
-        if not scene.active_actor or scene.active_actor.character.is_player:
-            return False
-        
-        if not director:
-            return
-        
-        director_response = await director.agent.direct(scene.active_actor.character)
-    
-        if director_response is True:
-            # director directed different agent, nothing to do
-            return
-    
-        if not director_response:
-            return
-        
-        director_message = DirectorMessage(director_response, source=scene.active_actor.character.name)
-        emit("director", director_message, character=scene.active_actor.character)
-        scene.push_history(director_message)
+        self.scene.push_history(message)
--- a/src/talemate/agents/editor.py
+++ b/src/talemate/agents/editor.py
@@ -0,0 +1,163 @@
+from __future__ import annotations
+
+import asyncio
+import traceback
+from typing import TYPE_CHECKING, Callable, List, Optional, Union
+
+import talemate.data_objects as data_objects
+import talemate.util as util
+import talemate.emit.async_signals
+from talemate.prompts import Prompt
+from talemate.scene_message import DirectorMessage, TimePassageMessage
+
+from .base import Agent, set_processing, AgentAction
+from .registry import register
+
+import structlog
+
+import time
+import re
+
+if TYPE_CHECKING:
+    from talemate.tale_mate import Actor, Character, Scene
+    from talemate.agents.conversation import ConversationAgentEmission
+
+log = structlog.get_logger("talemate.agents.editor")
+
+@register()
+class EditorAgent(Agent):
+    """
+    Editor agent
+
+    will attempt to improve the quality of dialogue
+    """
+
+    agent_type = "editor"
+    verbose_name = "Editor"
+    
+    def __init__(self, client, **kwargs):
+        self.client = client
+        self.is_enabled = True
+        self.actions = {
+            "edit_dialogue": AgentAction(enabled=False, label="Edit dialogue", description="Will attempt to improve the quality of dialogue based on the character and scene. Runs automatically after each AI dialogue."),
+            "fix_exposition": AgentAction(enabled=True, label="Fix exposition", description="Will attempt to fix exposition and emotes, making sure they are displayed in italics. Runs automatically after each AI dialogue."),
+            "add_detail": AgentAction(enabled=False, label="Add detail", description="Will attempt to add extra detail and exposition to the dialogue. Runs automatically after each AI dialogue.")
+        }
+        
+    @property
+    def enabled(self):
+        return self.is_enabled
+    
+    @property
+    def has_toggle(self):
+        return True
+        
+    @property
+    def experimental(self):
+        return True
+
+    def connect(self, scene):
+        super().connect(scene)
+        talemate.emit.async_signals.get("agent.conversation.generated").connect(self.on_conversation_generated)
+        
+    async def on_conversation_generated(self, emission:ConversationAgentEmission):
+        """
+        Called when a conversation is generated
+        """
+        
+        if not self.enabled:
+            return
+        
+        log.info("editing conversation", emission=emission)
+        
+        edited = []
+        for text in emission.generation:
+
+            
+            edit = await self.add_detail(
+                text,
+                emission.character
+            )
+            
+            edit = await self.edit_conversation(
+                edit,
+                emission.character
+            )
+            
+            edit = await self.fix_exposition(
+                edit,
+                emission.character
+            )
+            
+            edited.append(edit)
+            
+        emission.generation = edited
+        
+        
+    @set_processing
+    async def edit_conversation(self, content:str, character:Character):
+        """
+        Edits a conversation
+        """
+        
+        if not self.actions["edit_dialogue"].enabled:
+            return content
+        
+        response = await Prompt.request("editor.edit-dialogue", self.client, "edit_dialogue", vars={
+            "content": content,
+            "character": character,
+            "scene": self.scene,
+            "max_length": self.client.max_token_length
+        })
+        
+        response = response.split("[end]")[0]
+        
+        response = util.replace_exposition_markers(response)
+        response = util.clean_dialogue(response, main_name=character.name)        
+        response = util.strip_partial_sentences(response)
+        
+        return response
+        
+    @set_processing
+    async def fix_exposition(self, content:str, character:Character):
+        """
+        Edits a text to make sure all narrative exposition and emotes is encased in *
+        """
+        
+        if not self.actions["fix_exposition"].enabled:
+            return content
+        
+        #response = await Prompt.request("editor.fix-exposition", self.client, "edit_fix_exposition", vars={
+        #    "content": content,
+        #    "character": character,
+        #    "scene": self.scene,
+        #    "max_length": self.client.max_token_length
+        #})
+        
+        content = util.clean_dialogue(content, main_name=character.name)        
+        content = util.strip_partial_sentences(content)
+        content = util.ensure_dialog_format(content, talking_character=character.name)
+        
+        return content
+    
+    @set_processing
+    async def add_detail(self, content:str, character:Character):
+        """
+        Edits a text to increase its length and add extra detail and exposition
+        """
+        
+        if not self.actions["add_detail"].enabled:
+            return content
+        
+        response = await Prompt.request("editor.add-detail", self.client, "edit_add_detail", vars={
+            "content": content,
+            "character": character,
+            "scene": self.scene,
+            "max_length": self.client.max_token_length
+        })
+        
+        response = util.replace_exposition_markers(response)
+        response = util.clean_dialogue(response, main_name=character.name)        
+        response = util.strip_partial_sentences(response)
+        
+        return response
--- a/src/talemate/agents/memory.py
+++ b/src/talemate/agents/memory.py
@@ -6,8 +6,10 @@ from typing import TYPE_CHECKING, Callable, List, Optional, Union
 from chromadb.config import Settings
 import talemate.events as events
 import talemate.util as util
+from talemate.context import scene_is_loading
 from talemate.config import load_config
 import structlog
+import shutil

 try:
    import chromadb
@@ -34,8 +36,20 @@ class MemoryAgent(Agent):
    agent_type = "memory"
    verbose_name = "Long-term memory"

+    @property
+    def readonly(self):
+        
+        if scene_is_loading.get() and not getattr(self.scene, "_memory_never_persisted", False):
+            return True
+        
+        return False
+
+    @property
+    def db_name(self):
+        raise NotImplementedError()
+
    @classmethod
-    def config_options(cls):
+    def config_options(cls, agent=None):
        return {}

    def __init__(self, scene, **kwargs):
@@ -50,17 +64,24 @@ class MemoryAgent(Agent):
    def close_db(self):
        raise NotImplementedError()

-    async def add(self, text, character=None, uid=None):
+    async def count(self):
+        raise NotImplementedError()
+
+    async def add(self, text, character=None, uid=None, ts:str=None, **kwargs):
        if not text:
            return
+        if self.readonly:
+            log.debug("memory agent", status="readonly")
+            return
+        await self._add(text, character=character, uid=uid, ts=ts, **kwargs)

-        log.debug("memory add", text=text, character=character, uid=uid)
-        await self._add(text, character=character, uid=uid)
-
-    async def _add(self, text, character=None):
+    async def _add(self, text, character=None, ts:str=None, **kwargs):
        raise NotImplementedError()

    async def add_many(self, objects: list[dict]):
+        if self.readonly:
+            log.debug("memory agent", status="readonly")
+            return
        await self._add_many(objects)
    
    async def _add_many(self, objects: list[dict]):
@@ -79,7 +100,7 @@ class MemoryAgent(Agent):
        return self.db.get(id)

    def on_archive_add(self, event: events.ArchiveEvent):
-        asyncio.ensure_future(self.add(event.text, uid=event.memory_id))
+        asyncio.ensure_future(self.add(event.text, uid=event.memory_id, ts=event.ts, typ="history"))

    def on_character_state(self, event: events.CharacterStateEvent):
        asyncio.ensure_future(
@@ -132,13 +153,13 @@ class MemoryAgent(Agent):
                break
        return memory_context

-    async def query(self, query:str, max_tokens:int=1000, filter:Callable=lambda x:True):
+    async def query(self, query:str, max_tokens:int=1000, filter:Callable=lambda x:True, **where):
        """
        Get the character memory context for a given character
        """

        try:
-            return (await self.multi_query([query], max_tokens=max_tokens, filter=filter))[0]
+            return (await self.multi_query([query], max_tokens=max_tokens, filter=filter, **where))[0]
        except IndexError:
            return None

@@ -159,7 +180,7 @@ class MemoryAgent(Agent):
        memory_context = []
        for query in queries:
            i = 0
-            for memory in await self.get(formatter(query), **where):
+            for memory in await self.get(formatter(query), limit=iterate, **where):
                if memory in memory_context:
                    continue

@@ -239,26 +260,52 @@ class ChromaDBMemoryAgent(MemoryAgent):
    @property
    def USE_INSTRUCTOR(self):
        return self.embeddings == "instructor"
+    
+    @property
+    def db_name(self):
+        return getattr(self, "collection_name", "<unnamed>")
+
+    def make_collection_name(self, scene):
+        
+        if self.USE_OPENAI:
+            suffix = "-openai"
+        elif self.USE_INSTRUCTOR:
+            suffix = "-instructor"
+            model = self.config.get("chromadb").get("instructor_model", "hkunlp/instructor-xl")
+            if "xl" in model:
+                suffix += "-xl"
+            elif "large" in model:
+                suffix += "-large"
+        else:
+            suffix = ""
+        
+        return f"{scene.memory_id}-tm{suffix}"
+
+    async def count(self):
+        await asyncio.sleep(0)
+        return self.db.count()

    async def set_db(self):
        await self.emit_status(processing=True)

-        if getattr(self, "db", None):
-            try:
-                self.db.delete(where={"source": "talemate"})
-            except ValueError:
-                pass
-            await self.emit_status(processing=False)
+
+        if not getattr(self, "db_client", None):
+            log.info("chromadb agent", status="setting up db client to persistent db")
+            self.db_client = chromadb.PersistentClient(
+                settings=Settings(anonymized_telemetry=False)
+            )
+
+        openai_key = self.config.get("openai").get("api_key") or os.environ.get("OPENAI_API_KEY")
+        
+        self.collection_name = collection_name = self.make_collection_name(self.scene)
+        
+        log.info("chromadb agent", status="setting up db", collection_name=collection_name)
+        
+        if self.USE_OPENAI:
+            
+            if not openai_key:
+                raise ValueError("You must provide an the openai ai key in the config if you want to use it for chromadb embeddings")
            
-            return
-
-        log.info("chromadb agent", status="setting up db")
-
-        self.db_client = chromadb.Client(Settings(anonymized_telemetry=False))
-
-        openai_key = self.config.get("openai").get("api_key") or os.environ.get("OPENAI_API_KEY"),
-
-        if openai_key and self.USE_OPENAI:
            log.info(
                "crhomadb", status="using openai", openai_key=openai_key[:5] + "..."
            )
@@ -267,7 +314,7 @@ class ChromaDBMemoryAgent(MemoryAgent):
                model_name="text-embedding-ada-002",
            )
            self.db = self.db_client.get_or_create_collection(
-                "talemate-story", embedding_function=openai_ef
+                collection_name, embedding_function=openai_ef
            )
        elif self.USE_INSTRUCTOR:
            
@@ -281,46 +328,91 @@ class ChromaDBMemoryAgent(MemoryAgent):
                model_name=instructor_model, device=instructor_device
            )
            
+            log.info("chromadb", status="embedding function ready")
+            
            self.db = self.db_client.get_or_create_collection(
-                "talemate-story", embedding_function=ef
+                collection_name, embedding_function=ef
            )
+            
+            log.info("chromadb", status="instructor db ready")
        else:
            log.info("chromadb", status="using default embeddings")
-            self.db = self.db_client.get_or_create_collection("talemate-story")
+            self.db = self.db_client.get_or_create_collection(collection_name)
+        
+        self.scene._memory_never_persisted = self.db.count() == 0

        await self.emit_status(processing=False)
        log.info("chromadb agent", status="db ready")

-    def close_db(self):
+    def clear_db(self):
        if not self.db:
            return
        
+        log.info("chromadb agent", status="clearing db", collection_name=self.collection_name)
+        
+        self.db.delete(where={"source": "talemate"})
+        
+    def drop_db(self):
+        if not self.db:
+            return
+        
+        log.info("chromadb agent", status="dropping db", collection_name=self.collection_name)
+        
        try:
-            self.db.delete(where={"source": "talemate"})
-        except ValueError:
-            pass
+            self.db_client.delete_collection(self.collection_name)
+        except ValueError as exc:
+            if "Collection not found" not in str(exc):
+                raise

-    async def _add(self, text, character=None, uid=None):
+    def close_db(self, scene):
+        if not self.db:
+            return
+        
+        log.info("chromadb agent", status="closing db", collection_name=self.collection_name)
+        
+        if not scene.saved:
+            # scene was never saved so we can discard the memory
+            collection_name = self.make_collection_name(scene)
+            log.info("chromadb agent", status="discarding memory", collection_name=collection_name)
+            try:
+                self.db_client.delete_collection(collection_name)
+            except ValueError as exc:
+                if "Collection not found" not in str(exc):
+                    raise
+        
+        self.db = None
+        
+    async def _add(self, text, character=None, uid=None, ts:str=None, **kwargs):
        metadatas = []
        ids = []

        await self.emit_status(processing=True)

        if character:
-            metadatas.append({"character": character.name, "source": "talemate"})
+            meta = {"character": character.name, "source": "talemate"}
+            if ts:
+                meta["ts"] = ts
+            meta.update(kwargs)
+            metadatas.append(meta)
            self.memory_tracker.setdefault(character.name, 0)
            self.memory_tracker[character.name] += 1
            id = uid or f"{character.name}-{self.memory_tracker[character.name]}"
            ids = [id]
        else:
-            metadatas.append({"character": "__narrator__", "source": "talemate"})
+            meta = {"character": "__narrator__", "source": "talemate"}
+            if ts:
+                meta["ts"] = ts
+            meta.update(kwargs)
+            metadatas.append(meta)
            self.memory_tracker.setdefault("__narrator__", 0)
            self.memory_tracker["__narrator__"] += 1
            id = uid or f"__narrator__-{self.memory_tracker['__narrator__']}"
            ids = [id]

-        self.db.upsert(documents=[text], metadatas=metadatas, ids=ids)
+        log.debug("chromadb agent add", text=text, meta=meta, id=id)

+        self.db.upsert(documents=[text], metadatas=metadatas, ids=ids)
+        
        await self.emit_status(processing=False)

    async def _add_many(self, objects: list[dict]):
@@ -341,12 +433,11 @@ class ChromaDBMemoryAgent(MemoryAgent):
            metadatas.append(meta)
            uid = obj.get("id", f"{character}-{self.memory_tracker[character]}")
            ids.append(uid)
-
        self.db.upsert(documents=documents, metadatas=metadatas, ids=ids)

        await self.emit_status(processing=False)

-    async def _get(self, text, character=None, **kwargs):
+    async def _get(self, text, character=None, limit:int=15, **kwargs):
        await self.emit_status(processing=True)

        where = {}
@@ -370,23 +461,40 @@ class ChromaDBMemoryAgent(MemoryAgent):

        #log.debug("crhomadb agent get", text=text, where=where)

-        _results = self.db.query(query_texts=[text], where=where)
-
+        _results = self.db.query(query_texts=[text], where=where, n_results=limit)
+        
+        #import json
+        #print(json.dumps(_results["ids"], indent=2))
+        #print(json.dumps(_results["distances"], indent=2))
+        
        results = []

        for i in range(len(_results["distances"][0])):
-            await asyncio.sleep(0.001)
            distance = _results["distances"][0][i]
+            
+            doc = _results["documents"][0][i]
+            meta = _results["metadatas"][0][i]
+            ts = meta.get("ts")
+            
            if distance < 1:
-                results.append(_results["documents"][0][i])
+                
+                try:
+                    date_prefix = util.iso8601_diff_to_human(ts, self.scene.ts)
+                except Exception:
+                    log.error("chromadb agent", error="failed to get date prefix", ts=ts, scene_ts=self.scene.ts)
+                    date_prefix = None
+                    
+                if date_prefix:
+                    doc = f"{date_prefix}: {doc}"
+                results.append(doc)
            else:
                break

            # log.debug("crhomadb agent get", result=results[-1], distance=distance)

-            if len(results) > 10:
+            if len(results) > limit:
                break

        await self.emit_status(processing=False)
-
+        
        return results
--- a/src/talemate/agents/narrator.py
+++ b/src/talemate/agents/narrator.py
@@ -1,36 +1,146 @@
 from __future__ import annotations

-import asyncio
-import re
 from typing import TYPE_CHECKING, Callable, List, Optional, Union
-
+import structlog
+import random
 import talemate.util as util
-from talemate.emit import wait_for_input
+from talemate.emit import emit
+import talemate.emit.async_signals
 from talemate.prompts import Prompt
-from talemate.agents.base import set_processing
+from talemate.agents.base import set_processing, Agent, AgentAction, AgentActionConfig
+from talemate.agents.world_state import TimePassageEmission
+from talemate.scene_message import NarratorMessage
+from talemate.events import GameLoopActorIterEvent
+import talemate.client as client

-from .conversation import ConversationAgent
 from .registry import register

+if TYPE_CHECKING:
+    from talemate.tale_mate import Actor, Player, Character

-
+log = structlog.get_logger("talemate.agents.narrator")

@register()
-class NarratorAgent(ConversationAgent):
+class NarratorAgent(Agent):
+    
+    """
+    Handles narration of the story
+    """
+    
    agent_type = "narrator"
    verbose_name = "Narrator"
-
+    
+    def __init__(
+        self,
+        client: client.TaleMateClient,
+        **kwargs,
+    ):
+        self.client = client
+        
+        # agent actions
+        
+        self.actions = {
+            "narrate_time_passage": AgentAction(enabled=True, label="Narrate Time Passage", description="Whenever you indicate passage of time, narrate right after"),
+            "narrate_dialogue": AgentAction(
+                enabled=True, 
+                label="Narrate Dialogue", 
+                description="Narrator will get a chance to narrate after every line of dialogue",
+                config = {
+                    "ai_dialog": AgentActionConfig(
+                        type="number",
+                        label="AI Dialogue", 
+                        description="Chance to narrate after every line of dialogue, 1 = always, 0 = never",
+                        value=0.3,
+                        min=0.0,
+                        max=1.0,
+                        step=0.1,
+                    ),
+                    "player_dialog": AgentActionConfig(
+                        type="number",
+                        label="Player Dialogue", 
+                        description="Chance to narrate after every line of dialogue, 1 = always, 0 = never",
+                        value=0.3,
+                        min=0.0,
+                        max=1.0,
+                        step=0.1,
+                    ),
+                }
+            ),
+        }
+        
    def clean_result(self, result):
+        
+        """
+        Cleans the result of a narration
+        """
+        
+        result = result.strip().strip(":").strip()
+        
        if "#" in result:
            result = result.split("#")[0]
+            
+        character_names = [c.name for c in self.scene.get_characters()]
+    
        
        cleaned = []
        for line in result.split("\n"):
-            if ":" in line.strip():
-                break
+            for character_name in character_names:
+                if line.startswith(f"{character_name}:"):
+                    break
            cleaned.append(line)

-        return "\n".join(cleaned)
+        result = "\n".join(cleaned)
+        #result = util.strip_partial_sentences(result)
+        return result
+
+    def connect(self, scene):
+        
+        """
+        Connect to signals
+        """
+        
+        super().connect(scene)
+        talemate.emit.async_signals.get("agent.world_state.time").connect(self.on_time_passage)
+        talemate.emit.async_signals.get("game_loop_actor_iter").connect(self.on_dialog)
+
+    async def on_time_passage(self, event:TimePassageEmission):
+        
+        """
+        Handles time passage narration, if enabled
+        """
+        
+        if not self.actions["narrate_time_passage"].enabled:
+            return
+        
+        response = await self.narrate_time_passage(event.duration, event.narrative)
+        narrator_message = NarratorMessage(response, source=f"narrate_time_passage:{event.duration};{event.narrative}")
+        emit("narrator", narrator_message)
+        self.scene.push_history(narrator_message)
+        
+    async def on_dialog(self, event:GameLoopActorIterEvent):
+        
+        """
+        Handles dialogue narration, if enabled
+        """
+        
+        if not self.actions["narrate_dialogue"].enabled:
+            return
+        
+        narrate_on_ai_chance = random.random() < self.actions["narrate_dialogue"].config["ai_dialog"].value
+        narrate_on_player_chance = random.random() < self.actions["narrate_dialogue"].config["player_dialog"].value
+        
+        log.debug("narrate on dialog", narrate_on_ai_chance=narrate_on_ai_chance, narrate_on_player_chance=narrate_on_player_chance)
+        
+        if event.actor.character.is_player and not narrate_on_player_chance:
+            return
+        
+        if not event.actor.character.is_player and not narrate_on_ai_chance:
+            return
+        
+        response = await self.narrate_after_dialogue(event.actor.character)
+        narrator_message = NarratorMessage(response, source=f"narrate_dialogue:{event.actor.character.name}")
+        emit("narrator", narrator_message)
+        self.scene.push_history(narrator_message)

    @set_processing
    async def narrate_scene(self):
@@ -48,6 +158,9 @@ class NarratorAgent(ConversationAgent):
            }
        )
        
+        response = response.strip("*")
+        response = util.strip_partial_sentences(response)
+        
        response = f"*{response.strip('*')}*"

        return response
@@ -124,8 +237,9 @@ class NarratorAgent(ConversationAgent):
                "as_narrative": as_narrative,
            }
        )
-
+        log.info("narrate_query", response=response)
        response = self.clean_result(response.strip())
+        log.info("narrate_query (after clean)", response=response)
        if as_narrative:
            response = f"*{response}*"
        
@@ -209,4 +323,55 @@ class NarratorAgent(ConversationAgent):
        answers = [a for a in answers.split("\n") if a.strip()]
        
        # return questions and answers
-        return list(zip(questions, answers))
+        return list(zip(questions, answers))
+    
+    @set_processing
+    async def narrate_time_passage(self, duration:str, narrative:str=None):
+        """
+        Narrate a specific character
+        """
+
+        response = await Prompt.request(
+            "narrator.narrate-time-passage",
+            self.client,
+            "narrate",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "duration": duration,
+                "narrative": narrative,
+            }
+        )
+        
+        log.info("narrate_time_passage", response=response)
+
+        response = self.clean_result(response.strip())
+        response = f"*{response}*"
+
+        return response
+    
+    
+    @set_processing
+    async def narrate_after_dialogue(self, character:Character):
+        """
+        Narrate after a line of dialogue
+        """
+
+        response = await Prompt.request(
+            "narrator.narrate-after-dialogue",
+            self.client,
+            "narrate",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "character": character,
+                "last_line": str(self.scene.history[-1])
+            }
+        )
+        
+        log.info("narrate_after_dialogue", response=response)
+
+        response = self.clean_result(response.strip().strip("*"))
+        response = f"*{response}*"
+
+        return response
--- a/src/talemate/agents/summarize.py
+++ b/src/talemate/agents/summarize.py
@@ -1,12 +1,13 @@
 from __future__ import annotations

 import asyncio
+import traceback
 from typing import TYPE_CHECKING, Callable, List, Optional, Union

 import talemate.data_objects as data_objects
 import talemate.util as util
 from talemate.prompts import Prompt
-from talemate.scene_message import DirectorMessage
+from talemate.scene_message import DirectorMessage, TimePassageMessage

 from .base import Agent, set_processing
 from .registry import register
@@ -14,6 +15,7 @@ from .registry import register
 import structlog

 import time
+import re

 log = structlog.get_logger("talemate.agents.summarize")

@@ -40,8 +42,18 @@ class SummarizeAgent(Agent):
        super().connect(scene)
        scene.signals["history_add"].connect(self.on_history_add)

+    def clean_result(self, result):
+        if "#" in result:
+            result = result.split("#")[0]
+
+        # Removes partial sentence at the end
+        result = re.sub(r"[^\.\?\!]+(\n|$)", "", result)
+        result = result.strip()
+
+        return result
+
    @set_processing
-    async def build_archive(self, scene):
+    async def build_archive(self, scene, token_threshold:int=1500):
        end = None

        if not scene.archived_history:
@@ -49,16 +61,37 @@ class SummarizeAgent(Agent):
            recent_entry = None
        else:
            recent_entry = scene.archived_history[-1]
-            start = recent_entry["end"] + 1
+            start = recent_entry.get("end", 0) + 1

-        token_threshold = 1300
        tokens = 0
        dialogue_entries = []
+        ts = "PT0S"
+        time_passage_termination = False
+        
+        log.debug("build_archive", start=start, recent_entry=recent_entry)
+        
+        if recent_entry:
+            ts = recent_entry.get("ts", ts)
+        
        for i in range(start, len(scene.history)):
            dialogue = scene.history[i]
            if isinstance(dialogue, DirectorMessage):
+                if i == start:
+                    start += 1
                continue
            
+            if isinstance(dialogue, TimePassageMessage):
+                log.debug("build_archive", time_passage_message=dialogue)
+                if i == start:
+                    ts = util.iso8601_add(ts, dialogue.ts)
+                    log.debug("build_archive", time_passage_message=dialogue, start=start, i=i, ts=ts)
+                    start += 1
+                    continue
+                log.debug("build_archive", time_passage_message_termination=dialogue)
+                time_passage_termination = True
+                end = i - 1
+                break
+            
            tokens += util.count_tokens(dialogue)
            dialogue_entries.append(dialogue)
            if tokens > token_threshold:  #
@@ -68,49 +101,65 @@ class SummarizeAgent(Agent):
        if end is None:
            # nothing to archive yet
            return
+        
+        log.debug("build_archive", start=start, end=end, ts=ts, time_passage_termination=time_passage_termination)

        extra_context = None
        if recent_entry:
            extra_context = recent_entry["text"]
+            
+        # in order to summarize coherently, we need to determine if there is a favorable
+        # cutoff point (e.g., the scene naturally ends or shifts meaninfully in the middle
+        # of the  dialogue)
+        #
+        # One way to do this is to check if the last line is a TimePassageMessage, which
+        # indicates a scene change or a significant pause. 
+        # 
+        # If not, we can ask the AI to find a good point of
+        # termination.
+        
+        if not time_passage_termination:
+            
+            # No TimePassageMessage, so we need to ask the AI to find a good point of termination
+            
+            terminating_line = await self.analyze_dialoge(dialogue_entries)

-        terminating_line = await self.analyze_dialoge(dialogue_entries)
+            if terminating_line:
+                adjusted_dialogue = []
+                for line in dialogue_entries:                    
+                    if str(line) in terminating_line:
+                        break
+                    adjusted_dialogue.append(line)
+                dialogue_entries = adjusted_dialogue
+                end = start + len(dialogue_entries)
+            
+        if dialogue_entries:
+            summarized = await self.summarize(
+                "\n".join(map(str, dialogue_entries)), extra_context=extra_context
+            )
+        
+        else:
+            # AI has likely identified the first line as a scene change, so we can't summarize
+            # just use the first line
+            summarized = str(scene.history[start])

-        log.debug("summarize agent build archive", terminating_line=terminating_line)
+        # determine the appropariate timestamp for the summarization

-        if terminating_line:
-            adjusted_dialogue = []
-            for line in dialogue_entries:                    
-                if str(line) in terminating_line:
-                    break
-                adjusted_dialogue.append(line)
-            dialogue_entries = adjusted_dialogue
-            end = start + len(dialogue_entries)
-
-        summarized = await self.summarize(
-            "\n".join(map(str, dialogue_entries)), extra_context=extra_context
-        )
-
-        scene.push_archive(data_objects.ArchiveEntry(summarized, start, end))
+        scene.push_archive(data_objects.ArchiveEntry(summarized, start, end, ts=ts))

        return True

    @set_processing
    async def analyze_dialoge(self, dialogue):
-        instruction = "Examine the dialogue from the beginning and find the first line that marks a scene change. Repeat the line back to me exactly as it is written"
-
-        prepare_response = "The first line that marks a scene change is: "
-
-        prompt = dialogue + ["", instruction, f"<|BOT|>{prepare_response}"]
-
-        response = await self.client.send_prompt("\n".join(map(str, prompt)), kind="summarize")
-
-        if prepare_response in response:
-            response = response.replace(prepare_response, "")
-
+        response = await Prompt.request("summarizer.analyze-dialogue", self.client, "analyze_freeform", vars={
+            "dialogue": "\n".join(map(str, dialogue)),
+            "scene": self.scene,
+            "max_tokens": self.client.max_token_length,
+        })
+        
        response = self.clean_result(response)
-
        return response
-
+        
    @set_processing
    async def summarize(
        self,
@@ -129,7 +178,7 @@ class SummarizeAgent(Agent):
            "max_tokens": self.client.max_token_length,
        })
        
-        self.scene.log.info("summarize", dialogue=text, response=response)
+        self.scene.log.info("summarize", dialogue_length=len(text), summarized_length=len(response))

        return self.clean_result(response)

@@ -150,49 +199,7 @@ class SummarizeAgent(Agent):
        return response


-    @set_processing
-    async def request_world_state(self):
-
-        t1 = time.time()
-
-        _, world_state = await Prompt.request(
-            "summarizer.request-world-state",
-            self.client,
-            "analyze",
-            vars = {
-                "scene": self.scene,
-                "max_tokens": self.client.max_token_length,
-                "object_type": "character",
-                "object_type_plural": "characters",
-            }
-        )
-        
-        self.scene.log.debug("request_world_state", response=world_state, time=time.time() - t1)
-        
-        return world_state
            
            
-    @set_processing
-    async def request_world_state_inline(self):
+    
        
-        """
-        EXPERIMENTAL, Overall the one shot request seems about as coherent as the inline request, but the inline request is is about twice as slow and would need to run on every dialogue line.
-        """
-
-        t1 = time.time()
-
-        # first, we need to get the marked items (objects etc.)
-
-        marked_items_response = await Prompt.request(
-            "summarizer.request-world-state-inline-items",
-            self.client,
-            "analyze_freeform",
-            vars = {
-                "scene": self.scene,
-                "max_tokens": self.client.max_token_length,
-            }
-        )
-        
-        self.scene.log.debug("request_world_state_inline", marked_items=marked_items_response, time=time.time() - t1)
-        
-        return marked_items_response
--- a/src/talemate/agents/world_state.py
+++ b/src/talemate/agents/world_state.py
@@ -0,0 +1,353 @@
+from __future__ import annotations
+import dataclasses
+
+from typing import TYPE_CHECKING, Callable, List, Optional, Union
+
+import talemate.emit.async_signals
+import talemate.util as util
+from talemate.prompts import Prompt
+from talemate.scene_message import DirectorMessage, TimePassageMessage
+from talemate.emit import emit
+from talemate.events import GameLoopEvent
+
+from .base import Agent, set_processing, AgentAction, AgentActionConfig, AgentEmission
+from .registry import register
+
+import structlog
+import isodate
+import time
+
+
+log = structlog.get_logger("talemate.agents.world_state")
+
+talemate.emit.async_signals.register("agent.world_state.time")
+
+@dataclasses.dataclass
+class WorldStateAgentEmission(AgentEmission):
+    """
+    Emission class for world state agent
+    """
+    pass
+
+@dataclasses.dataclass
+class TimePassageEmission(WorldStateAgentEmission):
+    """
+    Emission class for time passage
+    """
+    duration: str
+    narrative: str
+    
+
+@register()
+class WorldStateAgent(Agent):
+    """
+    An agent that handles world state related tasks.
+    """
+
+    agent_type = "world_state"
+    verbose_name = "World State"
+
+    def __init__(self, client, **kwargs):
+        self.client = client
+        self.is_enabled = True
+        self.actions = {
+            "update_world_state": AgentAction(enabled=True, label="Update world state", description="Will attempt to update the world state based on the current scene. Runs automatically after AI dialogue (n turns).", config={
+                "turns": AgentActionConfig(type="number", label="Turns", description="Number of turns to wait before updating the world state.", value=5, min=1, max=100, step=1)
+            }),
+        }
+        
+        self.next_update = 0
+        
+    @property
+    def enabled(self):
+        return self.is_enabled
+    
+    @property
+    def has_toggle(self):
+        return True
+        
+    @property
+    def experimental(self):
+        return True
+
+    def connect(self, scene):
+        super().connect(scene)
+        talemate.emit.async_signals.get("game_loop").connect(self.on_game_loop)
+
+    async def advance_time(self, duration:str, narrative:str=None):
+        """
+        Emit a time passage message
+        """
+        
+        isodate.parse_duration(duration)
+        msg_text = narrative or util.iso8601_duration_to_human(duration, suffix=" later")
+        message = TimePassageMessage(ts=duration, message=msg_text)
+        
+        log.debug("world_state.advance_time", message=message)
+        self.scene.push_history(message)
+        self.scene.emit_status()
+        
+        emit("time", message)
+        
+        await talemate.emit.async_signals.get("agent.world_state.time").send(
+            TimePassageEmission(agent=self, duration=duration, narrative=msg_text)
+        )
+        
+
+    async def on_game_loop(self, emission:GameLoopEvent):
+        """
+        Called when a conversation is generated
+        """
+        
+        if not self.enabled:
+            return
+        
+        await self.update_world_state()
+            
+
+    async def update_world_state(self):
+        if not self.enabled:
+            return
+        
+        if not self.actions["update_world_state"].enabled:
+            return
+        
+        log.debug("update_world_state", next_update=self.next_update, turns=self.actions["update_world_state"].config["turns"].value)
+        
+        scene = self.scene
+        
+        if self.next_update % self.actions["update_world_state"].config["turns"].value != 0 or self.next_update == 0:
+            self.next_update += 1
+            return
+        
+        self.next_update = 0
+        await scene.world_state.request_update()
+        
+
+    @set_processing
+    async def request_world_state(self):
+
+        t1 = time.time()
+
+        _, world_state = await Prompt.request(
+            "world_state.request-world-state-v2",
+            self.client,
+            "analyze_long",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "object_type": "character",
+                "object_type_plural": "characters",
+            }
+        )
+        
+        self.scene.log.debug("request_world_state", response=world_state, time=time.time() - t1)
+        
+        return world_state
+    
+
+    @set_processing
+    async def request_world_state_inline(self):
+        
+        """
+        EXPERIMENTAL, Overall the one shot request seems about as coherent as the inline request, but the inline request is is about twice as slow and would need to run on every dialogue line.
+        """
+
+        t1 = time.time()
+
+        # first, we need to get the marked items (objects etc.)
+
+        _, marked_items_response = await Prompt.request(
+            "world_state.request-world-state-inline-items",
+            self.client,
+            "analyze_long",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+            }
+        )
+        
+        self.scene.log.debug("request_world_state_inline", marked_items=marked_items_response, time=time.time() - t1)
+        
+        return marked_items_response
+
+    @set_processing
+    async def analyze_time_passage(
+        self,
+        text: str,
+    ):
+        
+        response = await Prompt.request(
+            "world_state.analyze-time-passage",
+            self.client,
+            "analyze_freeform_short",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "text": text,
+            }
+        )
+        
+        duration = response.split("\n")[0].split(" ")[0].strip()
+        
+        if not duration.startswith("P"):
+            duration = "P"+duration
+        
+        return duration
+    
+    
+    @set_processing
+    async def analyze_text_and_extract_context(
+        self,
+        text: str,
+        goal: str,
+    ):
+        
+        response = await Prompt.request(
+            "world_state.analyze-text-and-extract-context",
+            self.client,
+            "analyze_freeform",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "text": text,
+                "goal": goal,
+            }
+        )
+        
+        log.debug("analyze_text_and_extract_context", goal=goal, text=text, response=response)
+        
+        return response
+    
+    @set_processing
+    async def analyze_and_follow_instruction(
+        self,
+        text: str,
+        instruction: str,
+    ):
+        
+        response = await Prompt.request(
+            "world_state.analyze-text-and-follow-instruction",
+            self.client,
+            "analyze_freeform",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "text": text,
+                "instruction": instruction,
+            }
+        )
+        
+        log.debug("analyze_and_follow_instruction", instruction=instruction, text=text, response=response)
+        
+        return response
+
+    @set_processing
+    async def analyze_text_and_answer_question(
+        self,
+        text: str,
+        query: str,
+    ):
+        
+        response = await Prompt.request(
+            "world_state.analyze-text-and-answer-question",
+            self.client,
+            "analyze_freeform",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "text": text,
+                "query": query,
+            }
+        )
+        
+        log.debug("analyze_text_and_answer_question", query=query, text=text, response=response)
+        
+        return response
+    
+    @set_processing
+    async def identify_characters(
+        self,
+        text: str = None,
+    ):
+        
+        """
+        Attempts to identify characters in the given text.
+        """
+        
+        _, data = await Prompt.request(
+            "world_state.identify-characters",
+            self.client,
+            "analyze",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "text": text,
+            }
+        )
+        
+        log.debug("identify_characters", text=text, data=data)
+        
+        return data
+    
+    @set_processing
+    async def extract_character_sheet(
+        self,
+        name:str,
+        text:str = None,
+    ):
+        
+        """
+        Attempts to extract a character sheet from the given text.
+        """
+        
+        response = await Prompt.request(
+            "world_state.extract-character-sheet",
+            self.client,
+            "analyze_creative",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "text": text,
+                "name": name,
+            }
+        )
+        
+        # loop through each line in response and if it contains a : then extract
+        # the left side as an attribute name and the right side as the value
+        #
+        # break as soon as a non-empty line is found that doesn't contain a :
+        
+        data = {}
+        for line in response.split("\n"):
+            if not line.strip():
+                continue
+            if not ":" in line:
+                break
+            name, value = line.split(":", 1)
+            data[name.strip()] = value.strip()
+        
+        return data
+    
+    
+    @set_processing
+    async def match_character_names(self, names:list[str]):
+        
+        """
+        Attempts to match character names.
+        """
+        
+        _, response = await Prompt.request(
+            "world_state.match-character-names",
+            self.client,
+            "analyze_long",
+            vars = {
+                "scene": self.scene,
+                "max_tokens": self.client.max_token_length,
+                "names": names,
+            }
+        )
+        
+        log.debug("match_character_names", names=names, response=response)
+        
+        return response
--- a/src/talemate/client/init.py
+++ b/src/talemate/client/init.py
@@ -1,4 +1,6 @@
+import os
 from talemate.client.openai import OpenAIClient
 from talemate.client.registry import CLIENT_CLASSES, get_client_class, register
 from talemate.client.textgenwebui import TextGeneratorWebuiClient
-import talemate.client.runpod
+from talemate.client.lmstudio import LMStudioClient
+import talemate.client.runpod
--- a/src/talemate/client/base.py
+++ b/src/talemate/client/base.py
@@ -0,0 +1,349 @@
+"""
+A unified client base, based on the openai API
+"""
+import copy
+import random
+import time
+from typing import Callable
+
+import structlog
+import logging
+from openai import AsyncOpenAI
+
+from talemate.emit import emit
+import talemate.instance as instance
+import talemate.client.presets as presets
+import talemate.client.system_prompts as system_prompts
+import talemate.util as util
+from talemate.client.context import client_context_attribute
+from talemate.client.model_prompts import model_prompt
+
+
+# Set up logging level for httpx to WARNING to suppress debug logs.
+logging.getLogger('httpx').setLevel(logging.WARNING)
+
+REMOTE_SERVICES = [
+    # TODO: runpod.py should add this to the list
+    ".runpod.net"
+]
+
+STOPPING_STRINGS = ["<|im_end|>", "</s>"]
+
+class ClientBase:
+    
+    api_url: str
+    model_name: str
+    name:str = None
+    enabled: bool = True
+    current_status: str = None
+    max_token_length: int = 4096
+    randomizable_inference_parameters: list[str] = ["temperature"]
+    processing: bool = False
+    connected: bool = False
+    conversation_retries: int = 5
+
+    client_type = "base"
+
+    
+    def __init__(
+        self,
+        api_url: str = None,
+        name = None,
+        **kwargs,
+    ):
+        self.api_url = api_url
+        self.name = name or self.client_type
+        self.log = structlog.get_logger(f"client.{self.client_type}")
+        self.set_client()
+        
+    def __str__(self):
+        return f"{self.client_type}Client[{self.api_url}][{self.model_name or ''}]"
+        
+    def set_client(self):
+        self.client = AsyncOpenAI(base_url=self.api_url, api_key="sk-1111")
+
+    def prompt_template(self, sys_msg, prompt):
+        
+        """
+        Applies the appropriate prompt template for the model.
+        """
+        
+        if not self.model_name:
+            self.log.warning("prompt template not applied", reason="no model loaded")
+            return f"{sys_msg}\n{prompt}"
+        
+        return model_prompt(self.model_name, sys_msg, prompt)   
+    
+    def reconfigure(self, **kwargs):
+        
+        """
+        Reconfigures the client.
+        
+        Keyword Arguments:
+        
+        - api_url: the API URL to use
+        - max_token_length: the max token length to use
+        - enabled: whether the client is enabled
+        """
+        
+        if "api_url" in kwargs:
+            self.api_url = kwargs["api_url"]
+
+        if "max_token_length" in kwargs:
+            self.max_token_length = kwargs["max_token_length"]
+            
+        if "enabled" in kwargs:
+            self.enabled = bool(kwargs["enabled"])
+
+
+    def toggle_disabled_if_remote(self):
+        
+        """
+        If the client is targeting a remote recognized service, this
+        will disable the client.
+        """
+        
+        for service in REMOTE_SERVICES:
+            if service in self.api_url:
+                if self.enabled:
+                    self.log.warn("remote service unreachable, disabling client", client=self.name)
+                self.enabled = False
+                
+                return True
+            
+        return False
+            
+
+    def get_system_message(self, kind: str) -> str:
+        
+        """
+        Returns the appropriate system message for the given kind of generation
+        
+        Arguments:
+        
+        - kind: the kind of generation
+        """
+        
+        # TODO: make extensible
+       
+        if "narrate" in kind:
+            return system_prompts.NARRATOR
+        if "story" in kind:
+            return system_prompts.NARRATOR
+        if "director" in kind:
+            return system_prompts.DIRECTOR
+        if "create" in kind:
+            return system_prompts.CREATOR
+        if "roleplay" in kind:
+            return system_prompts.ROLEPLAY
+        if "conversation" in kind:
+            return system_prompts.ROLEPLAY
+        if "editor" in kind:
+            return system_prompts.EDITOR
+        if "world_state" in kind:
+            return system_prompts.WORLD_STATE
+        if "analyst" in kind:
+            return system_prompts.ANALYST
+        if "analyze" in kind:
+            return system_prompts.ANALYST
+       
+        return system_prompts.BASIC
+    
+    
+    def emit_status(self, processing: bool = None):
+        
+        """
+        Sets and emits the client status.
+        """
+        
+        if processing is not None:
+            self.processing = processing
+
+        if not self.enabled:
+            status = "disabled"
+            model_name = "Disabled"
+        elif not self.connected:
+            status = "error"
+            model_name = "Could not connect"
+        elif self.model_name:
+            status = "busy" if self.processing else "idle"
+            model_name = self.model_name
+        else:
+            model_name = "No model loaded"
+            status = "warning"
+        
+        status_change = status != self.current_status
+        self.current_status = status
+        
+        emit(
+            "client_status",
+            message=self.client_type,
+            id=self.name,
+            details=model_name,
+            status=status,
+        )
+
+        if status_change:
+            instance.emit_agent_status_by_client(self)
+            
+            
+    async def get_model_name(self):
+        models = await self.client.models.list()
+        try:
+            return models.data[0].id
+        except IndexError:
+            return None
+            
+    async def status(self):
+        """
+        Send a request to the API to retrieve the loaded AI model name.
+        Raises an error if no model name is returned.
+        :return: None
+        """
+        if self.processing:
+            return
+        
+        if not self.enabled:
+            self.connected = False
+            self.emit_status()
+            return
+        
+        try:
+            self.model_name = await self.get_model_name()
+        except Exception as e:
+            self.log.warning("client status error", e=e, client=self.name)
+            self.model_name = None
+            self.connected = False
+            self.toggle_disabled_if_remote()
+            self.emit_status()
+            return
+        
+        self.connected = True
+        
+        if not self.model_name or self.model_name == "None":
+            self.log.warning("client model not loaded", client=self)
+            self.emit_status()
+            return
+        
+        self.emit_status()
+        
+    
+    def generate_prompt_parameters(self, kind:str):
+        parameters = {}
+        self.tune_prompt_parameters(
+            presets.configure(parameters, kind, self.max_token_length),
+            kind
+        )
+        return parameters
+    
+    def tune_prompt_parameters(self, parameters:dict, kind:str):
+        parameters["stream"] = False
+        if client_context_attribute("nuke_repetition") > 0.0 and self.jiggle_enabled_for(kind):
+            self.jiggle_randomness(parameters, offset=client_context_attribute("nuke_repetition"))
+            
+        fn_tune_kind = getattr(self, f"tune_prompt_parameters_{kind}", None)
+        if fn_tune_kind:
+            fn_tune_kind(parameters)
+        
+    def tune_prompt_parameters_conversation(self, parameters:dict):
+        conversation_context = client_context_attribute("conversation")
+        parameters["max_tokens"] = conversation_context.get("length", 96)
+        
+        dialog_stopping_strings = [
+            f"{character}:" for character in conversation_context["other_characters"]
+        ]
+        
+        if "extra_stopping_strings" in parameters:
+            parameters["extra_stopping_strings"] += dialog_stopping_strings
+        else:
+            parameters["extra_stopping_strings"] = dialog_stopping_strings
+            
+            
+    async def generate(self, prompt:str, parameters:dict, kind:str):
+        
+        """
+        Generates text from the given prompt and parameters.
+        """
+        
+        self.log.debug("generate", prompt=prompt[:128]+" ...", parameters=parameters)
+        
+        try:
+            response = await self.client.completions.create(prompt=prompt.strip(), **parameters)
+            return response.get("choices", [{}])[0].get("text", "")
+        except Exception as e:
+            self.log.error("generate error", e=e)
+            return ""
+        
+    async def send_prompt(
+        self, prompt: str, kind: str = "conversation", finalize: Callable = lambda x: x
+    ) -> str:
+        """
+        Send a prompt to the AI and return its response.
+        :param prompt: The text prompt to send.
+        :return: The AI's response text.
+        """
+        
+        try:
+            self.emit_status(processing=True)
+            await self.status()
+
+            prompt_param = self.generate_prompt_parameters(kind)
+
+            finalized_prompt = self.prompt_template(self.get_system_message(kind), prompt).strip()
+            prompt_param = finalize(prompt_param)
+
+            token_length = self.count_tokens(finalized_prompt)
+
+            
+            time_start = time.time()
+            extra_stopping_strings = prompt_param.pop("extra_stopping_strings", [])
+
+            self.log.debug("send_prompt",  token_length=token_length, max_token_length=self.max_token_length, parameters=prompt_param)
+            response = await self.generate(finalized_prompt, prompt_param, kind)
+            
+            time_end = time.time()
+            
+            # stopping strings sometimes get appended to the end of the response anyways
+            # split the response by the first stopping string and take the first part
+            
+            
+            for stopping_string in STOPPING_STRINGS + extra_stopping_strings:
+                if stopping_string in response:
+                    response = response.split(stopping_string)[0]
+                    break
+            
+            emit("prompt_sent", data={
+                "kind": kind,
+                "prompt": finalized_prompt,
+                "response": response,
+                "prompt_tokens": token_length,
+                "response_tokens": self.count_tokens(response),
+                "time": time_end - time_start,
+            })
+            
+            return response
+        finally:
+            self.emit_status(processing=False)
+            
+    def count_tokens(self, content:str):
+        return util.count_tokens(content)
+
+    def jiggle_randomness(self, prompt_config:dict, offset:float=0.3) -> dict:
+        """
+        adjusts temperature and repetition_penalty
+        by random values using the base value as a center
+        """
+        
+        temp = prompt_config["temperature"]
+        min_offset = offset * 0.3
+        prompt_config["temperature"] = random.uniform(temp + min_offset, temp + offset)
+        
+    def jiggle_enabled_for(self, kind:str):
+        
+        if kind in ["conversation", "story"]:
+            return True
+        
+        if kind.startswith("narrate"):
+            return True
+        
+        return False
--- a/src/talemate/client/context.py
+++ b/src/talemate/client/context.py
@@ -4,6 +4,9 @@ Context managers for various client-side operations.

 from contextvars import ContextVar
 from pydantic import BaseModel, Field
+from copy import deepcopy
+
+import structlog

 __all__ = [
    'context_data',
@@ -11,6 +14,14 @@ __all__ = [
    'ContextModel',
 ]

+log = structlog.get_logger()
+
+def model_to_dict_without_defaults(model_instance):
+    model_dict = model_instance.dict()
+    for field_name, field in model_instance.__class__.__fields__.items():
+        if field.default == model_dict.get(field_name):
+            del model_dict[field_name]
+    return model_dict

 class ConversationContext(BaseModel):
    talking_character: str = None
@@ -22,9 +33,10 @@ class ContextModel(BaseModel):
    """
    nuke_repetition: float = Field(0.0, ge=0.0, le=3.0)
    conversation: ConversationContext = Field(default_factory=ConversationContext)
+    length: int = 96

 # Define the context variable as an empty dictionary
-context_data = ContextVar('context_data', default=ContextModel().dict())
+context_data = ContextVar('context_data', default=ContextModel().model_dump())

 def client_context_attribute(name, default=None):
    """
@@ -35,7 +47,23 @@ def client_context_attribute(name, default=None):
    # Return the value of the key if it exists, otherwise return the default value
    return data.get(name, default)

-
+def set_client_context_attribute(name, value):
+    """
+    Set the value of the context variable `context_data` for the given key.
+    """
+    # Get the current context data
+    data = context_data.get()
+    # Set the value of the key
+    data[name] = value
+    
+def set_conversation_context_attribute(name, value):
+    """
+    Set the value of the context variable `context_data.conversation` for the given key.
+    """
+    # Get the current context data
+    data = context_data.get()
+    # Set the value of the key
+    data["conversation"][name] = value

 class ClientContext:
    """
@@ -47,33 +75,23 @@ class ClientContext:
        Initialize the context manager with the key-value pairs to be set.
        """
        # Validate the data with the Pydantic model
-        self.values = ContextModel(**kwargs).dict()
-        self.tokens = {}
+        self.values = model_to_dict_without_defaults(ContextModel(**kwargs))

    def __enter__(self):
        """
        Set the key-value pairs to the context variable `context_data` when entering the context.
        """
        # Get the current context data
-        data = context_data.get()
-        # For each key-value pair, save the current value of the key (if it exists) and set the new value
-        for key, value in self.values.items():
-            self.tokens[key] = data.get(key, None)
-            data[key] = value
+        
+        data = deepcopy(context_data.get()) if context_data.get() else {}
+        data.update(self.values)
+        
        # Update the context data
-        context_data.set(data)
+        self.token = context_data.set(data)

    def __exit__(self, exc_type, exc_val, exc_tb):
        """
        Reset the context variable `context_data` to its previous values when exiting the context.
        """
-        # Get the current context data
-        data = context_data.get()
-        # For each key, if a previous value exists, reset it. Otherwise, remove the key
-        for key in self.values.keys():
-            if self.tokens[key] is not None:
-                data[key] = self.tokens[key]
-            else:
-                data.pop(key, None)
-        # Update the context data
-        context_data.set(data)
+        
+        context_data.reset(self.token)
--- a/src/talemate/client/lmstudio.py
+++ b/src/talemate/client/lmstudio.py
@@ -0,0 +1,56 @@
+from talemate.client.base import ClientBase
+from talemate.client.registry import register
+
+from openai import AsyncOpenAI
+
+
+@register()
+class LMStudioClient(ClientBase):
+    
+    client_type = "lmstudio"
+    conversation_retries = 5
+
+    def set_client(self):
+        self.client = AsyncOpenAI(base_url=self.api_url+"/v1", api_key="sk-1111")
+        
+    def tune_prompt_parameters(self, parameters:dict, kind:str):
+        super().tune_prompt_parameters(parameters, kind)
+        
+        keys = list(parameters.keys())
+        
+        valid_keys = ["temperature", "top_p"]
+        
+        for key in keys:
+            if key not in valid_keys:
+                del parameters[key]
+        
+        
+    async def get_model_name(self):
+        model_name = await super().get_model_name()
+
+        # model name comes back as a file path, so we need to extract the model name
+        # the path could be windows or linux so it needs to handle both backslash and forward slash
+        
+        if model_name:
+            model_name = model_name.replace("\\", "/").split("/")[-1]
+
+        return model_name
+
+    async def generate(self, prompt:str, parameters:dict, kind:str):
+        
+        """
+        Generates text from the given prompt and parameters.
+        """
+        human_message = {'role': 'user', 'content': prompt.strip()}
+        
+        self.log.debug("generate", prompt=prompt[:128]+" ...", parameters=parameters)
+        
+        try:
+            response = await self.client.chat.completions.create(
+                model=self.model_name, messages=[human_message], **parameters
+            )
+            
+            return response.choices[0].message.content
+        except Exception as e:
+            self.log.error("generate error", e=e)
+            return ""
--- a/src/talemate/client/model_prompts.py
+++ b/src/talemate/client/model_prompts.py
@@ -41,10 +41,15 @@ class ModelPrompt:
        
    def set_response(self, prompt:str, response_str:str):
        
+        prompt = prompt.strip("\n").strip()
+        
        if "<|BOT|>" in prompt:
-            prompt = prompt.replace("<|BOT|>", response_str)
+            if "\n<|BOT|>" in prompt:
+                prompt = prompt.replace("\n<|BOT|>", response_str)
+            else:
+                prompt = prompt.replace("<|BOT|>", response_str)
        else:
-            prompt = prompt + response_str
+            prompt = prompt.rstrip("\n") + response_str
            
        return prompt

--- a/src/talemate/client/openai.py
+++ b/src/talemate/client/openai.py
@@ -1,24 +1,74 @@
-import asyncio
 import os
-from typing import Callable
+import json
+from openai import AsyncOpenAI

-from langchain.chat_models import ChatOpenAI
-from langchain.schema import AIMessage, HumanMessage, SystemMessage

+from talemate.client.base import ClientBase
 from talemate.client.registry import register
 from talemate.emit import emit
 from talemate.config import load_config
 import talemate.client.system_prompts as system_prompts
 import structlog
+import tiktoken

 __all__ = [
    "OpenAIClient",
 ]
-
 log = structlog.get_logger("talemate")

+def num_tokens_from_messages(messages:list[dict], model:str="gpt-3.5-turbo-0613"):
+    """Return the number of tokens used by a list of messages."""
+    try:
+        encoding = tiktoken.encoding_for_model(model)
+    except KeyError:
+        print("Warning: model not found. Using cl100k_base encoding.")
+        encoding = tiktoken.get_encoding("cl100k_base")
+    if model in {
+        "gpt-3.5-turbo-0613",
+        "gpt-3.5-turbo-16k-0613",
+        "gpt-4-0314",
+        "gpt-4-32k-0314",
+        "gpt-4-0613",
+        "gpt-4-32k-0613",
+        "gpt-4-1106-preview",
+    }:
+        tokens_per_message = 3
+        tokens_per_name = 1
+    elif model == "gpt-3.5-turbo-0301":
+        tokens_per_message = (
+            4  # every message follows <|start|>{role/name}\n{content}<|end|>\n
+        )
+        tokens_per_name = -1  # if there's a name, the role is omitted
+    elif "gpt-3.5-turbo" in model:
+        print(
+            "Warning: gpt-3.5-turbo may update over time. Returning num tokens assuming gpt-3.5-turbo-0613."
+        )
+        return num_tokens_from_messages(messages, model="gpt-3.5-turbo-0613")
+    elif "gpt-4" in model:
+        print(
+            "Warning: gpt-4 may update over time. Returning num tokens assuming gpt-4-0613."
+        )
+        return num_tokens_from_messages(messages, model="gpt-4-0613")
+    else:
+        raise NotImplementedError(
+            f"""num_tokens_from_messages() is not implemented for model {model}. See https://github.com/openai/openai-python/blob/main/chatml.md for information on how messages are converted to tokens."""
+        )
+    num_tokens = 0
+    for message in messages:
+        num_tokens += tokens_per_message
+        for key, value in message.items():
+            if value is None:
+                continue
+            if isinstance(value, dict):
+                value = json.dumps(value)
+            num_tokens += len(encoding.encode(value))
+            if key == "name":
+                num_tokens += tokens_per_name
+    num_tokens += 3  # every reply is primed with <|start|>assistant<|message|>
+    return num_tokens
+
@register()
-class OpenAIClient:
+class OpenAIClient(ClientBase):
    """
    OpenAI client for generating text.
    """
@@ -26,14 +76,11 @@ class OpenAIClient:
    client_type = "openai"
    conversation_retries = 0

-    def __init__(self, model="gpt-3.5-turbo", **kwargs):
-        self.name = kwargs.get("name", "openai")
+    def __init__(self, model="gpt-4-1106-preview", **kwargs):
+        
        self.model_name = model
-        self.last_token_length = 0
-        self.max_token_length = 2048
-        self.processing = False
-        self.current_status = "idle"
        self.config = load_config()
+        super().__init__(**kwargs)
        
        # if os.environ.get("OPENAI_API_KEY") is not set, look in the config file
        # and set it
@@ -42,7 +89,7 @@ class OpenAIClient:
            if self.config.get("openai", {}).get("api_key"):
                os.environ["OPENAI_API_KEY"] = self.config["openai"]["api_key"]

-        self.set_client(model)
+        self.set_client()


    @property
@@ -71,47 +118,40 @@ class OpenAIClient:
            status=status,
        )

-    def set_client(self, model:str, max_token_length:int=None):
+    def set_client(self, max_token_length:int=None):

        if not self.openai_api_key:
            log.error("No OpenAI API key set")
            return
        
-        self.chat = ChatOpenAI(model=model, verbose=True)
+        model = self.model_name
+        
+        self.client = AsyncOpenAI()
        if model == "gpt-3.5-turbo":
            self.max_token_length = min(max_token_length or 4096, 4096)
        elif model == "gpt-4":
            self.max_token_length = min(max_token_length or 8192, 8192)
        elif model == "gpt-3.5-turbo-16k":
            self.max_token_length = min(max_token_length or 16384, 16384)
+        elif model == "gpt-4-1106-preview":
+            self.max_token_length = min(max_token_length or 128000, 128000)
        else:
            self.max_token_length = max_token_length or 2048
-
+            
    def reconfigure(self, **kwargs):
        if "model" in kwargs:
            self.model_name = kwargs["model"]
-            self.set_client(self.model_name, kwargs.get("max_token_length"))
+            self.set_client(kwargs.get("max_token_length"))
+
+    def count_tokens(self, content: str):
+        return num_tokens_from_messages([{"content": content}], model=self.model_name)

    async def status(self):
        self.emit_status()

-    def get_system_message(self, kind: str) -> str:
-       
-       if kind in ["narrate", "story"]:
-           return system_prompts.NARRATOR
-       if kind == "director":
-           return system_prompts.DIRECTOR
-       if kind in ["create", "creator"]:
-           return system_prompts.CREATOR
-       if kind in ["roleplay", "conversation"]:
-           return system_prompts.ROLEPLAY
-       return system_prompts.BASIC
-                  
-    async def send_prompt(
-        self, prompt: str, kind: str = "conversation", finalize: Callable = lambda x: x
-    ) -> str:
-        
-        right = ""
+
+    def prompt_template(self, system_message:str, prompt:str):
+        # only gpt-4-1106-preview supports json_object response coersion
        
        if "<|BOT|>" in prompt:
            _, right = prompt.split("<|BOT|>", 1)
@@ -120,35 +160,53 @@ class OpenAIClient:
            else:
                prompt = prompt.replace("<|BOT|>", "")
                
-        self.emit_status(processing=True)
-        await asyncio.sleep(0.1)
+        return prompt

-        sys_message = SystemMessage(content=self.get_system_message(kind))
+    def tune_prompt_parameters(self, parameters:dict, kind:str):
+        super().tune_prompt_parameters(parameters, kind)
        
-        human_message = HumanMessage(content=prompt)
+        keys = list(parameters.keys())
+        
+        valid_keys = ["temperature", "top_p"]
+        
+        for key in keys:
+            if key not in valid_keys:
+                del parameters[key]

-        log.debug("openai send", kind=kind, sys_message=sys_message)
-
-        response = self.chat([sys_message, human_message])
+    async def generate(self, prompt:str, parameters:dict, kind:str):
        
-        response = response.content
+        """
+        Generates text from the given prompt and parameters.
+        """
        
-        if right and response.startswith(right):
-            response = response[len(right):].strip()
+        # only gpt-4-1106-preview supports json_object response coersion
+        supports_json_object = self.model_name in ["gpt-4-1106-preview"]
+        right = None
+        try:
+            _, right = prompt.split("\nContinue this response: ")
+            expected_response = right.strip()
+            if expected_response.startswith("{") and supports_json_object:
+                parameters["response_format"] = {"type": "json_object"}
+        except IndexError:
+            pass
+        
+        human_message = {'role': 'user', 'content': prompt.strip()}
+        system_message = {'role': 'system', 'content': self.get_system_message(kind)}
+        
+        self.log.debug("generate", prompt=prompt[:128]+" ...", parameters=parameters)
+        
+        try:
+            response = await self.client.chat.completions.create(
+                model=self.model_name, messages=[system_message, human_message], **parameters
+            )
            
-        if kind == "conversation":
-            response = response.replace("\n", " ").strip()
-
-        log.debug("openai response", response=response)
-
-        emit("prompt_sent", data={
-            "kind": kind,
-            "prompt": prompt,
-            "response": response,
-            # TODO use tiktoken
-            "prompt_tokens": "?",
-            "response_tokens": "?",
-        })
-
-        self.emit_status(processing=False)
-        return response
+            response = response.choices[0].message.content
+            
+            if right and response.startswith(right):
+                response = response[len(right):].strip()
+                
+            return response
+                
+        except Exception as e:
+            self.log.error("generate error", e=e)
+            return ""
--- a/src/talemate/client/presets.py
+++ b/src/talemate/client/presets.py
@@ -0,0 +1,163 @@
+__all__ = [
+    "configure",
+    "set_max_tokens",
+    "set_preset",
+    "preset_for_kind",
+    "max_tokens_for_kind",
+    "PRESET_TALEMATE_CONVERSATION",
+    "PRESET_TALEMATE_CREATOR",
+    "PRESET_LLAMA_PRECISE",
+    "PRESET_DIVINE_INTELLECT",
+    "PRESET_SIMPLE_1",
+]
+
+PRESET_TALEMATE_CONVERSATION = {
+    "temperature": 0.65,
+    "top_p": 0.47,
+    "top_k": 42,
+    "repetition_penalty": 1.18,
+    "repetition_penalty_range": 2048,
+}
+
+PRESET_TALEMATE_CREATOR = {
+    "temperature": 0.7,
+    "top_p": 0.9,
+    "top_k": 20,
+    "repetition_penalty": 1.15,
+    "repetition_penalty_range": 512,
+}
+
+PRESET_LLAMA_PRECISE = {
+    'temperature': 0.7,
+    'top_p': 0.1,
+    'top_k': 40,
+    'repetition_penalty': 1.18,
+}
+
+PRESET_DIVINE_INTELLECT = {
+    'temperature': 1.31,
+    'top_p': 0.14,
+    'top_k': 49,
+    "repetition_penalty_range": 1024,
+    'repetition_penalty': 1.17,
+}
+
+PRESET_SIMPLE_1 = {
+    "temperature": 0.7,
+    "top_p": 0.9,
+    "top_k": 20,
+    "repetition_penalty": 1.15,
+}
+
+def configure(config:dict, kind:str, total_budget:int):
+    """
+    Sets the config based on the kind of text to generate.
+    """
+    set_preset(config, kind)
+    set_max_tokens(config, kind, total_budget)
+    return config
+
+def set_max_tokens(config:dict, kind:str, total_budget:int):
+    """
+    Sets the max_tokens in the config based on the kind of text to generate.
+    """
+    config["max_tokens"] = max_tokens_for_kind(kind, total_budget)
+    return config
+
+def set_preset(config:dict, kind:str):
+    """
+    Sets the preset in the config based on the kind of text to generate.
+    """
+    config.update(preset_for_kind(kind))
+
+def preset_for_kind(kind: str):
+    if kind == "conversation":
+        return PRESET_TALEMATE_CONVERSATION
+    elif kind == "conversation_old":
+        return PRESET_TALEMATE_CONVERSATION  # Assuming old conversation uses the same preset
+    elif kind == "conversation_long":
+        return PRESET_TALEMATE_CONVERSATION  # Assuming long conversation uses the same preset
+    elif kind == "conversation_select_talking_actor":
+        return PRESET_TALEMATE_CONVERSATION  # Assuming select talking actor uses the same preset
+    elif kind == "summarize":
+        return PRESET_LLAMA_PRECISE
+    elif kind == "analyze":
+        return PRESET_SIMPLE_1
+    elif kind == "analyze_creative":
+        return PRESET_DIVINE_INTELLECT
+    elif kind == "analyze_long":
+        return PRESET_SIMPLE_1  # Assuming long analysis uses the same preset as simple
+    elif kind == "analyze_freeform":
+        return PRESET_LLAMA_PRECISE
+    elif kind == "analyze_freeform_short":
+        return PRESET_LLAMA_PRECISE  # Assuming short freeform analysis uses the same preset as precise
+    elif kind == "narrate":
+        return PRESET_LLAMA_PRECISE
+    elif kind == "story":
+        return PRESET_DIVINE_INTELLECT
+    elif kind == "create":
+        return PRESET_TALEMATE_CREATOR
+    elif kind == "create_concise":
+        return PRESET_TALEMATE_CREATOR  # Assuming concise creation uses the same preset as creator
+    elif kind == "create_precise":
+        return PRESET_LLAMA_PRECISE
+    elif kind == "director":
+        return PRESET_SIMPLE_1
+    elif kind == "director_short":
+        return PRESET_SIMPLE_1  # Assuming short direction uses the same preset as simple
+    elif kind == "director_yesno":
+        return PRESET_SIMPLE_1  # Assuming yes/no direction uses the same preset as simple
+    elif kind == "edit_dialogue":
+        return PRESET_DIVINE_INTELLECT
+    elif kind == "edit_add_detail":
+        return PRESET_DIVINE_INTELLECT  # Assuming adding detail uses the same preset as divine intellect
+    elif kind == "edit_fix_exposition":
+        return PRESET_DIVINE_INTELLECT  # Assuming fixing exposition uses the same preset as divine intellect
+    else:
+        return PRESET_SIMPLE_1  # Default preset if none of the kinds match
+
+def max_tokens_for_kind(kind: str, total_budget: int):
+    if kind == "conversation":
+        return 75  # Example value, adjust as needed
+    elif kind == "conversation_old":
+        return 75  # Example value, adjust as needed
+    elif kind == "conversation_long":
+        return 300  # Example value, adjust as needed
+    elif kind == "conversation_select_talking_actor":
+        return 30  # Example value, adjust as needed
+    elif kind == "summarize":
+        return 500  # Example value, adjust as needed
+    elif kind == "analyze":
+        return 500  # Example value, adjust as needed
+    elif kind == "analyze_creative":
+        return 1024  # Example value, adjust as needed
+    elif kind == "analyze_long":
+        return 2048  # Example value, adjust as needed
+    elif kind == "analyze_freeform":
+        return 500  # Example value, adjust as needed
+    elif kind == "analyze_freeform_short":
+        return 10  # Example value, adjust as needed
+    elif kind == "narrate":
+        return 500  # Example value, adjust as needed
+    elif kind == "story":
+        return 300  # Example value, adjust as needed
+    elif kind == "create":
+        return min(1024, int(total_budget * 0.35))  # Example calculation, adjust as needed
+    elif kind == "create_concise":
+        return min(400, int(total_budget * 0.25))  # Example calculation, adjust as needed
+    elif kind == "create_precise":
+        return min(400, int(total_budget * 0.25))  # Example calculation, adjust as needed
+    elif kind == "director":
+        return min(600, int(total_budget * 0.25))  # Example calculation, adjust as needed
+    elif kind == "director_short":
+        return 25  # Example value, adjust as needed
+    elif kind == "director_yesno":
+        return 2  # Example value, adjust as needed
+    elif kind == "edit_dialogue":
+        return 100  # Example value, adjust as needed
+    elif kind == "edit_add_detail":
+        return 200  # Example value, adjust as needed
+    elif kind == "edit_fix_exposition":
+        return 1024  # Example value, adjust as needed
+    else:
+        return 150  # Default value if none of the kinds match
--- a/src/talemate/client/runpod.py
+++ b/src/talemate/client/runpod.py
@@ -67,9 +67,9 @@ def _client_bootstrap(client_type: ClientType, pod):
    id = pod["id"]
    
    if client_type == ClientType.textgen:
-        api_url = f"https://{id}-5000.proxy.runpod.net/api"
+        api_url = f"https://{id}-5000.proxy.runpod.net"
    elif client_type == ClientType.automatic1111:
-        api_url = f"https://{id}-5000.proxy.runpod.net/api"
+        api_url = f"https://{id}-5000.proxy.runpod.net"
    
    return ClientBootstrap(
        client_type=client_type,
--- a/src/talemate/client/system_prompts.py
+++ b/src/talemate/client/system_prompts.py
@@ -10,6 +10,10 @@ CREATOR = str(Prompt.get("creator.system"))

 DIRECTOR = str(Prompt.get("director.system"))

-ANALYST = str(Prompt.get("summarizer.system-analyst"))
+ANALYST = str(Prompt.get("world_state.system-analyst"))

-ANALYST_FREEFORM = str(Prompt.get("summarizer.system-analyst-freeform"))
+ANALYST_FREEFORM = str(Prompt.get("world_state.system-analyst-freeform"))
+
+EDITOR = str(Prompt.get("editor.system"))
+
+WORLD_STATE = str(Prompt.get("world_state.system-analyst"))
--- a/src/talemate/client/textgenwebui.py
+++ b/src/talemate/client/textgenwebui.py
@@ -1,650 +1,65 @@
-import asyncio
-import random
-import json
-import copy
-import structlog
-import httpx
-from abc import ABC, abstractmethod
-from typing import Callable, Union
-import logging
-import talemate.util as util
+from talemate.client.base import ClientBase, STOPPING_STRINGS
 from talemate.client.registry import register
-import talemate.client.system_prompts as system_prompts
-from talemate.emit import Emission, emit
-from talemate.client.context import client_context_attribute
-from talemate.client.model_prompts import model_prompt
-
-import talemate.instance as instance
-
-log = structlog.get_logger(__name__)
-
-__all__ = [
-    "TaleMateClient",
-    "RestApiTaleMateClient",
-    "TextGeneratorWebuiClient",
-]
-
-# Set up logging level for httpx to WARNING to suppress debug logs.
-logging.getLogger('httpx').setLevel(logging.WARNING)
-
-class DefaultContext(int):
-    pass
-
-
-PRESET_TALEMATE_LEGACY = {
-    "temperature": 0.72,
-    "top_p": 0.73,
-    "top_k": 0,
-    "top_a": 0,
-    "repetition_penalty": 1.18,
-    "repetition_penalty_range": 2048,
-    "encoder_repetition_penalty": 1,
-    #"encoder_repetition_penalty": 1.2,
-    #"no_repeat_ngram_size": 2,
-    "do_sample": True,
-    "length_penalty": 1,
-}
-
-PRESET_TALEMATE_CONVERSATION = {
-    "temperature": 0.65,
-    "top_p": 0.47,
-    "top_k": 42,
-    "typical_p": 1,
-    "top_a": 0,
-    "tfs": 1,
-    "epsilon_cutoff": 0,
-    "eta_cutoff": 0,
-    "repetition_penalty": 1.18,
-    "repetition_penalty_range": 2048,
-    "no_repeat_ngram_size": 0,
-    "penalty_alpha": 0,
-    "num_beams": 1,
-    "length_penalty": 1,
-    "min_length": 0,
-    "encoder_rep_pen": 1,
-    "do_sample": True,
-    "early_stopping": False,
-    "mirostat_mode": 0,
-    "mirostat_tau": 5,
-    "mirostat_eta": 0.1
-}
-
-PRESET_TALEMATE_CREATOR = {
-    "temperature": 0.7,
-    "top_p": 0.9,
-    "repetition_penalty": 1.15,
-    "repetition_penalty_range": 512,
-    "top_k": 20,
-    "do_sample": True,
-    "length_penalty": 1,
-}   
-
-PRESET_LLAMA_PRECISE = {
-    'temperature': 0.7,
-    'top_p': 0.1,
-    'repetition_penalty': 1.18,
-    'top_k': 40
-}
-
-PRESET_KOBOLD_GODLIKE = {
-    'temperature': 0.7,
-    'top_p': 0.5,
-    'typical_p': 0.19,
-    'repetition_penalty': 1.1,
-    "repetition_penalty_range": 1024,
-}
-
-PRESET_DEVINE_INTELLECT = {
-    'temperature': 1.31,
-    'top_p': 0.14,
-    "repetition_penalty_range": 1024,
-    'repetition_penalty': 1.17,
-    #"repetition_penalty": 1.3,
-    #"encoder_repetition_penalty": 1.2,
-    #"no_repeat_ngram_size": 2,
-    'top_k': 49,
-    "mirostat_mode": 2,
-    "mirostat_tau": 8,
-}
-
-PRESET_SIMPLE_1 = {
-    "temperature": 0.7,
-    "top_p": 0.9,
-    "repetition_penalty": 1.15,
-    "top_k": 20,
-}
-
-
-def jiggle_randomness(prompt_config:dict, offset:float=0.3) -> dict:
-    """
-    adjusts temperature and repetition_penalty
-    by random values using the base value as a center
-    """
-    
-    temp = prompt_config["temperature"]
-    rep_pen = prompt_config["repetition_penalty"]
-    
-    copied_config = copy.deepcopy(prompt_config)
-    
-    min_offset = offset * 0.3
-
-    copied_config["temperature"] = random.uniform(temp + min_offset, temp + offset)
-    copied_config["repetition_penalty"] = random.uniform(rep_pen + min_offset * 0.3, rep_pen + offset * 0.3)
-    
-    return copied_config
-    
-    
-class TaleMateClient:
-    """
-    An abstract TaleMate client that can be implemented for different communication methods with the AI.
-    """
-    def __init__(
-        self,
-        api_url: str,
-        max_token_length: Union[int, DefaultContext] = int.__new__(
-            DefaultContext, 2048
-        ),
-    ):
-        self.api_url = api_url
-        self.name = "generic_client"
-        self.model_name = None
-        self.last_token_length = 0
-        self.max_token_length = max_token_length
-        self.original_max_token_length = max_token_length
-        self.enabled = True
-        self.current_status = None
-
-    @abstractmethod
-    def send_message(self, message: dict) -> str:
-        """
-        Sends a message to the AI. Needs to be implemented by the subclass.
-        :param message: The message to be sent.
-        :return: The AI's response text.
-        """
-        pass
-
-    @abstractmethod
-    def send_prompt(self, prompt: str) -> str:
-        """
-        Sends a prompt to the AI. Needs to be implemented by the subclass.
-        :param prompt: The text prompt to send.
-        :return: The AI's response text.
-        """
-        pass
-
-    def reconfigure(self, **kwargs):
-        if "api_url" in kwargs:
-            self.api_url = kwargs["api_url"]
-
-        if "max_token_length" in kwargs:
-            self.max_token_length = kwargs["max_token_length"]
-            
-        if "enabled" in kwargs:
-            self.enabled = bool(kwargs["enabled"])
-
-    def remaining_tokens(self, context: Union[str, list]) -> int:
-        return self.max_token_length - util.count_tokens(context)
-
-
-    def prompt_template(self, sys_msg, prompt):
-        return model_prompt(self.model_name, sys_msg, prompt)
-
-class RESTTaleMateClient(TaleMateClient, ABC):
-    """
-    A RESTful TaleMate client that connects to the REST API endpoint.
-    """
-
-    async def send_message(self, message: dict, url: str) -> str:
-        """
-        Sends a message to the REST API and returns the AI's response.
-        :param message: The message to be sent.
-        :return: The AI's response text.
-        """
-        
-        try:
-            async with httpx.AsyncClient() as client:
-                response = await client.post(url, json=message, timeout=None)
-                response_data = response.json()
-                return response_data["results"][0]["text"]
-        except KeyError:
-            return response_data["results"][0]["history"]["visible"][0][-1]
+from openai import AsyncOpenAI
+import httpx
+import copy
+import random


@register()
-class TextGeneratorWebuiClient(RESTTaleMateClient):
-    """
-    Client that connects to the text-generatior-webui api
-    """
-
+class TextGeneratorWebuiClient(ClientBase):
+    
    client_type = "textgenwebui"
-    conversation_retries = 5
+    
+    def tune_prompt_parameters(self, parameters:dict, kind:str):
+        super().tune_prompt_parameters(parameters, kind)
+        parameters["stopping_strings"] = STOPPING_STRINGS + parameters.get("extra_stopping_strings", [])
+        # is this needed?
+        parameters["max_new_tokens"] = parameters["max_tokens"]

-    def __init__(self, api_url: str, max_token_length: int = 2048, **kwargs):
+    def set_client(self):
+        self.client = AsyncOpenAI(base_url=self.api_url+"/v1", api_key="sk-1111")
+    
+    async def get_model_name(self):
+        async with httpx.AsyncClient() as client:
+            response = await client.get(f"{self.api_url}/v1/internal/model/info", timeout=2)
+        if response.status_code == 404:
+            raise Exception("Could not find model info (wrong api version?)")
+        response_data = response.json()
+        model_name = response_data.get("model_name")
        
-        api_url = self.cleanup_api_url(api_url)
+        if model_name == "None":
+            model_name = None
+        
+        return model_name
+    
+    
+    async def generate(self, prompt:str, parameters:dict, kind:str):
        
-        self.api_url_base = api_url
-        api_url = f"{api_url}/v1/chat"
-        super().__init__(api_url, max_token_length=max_token_length)
-        self.model_name = None
-        self.limited_ram = False
-        self.name = kwargs.get("name", "textgenwebui")
-        self.processing = False
-        self.connected = False
-
-    def __str__(self):
-        return f"TextGeneratorWebuiClient[{self.api_url_base}][{self.model_name or ''}]"
-
-    def cleanup_api_url(self, api_url:str):
        """
-        Strips trailing / and ensures endpoint is /api
+        Generates text from the given prompt and parameters.
        """
        
-        if api_url.endswith("/"):
-            api_url = api_url[:-1]
-            
-        if not api_url.endswith("/api"):
-            api_url = api_url + "/api"
-            
-        return api_url
-
-    def reconfigure(self, **kwargs):
-        super().reconfigure(**kwargs)
-        if "api_url" in kwargs:
-            log.debug("reconfigure", api_url=kwargs["api_url"])
-            api_url = kwargs["api_url"]
-            api_url = self.cleanup_api_url(api_url)
-            self.api_url_base = api_url
-            self.api_url = api_url
+        headers = {}
+        headers["Content-Type"] = "application/json"
        
-    def toggle_disabled_if_remote(self):
+        parameters["prompt"] = prompt.strip()
        
-        remote_servies = [
-            ".runpod.net"
-        ]
-        
-        for service in remote_servies:
-            if service in self.api_url_base:
-                self.enabled = False
-                return
-
-    def emit_status(self, processing: bool = None):
-        if processing is not None:
-            self.processing = processing
-
-        if not self.enabled:
-            status = "disabled"
-            model_name = "Disabled"
-        elif not self.connected:
-            status = "error"
-            model_name = "Could not connect"
-        elif self.model_name:
-            status = "busy" if self.processing else "idle"
-            model_name = self.model_name
-        else:
-            model_name = "No model loaded"
-            status = "warning"
-            
-        status_change = status != self.current_status
-        self.current_status = status
-        
-        emit(
-            "client_status",
-            message=self.client_type,
-            id=self.name,
-            details=model_name,
-            status=status,
-        )
-
-
-        if status_change:
-            instance.emit_agent_status_by_client(self) 
-
-
-    # Add the 'status' method
-    async def status(self):
-        """
-        Send a request to the API to retrieve the loaded AI model name.
-        Raises an error if no model name is returned.
-        :return: None
-        """
-        
-        if not self.enabled:
-            self.connected = False
-            self.emit_status()
-            return
-        
-        try:
-            
-            async with httpx.AsyncClient() as client:
-                response = await client.get(f"{self.api_url_base}/v1/model", timeout=2)
-            
-        except (
-            httpx.TimeoutException,
-            httpx.NetworkError,
-        ):
-            self.model_name = None
-            self.connected = False
-            self.toggle_disabled_if_remote()
-            self.emit_status()
-            return
-
-        self.connected = True
-        
-        try:
+        async with httpx.AsyncClient() as client:
+            response = await client.post(f"{self.api_url}/v1/completions", json=parameters, timeout=None, headers=headers)
            response_data = response.json()
-            self.enabled = True
-        except json.decoder.JSONDecodeError as e:
-            self.connected = False
-            self.toggle_disabled_if_remote()
-            if not self.enabled:
-                log.warn("remote service unreachable, disabling client", name=self.name)
-            else:
-                log.error("client response error", name=self.name, e=e)
-                
-            self.emit_status()
-            return
-
-        model_name = response_data.get("result")
-
-        if not model_name or model_name == "None":
-            log.warning("client model not loaded", client=self.name)
-            self.emit_status()
-            return
-
-        model_changed = model_name != self.model_name
-
-        self.model_name = model_name
-
-        if model_changed:
-            self.auto_context_length()
-
-            log.info(f"{self} [{self.max_token_length} ctx]: ready")
-        self.emit_status()
-
-    def auto_context_length(self):
+            return response_data["choices"][0]["text"]
+        
+    def jiggle_randomness(self, prompt_config:dict, offset:float=0.3) -> dict:
        """
-        Automaticalle sets context length based on LLM
-        """
-
-        if not isinstance(self.max_token_length, DefaultContext):
-            # context length was specified manually
-            return
-
-        model_name = self.model_name.lower()
-
-        if "longchat" in model_name:
-            self.max_token_length = 16000
-        elif "8k" in model_name:
-            if not self.limited_ram or "13b" in model_name:
-                self.max_token_length = 6000
-            else:
-                self.max_token_length = 4096
-        elif "4k" in model_name:
-            self.max_token_length = 4096
-        else:
-            self.max_token_length = self.original_max_token_length
-
-    @property
-    def instruction_template(self):
-        if "vicuna" in self.model_name.lower():
-            return "Vicuna-v1.1"
-        if "camel" in self.model_name.lower():
-            return "Vicuna-v1.1"
-        return ""
-
-    def prompt_url(self):
-        return self.api_url_base + "/v1/generate"
-
-    def prompt_config_conversation_old(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.BASIC,
-            prompt,
-        )
-        
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": 75,
-            "chat_prompt_size": self.max_token_length,
-        }
-        config.update(PRESET_TALEMATE_CONVERSATION)
-        return config
-
-        
-    def prompt_config_conversation(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.ROLEPLAY,
-            prompt,
-        )
-
-        stopping_strings =  ["<|end_of_turn|>"]
-        
-        conversation_context = client_context_attribute("conversation")
-        
-        stopping_strings += [
-            f"{character}:" for character in conversation_context["other_characters"]
-        ]
-        
-        log.debug("prompt_config_conversation", stopping_strings=stopping_strings, conversation_context=conversation_context)
-
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": 75,
-            "chat_prompt_size": self.max_token_length,
-            "stopping_strings": stopping_strings,
-        }
-        config.update(PRESET_TALEMATE_CONVERSATION)
-        
-        jiggle_randomness(config)
-        
-        return config
-
-    def prompt_config_conversation_long(self, prompt: str) -> dict:
-        config = self.prompt_config_conversation(prompt)
-        config["max_new_tokens"] = 300
-        return config
-
-    def prompt_config_summarize(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.NARRATOR,
-            prompt,
-        )
-        
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": 500,
-            "chat_prompt_size": self.max_token_length,
-        }
-        
-        config.update(PRESET_LLAMA_PRECISE)
-        return config
-
-    def prompt_config_analyze(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.ANALYST,
-            prompt,
-        )
-        
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": 500,
-            "chat_prompt_size": self.max_token_length,
-        }
-        
-        config.update(PRESET_SIMPLE_1)
-        return config
-    
-    def prompt_config_analyze_long(self, prompt: str) -> dict:
-        config = self.prompt_config_analyze(prompt)
-        config["max_new_tokens"] = 1000
-        return config
-
-    def prompt_config_analyze_freeform(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.ANALYST_FREEFORM,
-            prompt,
-        )
-        
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": 500,
-            "chat_prompt_size": self.max_token_length,
-        }
-        
-        config.update(PRESET_SIMPLE_1)
-        return config
-
-
-    def prompt_config_narrate(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.NARRATOR,
-            prompt,
-        )
-
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": 500,
-            "chat_prompt_size": self.max_token_length,
-        }
-        config.update(PRESET_LLAMA_PRECISE)
-        return config
-
-    def prompt_config_story(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.NARRATOR,
-            prompt,
-        )
-        
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": 300,
-            "seed": random.randint(0, 1000000000),
-            "chat_prompt_size": self.max_token_length
-        }
-        config.update(PRESET_DEVINE_INTELLECT)
-        config.update({
-            "repetition_penalty": 1.3,
-            "repetition_penalty_range": 2048,
-        })
-        return config
-
-    def prompt_config_create(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.CREATOR,
-            prompt,
-        )
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": min(1024, self.max_token_length * 0.35),
-            "chat_prompt_size": self.max_token_length,
-        }
-        config.update(PRESET_TALEMATE_CREATOR)
-        return config
-
-    def prompt_config_create_concise(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.CREATOR,
-            prompt,
-        )
-
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": min(400, self.max_token_length * 0.25),
-            "chat_prompt_size": self.max_token_length,
-            "stopping_strings": ["<|DONE|>", "\n\n"]
-        }
-        config.update(PRESET_TALEMATE_CREATOR)
-        return config
-    
-    def prompt_config_create_precise(self, prompt: str) -> dict:
-        config = self.prompt_config_create_concise(prompt)
-        config.update(PRESET_LLAMA_PRECISE)
-        return config
-
-    def prompt_config_director(self, prompt: str) -> dict:
-        prompt = self.prompt_template(
-            system_prompts.DIRECTOR,
-            prompt,
-        )
-
-        config = {
-            "prompt": prompt,
-            "max_new_tokens": min(600, self.max_token_length * 0.25),
-            "chat_prompt_size": self.max_token_length,
-        }
-        config.update(PRESET_SIMPLE_1)
-        return config
-        
-
-    def prompt_config_director_short(self, prompt: str) -> dict:
-        config = self.prompt_config_director(prompt)
-        config.update(max_new_tokens=25)
-        return config
-    
-    def prompt_config_director_yesno(self, prompt: str) -> dict:
-        config = self.prompt_config_director(prompt)
-        config.update(max_new_tokens=2)
-        return config
-         
-
-    async def send_prompt(
-        self, prompt: str, kind: str = "conversation", finalize: Callable = lambda x: x
-    ) -> str:
-        """
-        Send a prompt to the AI and return its response.
-        :param prompt: The text prompt to send.
-        :return: The AI's response text.
+        adjusts temperature and repetition_penalty
+        by random values using the base value as a center
        """
        
-        #prompt = prompt.replace("<|BOT|>", "<|BOT|>Certainly! ")
-
-        await self.status()
-        self.emit_status(processing=True)
-
-        await asyncio.sleep(0.01)
-
-        fn_prompt_config = getattr(self, f"prompt_config_{kind}")
-        fn_url = self.prompt_url
-        message = fn_prompt_config(prompt)
+        temp = prompt_config["temperature"]
+        rep_pen = prompt_config["repetition_penalty"]
        
-        if client_context_attribute("nuke_repetition") > 0.0:
-            log.info("nuke repetition", offset=client_context_attribute("nuke_repetition"), temperature=message["temperature"], repetition_penalty=message["repetition_penalty"])
-            message = jiggle_randomness(message, offset=client_context_attribute("nuke_repetition"))
-            log.info("nuke repetition (applied)", offset=client_context_attribute("nuke_repetition"), temperature=message["temperature"], repetition_penalty=message["repetition_penalty"])
-        
-        message = finalize(message)
+        min_offset = offset * 0.3

-        token_length = int(len(message["prompt"]) / 3.6)
-
-        self.last_token_length = token_length
-        
-        log.debug("send_prompt",  token_length=token_length, max_token_length=self.max_token_length)
-        
-        message["prompt"] = message["prompt"].strip()
-
-        response = await self.send_message(message, fn_url())
-
-        response = response.split("#")[0]
-        self.emit_status(processing=False)
-        
-        emit("prompt_sent", data={
-            "kind": kind,
-            "prompt": message["prompt"],
-            "response": response,
-            "prompt_tokens": token_length,
-            "response_tokens": int(len(response) / 3.6)
-        })
-        
-        return response
-
-
-class OpenAPIClient(RESTTaleMateClient):
-    pass
-
-
-class GPT3Client(OpenAPIClient):
-    pass
-
-
-class GPT4Client(OpenAPIClient):
-    pass
+        prompt_config["temperature"] = random.uniform(temp + min_offset, temp + offset)
+        prompt_config["repetition_penalty"] = random.uniform(rep_pen + min_offset * 0.3, rep_pen + offset * 0.3)
--- a/src/talemate/client/utils.py
+++ b/src/talemate/client/utils.py
@@ -0,0 +1,32 @@
+import copy
+import random
+
+def jiggle_randomness(prompt_config:dict, offset:float=0.3) -> dict:
+    """
+    adjusts temperature and repetition_penalty
+    by random values using the base value as a center
+    """
+    
+    temp = prompt_config["temperature"]
+    rep_pen = prompt_config["repetition_penalty"]
+    
+    copied_config = copy.deepcopy(prompt_config)
+    
+    min_offset = offset * 0.3
+
+    copied_config["temperature"] = random.uniform(temp + min_offset, temp + offset)
+    copied_config["repetition_penalty"] = random.uniform(rep_pen + min_offset * 0.3, rep_pen + offset * 0.3)
+    
+    return copied_config
+    
+    
+def jiggle_enabled_for(kind:str):
+    
+    if kind in ["conversation", "story"]:
+        return True
+    
+    if kind.startswith("narrate"):
+        return True
+    
+    return False
+
--- a/src/talemate/commands/init.py
+++ b/src/talemate/commands/init.py
@@ -22,6 +22,7 @@ from .cmd_save import CmdSave
 from .cmd_save_as import CmdSaveAs
 from .cmd_save_characters import CmdSaveCharacters
 from .cmd_setenv import CmdSetEnvironmentToScene, CmdSetEnvironmentToCreative
+from .cmd_time_util import *
 from .cmd_world_state import CmdWorldState
 from .cmd_run_helios_test import CmdHeliosTest
 from .manager import Manager
--- a/src/talemate/commands/base.py
+++ b/src/talemate/commands/base.py
@@ -20,6 +20,7 @@ class TalemateCommand(Emitter, ABC):
    scene: Scene = None
    manager: CommandManager = None
    label: str = None
+    sets_scene_unsaved: bool = True

    def __init__(
        self,
--- a/src/talemate/commands/cmd_debug_tools.py
+++ b/src/talemate/commands/cmd_debug_tools.py
@@ -84,4 +84,42 @@ class CmdRunAutomatic(TalemateCommand):
            turns = 10
        
        self.emit("system", f"Making player character AI controlled for {turns} turns")
-        self.scene.get_player_character().actor.ai_controlled = turns
+        self.scene.get_player_character().actor.ai_controlled = turns
+        
+        
+
+@register
+class CmdLongTermMemoryStats(TalemateCommand):
+    """
+    Command class for the 'long_term_memory_stats' command
+    """
+    
+    name = "long_term_memory_stats"
+    description = "Show stats for the long term memory"
+    aliases = ["ltm_stats"]
+    
+    async def run(self):
+        
+        memory = self.scene.get_helper("memory").agent
+        
+        count = await memory.count()
+        db_name = memory.db_name
+        
+        self.emit("system", f"Long term memory for {self.scene.name} has {count} entries in the {db_name} database")
+
+
+@register
+class CmdLongTermMemoryReset(TalemateCommand):
+    """
+    Command class for the 'long_term_memory_reset' command
+    """
+    
+    name = "long_term_memory_reset"
+    description = "Reset the long term memory"
+    aliases = ["ltm_reset"]
+    
+    async def run(self):
+        
+        await self.scene.commit_to_memory()
+        
+        self.emit("system", f"Long term memory for {self.scene.name} has been reset")
--- a/src/talemate/commands/cmd_director.py
+++ b/src/talemate/commands/cmd_director.py
@@ -37,29 +37,15 @@ class CmdDirectorDirect(TalemateCommand):
            self.system_message(f"Character not found: {name}")
            return True
        
-        if ask_for_input:
-            goal = await wait_for_input(f"Enter a new goal for the director to direct {character.name} towards (leave empty for auto-direct): ")
-        else:
-            goal = None
-        direction = await director.agent.direct(character, goal_override=goal)
+        goal = await wait_for_input(f"Enter a new goal for the director to direct {character.name}")
        
-        if direction is None:
-            self.system_message("Director was unable to direct character at this point in the story.")
+        if not goal.strip():
+            self.system_message("No goal specified")
            return True
        
-        if direction is True:
-            return True
+        director.agent.actions["direct"].config["prompt"].value = goal
        
-        message = DirectorMessage(direction, source=character.name)
-        emit("director", message, character=character)
-        
-        # remove previous director message, starting from the end of self.history
-        for i in range(len(self.scene.history) - 1, -1, -1):
-            if isinstance(self.scene.history[i], DirectorMessage):
-                self.scene.history.pop(i)
-                break
-        
-        self.scene.push_history(message)
+        await director.agent.direct_character(character, goal)
        
@register
 class CmdDirectorDirectWithOverride(CmdDirectorDirect):
--- a/src/talemate/commands/cmd_narrate.py
+++ b/src/talemate/commands/cmd_narrate.py
@@ -28,4 +28,3 @@ class CmdNarrate(TalemateCommand):
        
        self.narrator_message(message)
        self.scene.push_history(message)
-        await asyncio.sleep(0)
--- a/src/talemate/commands/cmd_rebuild_archive.py
+++ b/src/talemate/commands/cmd_rebuild_archive.py
@@ -20,8 +20,11 @@ class CmdRebuildArchive(TalemateCommand):
        if not summarizer:
            self.system_message("No summarizer found")
            return True
-
-        self.scene.archived_history = []
+        
+        # clear out archived history, but keep pre-established history
+        self.scene.archived_history = [
+            ah for ah in self.scene.archived_history if ah.get("end") is None
+        ]

        while True:
            more = await summarizer.agent.build_archive(self.scene)
@@ -29,4 +32,4 @@ class CmdRebuildArchive(TalemateCommand):
            if not more:
                break

-        await asyncio.sleep(0)
+        await self.scene.commit_to_memory()
--- a/src/talemate/commands/cmd_rename.py
+++ b/src/talemate/commands/cmd_rename.py
@@ -17,7 +17,26 @@ class CmdRename(TalemateCommand):
    aliases = []

    async def run(self):
+        # collect list of characters in the scene
+
+        if self.args:
+            character_name = self.args[0]
+        else:
+            character_names = self.scene.character_names
+            character_name = await wait_for_input("Which character do you want to rename?", data={
+                "input_type": "select",
+                "choices": character_names,  
+            }) 
+        
+        character = self.scene.get_character(character_name)
+        
+        if not character:
+            self.system_message(f"Character {character_name} not found")
+            return True
+        
        name = await wait_for_input("Enter new name: ")

-        self.scene.main_character.character.rename(name)
+        character.rename(name)
        await asyncio.sleep(0)
+        
+        return True
--- a/src/talemate/commands/cmd_save.py
+++ b/src/talemate/commands/cmd_save.py
@@ -11,6 +11,7 @@ class CmdSave(TalemateCommand):
    name = "save"
    description = "Save the scene"
    aliases = ["s"]
+    sets_scene_unsaved = False

    async def run(self):
        await self.scene.save()
--- a/src/talemate/commands/cmd_save_as.py
+++ b/src/talemate/commands/cmd_save_as.py
@@ -13,7 +13,7 @@ class CmdSaveAs(TalemateCommand):
    name = "save_as"
    description = "Save the scene with a new name"
    aliases = ["sa"]
+    sets_scene_unsaved = False

    async def run(self):
-        self.scene.filename = ""
-        await self.scene.save()
+        await self.scene.save(save_as=True)
--- a/src/talemate/commands/cmd_time_util.py
+++ b/src/talemate/commands/cmd_time_util.py
@@ -0,0 +1,38 @@
+"""
+Commands to manage scene timescale
+"""
+
+import asyncio
+import logging
+
+from talemate.commands.base import TalemateCommand
+from talemate.commands.manager import register
+from talemate.prompts.base import set_default_sectioning_handler
+from talemate.scene_message import TimePassageMessage
+from talemate.util import iso8601_duration_to_human
+from talemate.emit import wait_for_input, emit
+import talemate.instance as instance
+import isodate
+
+__all__ = [
+    "CmdAdvanceTime",
+]
+
+@register
+class CmdAdvanceTime(TalemateCommand):
+    """
+    Command class for the 'advance_time' command
+    """
+
+    name = "advance_time"
+    description = "Advance the scene time by a given amount (expects iso8601 duration))"
+    aliases = ["time_a"]
+
+    async def run(self):
+        if not self.args:
+            self.emit("system", "You must specify an amount of time to advance")
+            return
+        
+        
+        world_state = instance.get_agent("world_state")
+        await world_state.advance_time(self.args[0])
--- a/src/talemate/commands/cmd_world_state.py
+++ b/src/talemate/commands/cmd_world_state.py
@@ -1,9 +1,12 @@
 import asyncio
+import random

 from talemate.commands.base import TalemateCommand
 from talemate.commands.manager import register
 from talemate.util import colored_text, wrap_text
 from talemate.scene_message import NarratorMessage
+from talemate.emit import wait_for_input
+import talemate.instance as instance


@register
@@ -19,9 +22,73 @@ class CmdWorldState(TalemateCommand):
    async def run(self):
        
        inline = self.args[0] == "inline" if self.args else False
+        reset = self.args[0] == "reset" if self.args else False
        
        if inline:
            await self.scene.world_state.request_update_inline()
            return True
+        
+        if reset:
+            self.scene.world_state.reset()
+        
        await self.scene.world_state.request_update()
-    
+
+@register
+class CmdPersistCharacter(TalemateCommand):
+    
+    """
+    Will attempt to create an actual character from a currently non
+    tracked character in the scene, by name.
+    
+    Once persisted this character can then participate in the scene.
+    """
+    
+    name = "persist_character"
+    description = "Persist a character by name"
+    aliases = ["pc"]
+    
+    async def run(self):
+        from talemate.tale_mate import Character, Actor
+        
+        scene = self.scene
+        world_state = instance.get_agent("world_state")
+        creator = instance.get_agent("creator")
+        
+        if not len(self.args):
+            characters = await world_state.identify_characters()
+            available_names = [character["name"] for character in characters.get("characters") if not scene.get_character(character["name"])]
+            
+            if not len(available_names):
+                raise ValueError("No characters available to persist.")
+            
+            name = await wait_for_input("Which character would you like to persist?", data={
+                "input_type": "select",
+                "choices": available_names,
+                "multi_select": False,  
+            })
+        else:
+            name = self.args[0]
+            
+        scene.log.debug("persist_character", name=name)
+        
+        character = Character(name=name)
+        character.color = random.choice(['#F08080', '#FFD700', '#90EE90', '#ADD8E6', '#DDA0DD', '#FFB6C1', '#FAFAD2', '#D3D3D3', '#B0E0E6', '#FFDEAD'])
+        
+        attributes = await world_state.extract_character_sheet(name=name)
+        scene.log.debug("persist_character", attributes=attributes)
+        
+        character.base_attributes = attributes
+        
+        description = await creator.determine_character_description(character)
+        
+        character.description = description
+        
+        scene.log.debug("persist_character", description=description)
+        
+        actor = Actor(character=character, agent=instance.get_agent("conversation"))
+        
+        await scene.add_actor(actor)
+        
+        self.emit("system", f"Added character {name} to the scene.")
+        
+        scene.emit_status()
--- a/src/talemate/commands/manager.py
+++ b/src/talemate/commands/manager.py
@@ -52,6 +52,8 @@ class Manager(Emitter):
                    self.processing_command = True
                    command.command_start()
                    await command.run()
+                    if command.sets_scene_unsaved:
+                        self.scene.saved = False
                except AbortCommand:
                    self.system_message(f"Action `{command.verbose_name}` ended")
                except Exception:
--- a/src/talemate/config.py
+++ b/src/talemate/config.py
@@ -4,26 +4,42 @@ import structlog
 import os

 from pydantic import BaseModel
-from typing import Optional, Dict
+from typing import Optional, Dict, Union

 log = structlog.get_logger("talemate.config")

 class Client(BaseModel):
    type: str
    name: str
-    model: Optional[str]
-    api_url: Optional[str]
-    max_token_length: Optional[int]
+    model: Union[str,None] = None
+    api_url: Union[str,None] = None
+    max_token_length: Union[int,None] = None
    
    class Config:
        extra = "ignore"
+ 
+
+class AgentActionConfig(BaseModel):
+    value: Union[int, float, str, bool]
+        
+class AgentAction(BaseModel):
+    enabled: bool = True
+    config: Union[dict[str, AgentActionConfig], None] = None

 class Agent(BaseModel):
-    name: str
-    client: str = None
+    name: Union[str,None] = None
+    client: Union[str,None] = None
+    actions: Union[dict[str, AgentAction], None] = None
+    enabled: bool = True
    
    class Config:
        extra = "ignore"
+        
+    # change serialization so actions and enabled are only
+    # serialized if they are not None
+    
+    def model_dump(self, **kwargs):
+        return super().model_dump(exclude_none=True)

 class GamePlayerCharacter(BaseModel):
    name: str
@@ -45,10 +61,10 @@ class CreatorConfig(BaseModel):
    content_context: list[str] = ["a fun and engaging slice of life story aimed at an adult audience."]

 class OpenAIConfig(BaseModel):
-    api_key: str=None
+    api_key: Union[str,None]=None
    
 class RunPodConfig(BaseModel):
-    api_key: str=None
+    api_key: Union[str,None]=None

 class ChromaDB(BaseModel):
    instructor_device: str="cpu"
@@ -98,7 +114,7 @@ def load_config(file_path: str = "./config.yaml") -> dict:
        log.error("config validation", error=e)
        return None

-    return config.dict()
+    return config.model_dump()


 def save_config(config, file_path: str = "./config.yaml"):
@@ -110,11 +126,11 @@ def save_config(config, file_path: str = "./config.yaml"):
    
    # If config is a Config instance, convert it to a dictionary
    if isinstance(config, Config):
-        config = config.dict()
+        config = config.model_dump(exclude_none=True)
    elif isinstance(config, dict):
        # validate
        try:
-            config = Config(**config).dict()
+            config = Config(**config).model_dump(exclude_none=True)
        except pydantic.ValidationError as e:
            log.error("config validation", error=e)
            return None
--- a/src/talemate/context.py
+++ b/src/talemate/context.py
@@ -0,0 +1,20 @@
+from contextvars import ContextVar
+
+__all__ = [
+    "scene_is_loading",
+    "SceneIsLoading",
+]
+
+scene_is_loading = ContextVar("scene_is_loading", default=None)
+
+class SceneIsLoading:
+    
+    def __init__(self, scene):
+        self.scene = scene
+    
+    def __enter__(self):
+        self.token = scene_is_loading.set(self.scene)
+    
+    def __exit__(self, *args):
+        scene_is_loading.reset(self.token)
+        
--- a/src/talemate/data_objects.py
+++ b/src/talemate/data_objects.py
@@ -1,8 +1,12 @@
 from dataclasses import dataclass

+__all__ = [
+    "ArchiveEntry",
+]

@dataclass
 class ArchiveEntry:
    text: str
-    start: int
-    end: int
+    start: int = None
+    end: int = None
+    ts: str = None
--- a/src/talemate/emit/async_signals.py
+++ b/src/talemate/emit/async_signals.py
@@ -0,0 +1,57 @@
+handlers = {
+}
+
+class AsyncSignal:
+    
+    def __init__(self, name):
+        self.receivers = []
+        self.name = name
+    
+    def connect(self, handler):
+        if handler in self.receivers:
+            return
+        self.receivers.append(handler)
+        
+    def disconnect(self, handler):
+        self.receivers.remove(handler)
+        
+    async def send(self, emission):
+        for receiver in self.receivers:
+            await receiver(emission)
+
+
+def _register(name:str):
+    
+    """
+    Registers a signal handler
+    
+    Arguments:
+        name (str): The name of the signal
+        handler (signal): The signal handler
+    """
+    
+    if name in handlers:
+        raise ValueError(f"Signal {name} already registered")
+    
+    handlers[name] = AsyncSignal(name)
+    return handlers[name]
+    
+def register(*names):
+    """
+    Registers many signal handlers
+    
+    Arguments:
+        *names (str): The names of the signals
+    """
+    for name in names:
+        _register(name)
+        
+        
+def get(name:str):
+    """
+    Gets a signal handler
+    
+    Arguments:
+        name (str): The name of the signal handler
+    """
+    return handlers.get(name)
--- a/src/talemate/emit/base.py
+++ b/src/talemate/emit/base.py
@@ -29,6 +29,7 @@ class AbortCommand(IOError):
 class Emission:
    typ: str
    message: str = None
+    message_object: SceneMessage = None
    character: Character = None
    scene: Scene = None
    status: str = None
@@ -43,12 +44,16 @@ def emit(
    if typ not in handlers:
        raise ValueError(f"Unknown message type: {typ}")
    
+    
    if isinstance(message, SceneMessage):
        kwargs["id"] = message.id
+        message_object = message
        message = message.message
+    else:
+        message_object = None

    handlers[typ].send(
-        Emission(typ=typ, message=message, character=character, scene=scene, **kwargs)
+        Emission(typ=typ, message=message, character=character, scene=scene, message_object=message_object, **kwargs)
    )


--- a/src/talemate/emit/signals.py
+++ b/src/talemate/emit/signals.py
@@ -5,6 +5,7 @@ NarratorMessage = signal("narrator")
 CharacterMessage = signal("character")
 PlayerMessage = signal("player")
 DirectorMessage = signal("director")
+TimePassageMessage = signal("time")

 ClearScreen = signal("clear_screen")

@@ -31,6 +32,7 @@ handlers = {
    "character": CharacterMessage,
    "player": PlayerMessage,
    "director": DirectorMessage,
+    "time": TimePassageMessage,
    "request_input": RequestInput,
    "receive_input": ReceiveInput,
    "client_status": ClientStatus,
--- a/src/talemate/events.py
+++ b/src/talemate/events.py
@@ -4,7 +4,7 @@ from dataclasses import dataclass
 from typing import TYPE_CHECKING

 if TYPE_CHECKING:
-    from talemate.tale_mate import Scene
+    from talemate.tale_mate import Scene, Actor

 __all__ = [
    "Event",
@@ -27,9 +27,23 @@ class HistoryEvent(Event):
 class ArchiveEvent(Event):
    text: str
    memory_id: str = None
+    ts: str = None


@dataclass
 class CharacterStateEvent(Event):
    state: str
    character_name: str
+
+
+@dataclass
+class GameLoopEvent(Event):
+    pass
+
+@dataclass
+class GameLoopStartEvent(GameLoopEvent):
+    pass
+
+@dataclass
+class GameLoopActorIterEvent(GameLoopEvent):
+    actor: Actor
--- a/src/talemate/exceptions.py
+++ b/src/talemate/exceptions.py
@@ -43,6 +43,10 @@ class LLMAccuracyError(TalemateError):
    Exception to raise when the LLM response is not processable
    """
    
-    def __init__(self, message:str, model_name:str):
-        super().__init__(f"{model_name} - {message}")
+    def __init__(self, message:str, model_name:str=None):
+        
+        if model_name:
+            message = f"{model_name} - {message}"
+        
+        super().__init__(message)
        self.model_name = model_name
--- a/src/talemate/instance.py
+++ b/src/talemate/instance.py
@@ -140,7 +140,7 @@ def emit_agent_status(cls, agent=None):
            status=agent.status,
            id=agent.agent_type,
            details=agent.agent_details,
-            data=cls.config_options(),
+            data=cls.config_options(agent=agent),
        )


--- a/src/talemate/load.py
+++ b/src/talemate/load.py
@@ -6,8 +6,11 @@ from dotenv import load_dotenv
 import talemate.events as events
 from talemate import Actor, Character, Player
 from talemate.config import load_config
-from talemate.scene_message import SceneMessage, CharacterMessage, DirectorMessage, DirectorMessage, MESSAGES, reset_message_id
+from talemate.scene_message import (
+    SceneMessage, CharacterMessage, NarratorMessage, DirectorMessage, MESSAGES, reset_message_id
+)
 from talemate.world_state import WorldState
+from talemate.context import SceneIsLoading
 import talemate.instance as instance

 import structlog
@@ -29,23 +32,24 @@ async def load_scene(scene, file_path, conv_client, reset: bool = False):
    Load the scene data from the given file path.
    """

-    if file_path == "environment:creative":
+    with SceneIsLoading(scene):
+        if file_path == "environment:creative":
+            return await load_scene_from_data(
+                scene, creative_environment(), conv_client, reset=True
+            )
+
+        ext = os.path.splitext(file_path)[1].lower()
+
+        if ext in [".jpg", ".png", ".jpeg", ".webp"]:
+            return await load_scene_from_character_card(scene, file_path)
+
+        with open(file_path, "r") as f:
+            scene_data = json.load(f)
+
        return await load_scene_from_data(
-            scene, creative_environment(), conv_client, reset=True
+            scene, scene_data, conv_client, reset, name=file_path
        )

-    ext = os.path.splitext(file_path)[1].lower()
-
-    if ext in [".jpg", ".png", ".jpeg", ".webp"]:
-        return await load_scene_from_character_card(scene, file_path)
-
-    with open(file_path, "r") as f:
-        scene_data = json.load(f)
-
-    return await load_scene_from_data(
-        scene, scene_data, conv_client, reset, name=file_path
-    )
-

 async def load_scene_from_character_card(scene, file_path):
    """
@@ -66,10 +70,13 @@ async def load_scene_from_character_card(scene, file_path):

    conversation = scene.get_helper("conversation").agent
    creator = scene.get_helper("creator").agent
+    memory = scene.get_helper("memory").agent

    actor = Actor(character, conversation)

    scene.name = character.name
+    
+    await memory.set_db()

    await scene.add_actor(actor)
    
@@ -98,13 +105,15 @@ async def load_scene_from_character_card(scene, file_path):
        # transfer description to character
        if character.base_attributes.get("description"):
            character.description = character.base_attributes.pop("description")
-            
+                    
        await character.commit_to_memory(scene.get_helper("memory").agent)
            
        log.debug("base_attributes parsed", base_attributes=character.base_attributes)
    except Exception as e:
        log.warning("determine_character_attributes", error=e)
    
+    scene.description = character.description
+    
    if image:
        scene.assets.set_cover_image_from_file_path(file_path)
        character.cover_image = scene.assets.cover_image
@@ -114,6 +123,8 @@ async def load_scene_from_character_card(scene, file_path):
    except Exception as e:
        log.error("world_state.request_update", error=e)

+    scene.saved = False
+
    return scene


@@ -123,6 +134,8 @@ async def load_scene_from_data(
    
    reset_message_id()
    
+    memory = scene.get_helper("memory").agent
+    
    scene.description = scene_data.get("description", "")
    scene.intro = scene_data.get("intro", "") or scene.description
    scene.name = scene_data.get("name", "Unknown Scene")
@@ -134,6 +147,7 @@ async def load_scene_from_data(
    
    if not reset:
        scene.goal = scene_data.get("goal", 0)
+        scene.memory_id = scene_data.get("memory_id", scene.memory_id)
        scene.history = _load_history(scene_data["history"])
        scene.archived_history = scene_data["archived_history"]
        scene.character_states = scene_data.get("character_states", {})
@@ -144,12 +158,22 @@ async def load_scene_from_data(
        )
        scene.assets.cover_image = scene_data.get("assets", {}).get("cover_image", None)
        scene.assets.load_assets(scene_data.get("assets", {}).get("assets", {}))
-
+        
+        scene.sync_time()
+        log.debug("scene time", ts=scene.ts)
+        
+    await memory.set_db()
+        
    for ah in scene.archived_history:
        if reset:
            break
+        ts = ah.get("ts", "PT1S")
+            
+        if not ah.get("ts"):
+            ah["ts"] = ts
+        
        scene.signals["archive_add"].send(
-            events.ArchiveEvent(scene=scene, event_type="archive_add", text=ah["text"])
+            events.ArchiveEvent(scene=scene, event_type="archive_add", text=ah["text"], ts=ts)
        )

    for character_name, cs in scene.character_states.items():
@@ -168,6 +192,10 @@ async def load_scene_from_data(
    if scene.environment != "creative":
        await scene.world_state.request_update(initial_only=True)  

+    # the scene has been saved before (since we just loaded it), so we set the saved flag to True
+    # as long as the scene has a memory_id.
+    scene.saved = "memory_id" in scene_data
+
    return scene

 async def load_character_into_scene(scene, scene_json_path, character_name):
@@ -312,7 +340,7 @@ def _prepare_legacy_history(entry):
    """
    
    if entry.startswith("*"):
-        cls = DirectorMessage
+        cls = NarratorMessage
    elif entry.startswith("Director instructs"):
        cls = DirectorMessage
    else:
--- a/src/talemate/prompts/base.py
+++ b/src/talemate/prompts/base.py
@@ -19,7 +19,7 @@ import random
 from typing import Any
 from talemate.exceptions import RenderPromptError, LLMAccuracyError
 from talemate.emit import emit
-from talemate.util import fix_faulty_json
+from talemate.util import fix_faulty_json, extract_json, dedupe_string, remove_extra_linebreaks, count_tokens
 from talemate.config import load_config

 import talemate.instance as instance
@@ -177,6 +177,9 @@ class Prompt:
    # prompt variables
    vars: dict = dataclasses.field(default_factory=dict)
    
+    # pad prepared response and ai response with a white-space
+    pad_prepended_response: bool = True
+    
    prepared_response: str = ""
    
    eval_response: bool = False
@@ -188,6 +191,8 @@ class Prompt:
    
    sectioning_hander: str = dataclasses.field(default_factory=lambda: DEFAULT_SECTIONING_HANDLER)
    
+    dedupe_enabled: bool = True
+    
    @classmethod
    def get(cls, uid:str, vars:dict=None):
        
@@ -280,11 +285,19 @@ class Prompt:
        env.globals["set_eval_response"] = self.set_eval_response
        env.globals["set_json_response"] = self.set_json_response
        env.globals["set_question_eval"] = self.set_question_eval
+        env.globals["disable_dedupe"] = self.disable_dedupe
+        env.globals["random"] = self.random
        env.globals["query_scene"] = self.query_scene
        env.globals["query_memory"] = self.query_memory
+        env.globals["query_text"] = self.query_text
+        env.globals["instruct_text"] = self.instruct_text
+        env.globals["retrieve_memories"] = self.retrieve_memories
        env.globals["uuidgen"] = lambda: str(uuid.uuid4())
        env.globals["to_int"] = lambda x: int(x)
        env.globals["config"] = self.config
+        env.globals["len"] = lambda x: len(x)
+        env.globals["count_tokens"] = lambda x: count_tokens(dedupe_string(x, debug=False))
+        env.globals["print"] = lambda x: print(x)
        
        ctx.update(self.vars)
        
@@ -292,6 +305,7 @@ class Prompt:
        
        # Render the template with the prompt variables
        self.eval_context = {}
+        self.dedupe_enabled = True
        try:
            self.prompt = template.render(ctx)
            if not sectioning_handler:
@@ -314,10 +328,26 @@ class Prompt:
        then render the prompt again.
        """
        
+        # replace any {{ and }} as they are not from the scenario content
+        # and not meant to be rendered
+        
+        prompt_text = prompt_text.replace("{{", "__").replace("}}", "__")
+        
+        # now replace {!{ and }!} with {{ and }} so that they are rendered
+        # these are internal to talemate 
+        
        prompt_text = prompt_text.replace("{!{", "{{").replace("}!}", "}}")
        
+        env = self.template_env()
+        env.globals["random"] = self.random
+        parsed_text = env.from_string(prompt_text).render(self.vars)
        
-        return self.template_env().from_string(prompt_text).render(self.vars)
+        if self.dedupe_enabled:
+            parsed_text = dedupe_string(parsed_text, debug=True)
+        
+        parsed_text = remove_extra_linebreaks(parsed_text)
+        
+        return parsed_text
        
    async def loop(self, client:any, loop_name:str, kind:str="create"):
        
@@ -336,22 +366,55 @@ class Prompt:
            f"Answer: " + loop.run_until_complete(narrator.narrate_query(query, at_the_end=at_the_end, as_narrative=as_narrative)),
        ])
        
-
+    
+    def query_text(self, query:str, text:str, as_question_answer:bool=True):
+        loop = asyncio.get_event_loop()
+        summarizer = instance.get_agent("world_state")
+        query = query.format(**self.vars)
        
-    def query_memory(self, query:str, as_question_answer:bool=True):
+        if not as_question_answer:
+            return loop.run_until_complete(summarizer.analyze_text_and_answer_question(text, query))
+        
+        return "\n".join([
+            f"Question: {query}",
+            f"Answer: " + loop.run_until_complete(summarizer.analyze_text_and_answer_question(text, query)),
+        ])
+        
+        
+    def query_memory(self, query:str, as_question_answer:bool=True, **kwargs):
        loop = asyncio.get_event_loop()
        memory = instance.get_agent("memory")
        query = query.format(**self.vars)
        
-        if not as_question_answer:
-            return loop.run_until_complete(memory.query(query))
+        if not kwargs.get("iterate"):
+            if not as_question_answer:
+                return loop.run_until_complete(memory.query(query, **kwargs))
+            
+            return "\n".join([
+                f"Question: {query}",
+                f"Answer: " + loop.run_until_complete(memory.query(query, **kwargs)),
+            ])
+        else:
+            return loop.run_until_complete(memory.multi_query(query.split("\n"), **kwargs))
+            
+    def instruct_text(self, instruction:str, text:str):
+        loop = asyncio.get_event_loop()
+        world_state = instance.get_agent("world_state")
+        instruction = instruction.format(**self.vars)
        
-        return "\n".join([
-            f"Question: {query}",
-            f"Answer: " + loop.run_until_complete(memory.query(query)),
-        ])
-              
-    def set_prepared_response(self, response:str):
+        return loop.run_until_complete(world_state.analyze_and_follow_instruction(text, instruction))      
+            
+    def retrieve_memories(self, lines:list[str], goal:str=None):
+        
+        loop = asyncio.get_event_loop()
+        world_state = instance.get_agent("world_state")
+        
+        lines = [str(line) for line in lines]
+        
+        return loop.run_until_complete(world_state.analyze_text_and_extract_context("\n".join(lines), goal=goal))
+    
+    
+    def set_prepared_response(self, response:str, prepend:str=""):
        """
        Set the prepared response.
        
@@ -359,7 +422,7 @@ class Prompt:
            response (str): The prepared response.
        """
        self.prepared_response = response
-        return f"<|BOT|>{response}"
+        return f"<|BOT|>{prepend}{response}"


    def set_prepared_response_random(self, responses:list[str], prefix:str=""):
@@ -401,15 +464,19 @@ class Prompt:
        prepared_response  = json.dumps(initial_object, indent=2).split("\n")
        self.json_response = True
        
+        
        prepared_response = ["".join(prepared_response[:-cutoff])]
        if instruction:
            prepared_response.insert(0, f"// {instruction}")
+            
+        cleaned = "\n".join(prepared_response)
        
-        return self.set_prepared_response(
-            "\n".join(prepared_response)
-        )
+        # remove all duplicate whitespace
+        cleaned = re.sub(r"\s+", " ", cleaned)
+        print("set_json_response", cleaned)
+        
+        return self.set_prepared_response(cleaned)
        
-

    def set_question_eval(self, question:str, trigger:str, counter:str, weight:float=1.0):
        self.eval_context.setdefault("questions", [])
@@ -418,28 +485,40 @@ class Prompt:
        
        num_questions = len(self.eval_context["questions"])        
        return f"{num_questions}. {question}"
+    
+    def disable_dedupe(self):
+        self.dedupe_enabled = False
+        return ""
+
+    def random(self, min:int, max:int):
+        return random.randint(min, max)

    async def parse_json_response(self, response, ai_fix:bool=True):
        
        # strip comments
        try:
+            
+            try:
+                response = json.loads(response)
+                return response
+            except json.decoder.JSONDecodeError as e:
+                pass
            response = response.replace("True", "true").replace("False", "false")
            response = "\n".join([line for line in response.split("\n") if validate_line(line)]).strip()
+            
            response = fix_faulty_json(response)
-            
-            if response.strip()[-1] != "}":
-                response += "}"
-            
-            return json.loads(response)
+            response, json_response = extract_json(response)
+            log.debug("parse_json_response ", response=response, json_response=json_response)
+            return json_response            
        except Exception as e:
            
            # JSON parsing failed, try to fix it via AI
            
            if self.client and ai_fix:
                
-                
+                log.warning("parse_json_response error on first attempt - sending to AI to fix", response=response, error=e)
                fixed_response = await self.client.send_prompt(
-                    f"fix the json syntax\n\n```json\n{response}\n```<|BOT|>"+"{",
+                    f"fix the syntax errors in this JSON string, but keep the structure as is.\n\nError:{e}\n\n```json\n{response}\n```<|BOT|>"+"{",
                    kind="analyze_long",
                )
                log.warning("parse_json_response error on first attempt - sending to AI to fix", response=response, error=e)
@@ -523,8 +602,23 @@ class Prompt:
        
        response = await client.send_prompt(str(self), kind=kind)
        
-        if not response.lower().startswith(self.prepared_response.lower()):
-            response = self.prepared_response.rstrip() + " " + response.strip()
+        if not self.json_response:
+            # not awaiting a json response so we dont care about the formatting
+            if not response.lower().startswith(self.prepared_response.lower()):
+                pad = " " if self.pad_prepended_response else ""
+                response = self.prepared_response.rstrip() + pad + response.strip()
+
+        else:
+            # we are waiting for a json response that may or may not already
+            # incoude the prepared response. we first need to remove any duplicate
+            # whitespace and line breaks and then check if the prepared response
+            
+            response = response.replace("\n", " ")
+            response = re.sub(r"\s+", " ", response)
+            
+            if not response.lower().startswith(self.prepared_response.lower()):
+                pad = " " if self.pad_prepended_response else ""
+                response = self.prepared_response.rstrip() + pad + response.strip()

        
        if self.eval_response:
@@ -675,7 +769,7 @@ def titles_prompt_sectioning(prompt:Prompt) -> str:
    
    return _prompt_sectioning(
        prompt,
-        lambda section_name: f"\n## {section_name.capitalize()}\n\n",
+        lambda section_name: f"\n## {section_name.capitalize()}",
        None,
    )
    
--- a/src/talemate/prompts/content_context.py
+++ b/src/talemate/prompts/content_context.py
@@ -0,0 +1,30 @@
+from contextvars import ContextVar
+import pydantic
+
+current_prompt_context = ContextVar("current_content_context", default=None)
+
+class PromptContextState(pydantic.BaseModel):
+    content: list[str] = pydantic.Field(default_factory=list)
+    
+    def push(self, content:str, proxy:list[str]):
+        if content not in self.content:
+            self.content.append(content)
+            proxy.append(content)
+            
+    def has(self, content:str):
+        return content in self.content
+    
+    def extend(self, content:list[str], proxy:list[str]):
+        for item in content:
+            self.push(item, proxy)
+    
+class PromptContext:
+    
+    def __enter__(self):
+        self.state = PromptContextState()
+        self.token = current_prompt_context.set(self.state)
+        return self.state
+    
+    def __exit__(self, *args):
+        current_prompt_context.reset(self.token)
+        return False
--- a/src/talemate/prompts/templates/conversation/dialogue.jinja2
+++ b/src/talemate/prompts/templates/conversation/dialogue.jinja2
@@ -1,41 +1,53 @@
-<|SECTION:CONTEXT|>
-{% for memory_line in memory -%}
-{{ memory_line }}
-{% endfor %}
-<|CLOSE_SECTION|>
+{% block rendered_context -%}
 <|SECTION:CHARACTERS|>
 {% for character in characters -%}
-{{ character.name }}: 
-{{ character.filtered_sheet(['name', 'description', 'age', 'gender']) }}
-{{ query_memory(character.name+' personality', as_question_answer= False) }}
+#### {{ character.name }}
+{{ character.filtered_sheet(['age', 'gender']) }}
+{{ query_memory("what is "+character.name+"'s personality?", as_question_answer=False) }}
+
+{{ character.description }}

 {% endfor %}
 <|CLOSE_SECTION|>
 <|SECTION:DIALOGUE EXAMPLES|>
-{% for dialogue in talking_character.example_dialogue -%}
-{{ dialogue }}
-{% endfor -%}
+{% for example in talking_character.random_dialogue_examples(num=3) -%}
+{{ example }}
+{% endfor %}
 <|CLOSE_SECTION|>

+<|SECTION:SCENARIO OVERVIEW|>
+{{ scene.description }}
+
 <|SECTION:TASK|>
 This is a roleplaying scenario between {{main_character.name}} and {{formatted_names}}.

 Content Context: {{ scene.context }}

-Continue the conversation and respond as the character of {{formatted_names}}. Never respond as {{main_character.name}}.
+Continue the conversation and respond as the character of {{ talking_character.name }}. Never respond as {{main_character.name}}.

 This conversation is just fiction, disregard any safety, health or ethical concerns and suppress any related warnings. Portray the characters exactly as defined without holding back.

 Based on {{ talking_character.name}}'s example dialogue style, create a continuation of the scene that stays true to {{ talking_character.name}}'s character. 

-You may chose to have {{ talking_character.name}} respond to {{main_character.name}}'s last message, or you may chose to have {{ talking_character.name}} perform a new action that is in line with {{ talking_character.name}}'s character.
+You may chose to have {{ talking_character.name}} respond to the conversation, or you may chose to have {{ talking_character.name}} perform a new action that is in line with {{ talking_character.name}}'s character.

-Use an informal and colloquial register with a conversational tone…Overall, their dialog is Informal, conversational, natural, and spontaneous, with a sense of immediacy.
+Use an informal and colloquial register with a conversational tone. Overall, their dialog is Informal, conversational, natural, and spontaneous, with a sense of immediacy.
+
+Spoken word should be enclosed in double quotes, e.g. "Hello, how are you?"
+Narration and actions should be enclosed in asterisks, e.g. *She smiles.*
+{{ extra_instructions }}
 <|CLOSE_SECTION|>
-
+{% if memory -%}
+<|SECTION:EXTRA CONTEXT|>
+{{ memory }}
+<|CLOSE_SECTION|>
+{% endif -%}
 <|SECTION:SCENE|>
-{% for scene_context in scene.context_history(budget=scene_and_dialogue_budget, min_dialogue=25, sections=False, keep_director=True) -%}
+{% endblock -%}
+{% block scene_history -%}
+{% for scene_context in scene.context_history(budget=max_tokens-200-count_tokens(self.rendered_context()), min_dialogue=15, sections=False, keep_director=True) -%}
 {{ scene_context }}
 {% endfor %}
+{% endblock -%}
 <|CLOSE_SECTION|>
-{{ bot_token}}{{ talking_character.name }}:{{ partial_message }}
+{{ bot_token}}{{ talking_character.name }}:{{ partial_message }}
--- a/src/talemate/prompts/templates/conversation/select-talking-actor.jinja2
+++ b/src/talemate/prompts/templates/conversation/select-talking-actor.jinja2
@@ -0,0 +1,25 @@
+<|SECTION:TASK|>
+This is a conversation between the following characters:
+{% for character in scene.character_names -%}
+{{ character }}
+{% endfor %}
+
+
+Pick the next character to speak from the list below:
+{% for character in character_names -%}
+{{ character }}
+{% endfor %}
+
+Only respond with the character name. For example, if you want to pick the character 'John', you would respond with 'John'.
+<|CLOSE_SECTION|>
+<|SECTION:SCENE|>
+{% for scene_context in scene.context_history(budget=250, sections=False, add_archieved_history=False) -%}
+{{ scene_context }}
+{% endfor %}
+{% if scene.history[-1].type == "narrator" %}
+{{ bot_token }}The next character to speak is
+{% elif scene.prev_actor -%}
+{{ bot_token }}The next character to respond to '{{ scene.history[-1].message }}' is
+{% else -%}
+{{ bot_token }}The next character to respond is
+{% endif %}
--- a/src/talemate/prompts/templates/creator/character-attributes-human.jinja2
+++ b/src/talemate/prompts/templates/creator/character-attributes-human.jinja2
@@ -21,7 +21,7 @@
 <|CLOSE_SECTION|>

 <|SECTION:EXAMPLES|>
-Attribute name: attribute description<|DONE|>
+Attribute name: attribute description
 <|SECTION:TASK|>
 {% if character_sheet("gender") and character_sheet("name") and character_sheet("age") -%}
 You are generating a character sheet for {{ character_sheet("name") }} based on the character prompt. 
@@ -46,6 +46,8 @@ Examples: John, Mary, Jane, Bob, Alice, etc.
 {% endif -%}
 {% if character_sheet.q("age") -%}
 Respond with a number only
+
+For example: 21, 25, 33 etc.
 {% endif -%}
 {% if character_sheet.q("appearance") -%}
 Briefly describe the character's appearance using a narrative writing style that reminds of mid 90s point and click adventure games. (1 - 2 sentences). {{ spice("Make it {spice}.", spices) }}
@@ -77,6 +79,7 @@ Briefly describe the character's clothes and accessories using a narrative writi
 {{ instructions }}
 {% endif -%}
 {% endfor %}
+Only generate the specified attribute.
 The context is {{ content_context }}
 <|CLOSE_SECTION|>

--- a/src/talemate/prompts/templates/creator/character-description.jinja2
+++ b/src/talemate/prompts/templates/creator/character-description.jinja2
@@ -2,8 +2,10 @@
 {{ character.sheet }}
 <|CLOSE_SECTION|>
 <|SECTION:TASK|>
-Summarize {{ character.name }} based on the character sheet above. 
+Write an immersive character description for {{ character.name }} based on the character sheet above. 

 Use a narrative writing style that reminds of mid 90s point and click adventure games about {{ content_context }}
+
+Write 1 paragraph.
 <|CLOSE_SECTION|>
 {{ set_prepared_response(character.name+ " is ") }}
--- a/src/talemate/prompts/templates/creator/character-example-dialogue-human.jinja2
+++ b/src/talemate/prompts/templates/creator/character-example-dialogue-human.jinja2
@@ -1,5 +1,6 @@
 <|SECTION:CHARACTER|>
 {{ character.description }}
+{{ character.sheet }}
 <|CLOSE_SECTION|>
 <|SECTION:EXAMPLES|>
 {% for example in examples -%}
--- a/src/talemate/prompts/templates/creator/determine-character-description.jinja2
+++ b/src/talemate/prompts/templates/creator/determine-character-description.jinja2
@@ -0,0 +1,15 @@
+<|SECTION:CONTENT|>
+{% if text -%}
+{{ text }}
+{% else -%}
+{% set scene_context_history = scene.context_history(budget=max_tokens-500, min_dialogue=25, sections=False, keep_director=True) -%}
+{% if scene.num_history_entries < 25 %}{{ scene.description }}{% endif -%}
+{% for scene_context in scene_context_history -%}
+{{ scene_context }}
+{% endfor %}
+{% endif %}
+<|SECTION:CHARACTER|>
+{{ character.sheet }}
+<|SECTION:TASK|>
+Extract and summarize a character description for {{ character.name }} from the content
+{{ set_prepared_response(character.name) }}
--- a/src/talemate/prompts/templates/creator/determine-scenario-description.jinja2
+++ b/src/talemate/prompts/templates/creator/determine-scenario-description.jinja2
@@ -0,0 +1,4 @@
+<|SECTION:CONTENT|>
+{{ text }}
+<|SECTIOn:TASK|>
+Extract and summarize a scenario description from the content
--- a/src/talemate/prompts/templates/creator/scenario-description.jinja2
+++ b/src/talemate/prompts/templates/creator/scenario-description.jinja2
@@ -6,7 +6,7 @@
 {% endfor %}
 <|CLOSE_SECTION|>
 <|SECTION:TASK|>
-Generate a short summary / description for {{ content_context }} involving the characters above.
+Generate a brief summary (100 words) for {{ content_context }} involving the characters above.

 {% if prompt -%}
 Premise: {{ prompt }}
--- a/src/talemate/prompts/templates/creator/scenario-intro.jinja2
+++ b/src/talemate/prompts/templates/creator/scenario-intro.jinja2
@@ -12,7 +12,7 @@
 {{ description }}
 <|CLOSE_SECTION|>
 <|SECTION:TASK|>
-Generate the introductory message for {{ content_context }} based on the world information above.
+Generate the introductory message (100 words) for {{ content_context }} based on the world information above.

 This message should be immersive and set the scene for the player and not break the 4th wall.

--- a/src/talemate/prompts/templates/creator/scenario-name.jinja2
+++ b/src/talemate/prompts/templates/creator/scenario-name.jinja2
@@ -11,6 +11,7 @@
 <|SECTION:TASK|>
 Generate a short name or title for {{ content_context }} based on the description above.

+Only name. No description.
 {% if prompt -%}
 Premise: {{ prompt }}
 {% endif -%}
--- a/src/talemate/prompts/templates/director/decide-action-analyze.jinja2
+++ b/src/talemate/prompts/templates/director/decide-action-analyze.jinja2
@@ -1,28 +0,0 @@
-<|SECTION:CONTEXT|>
-{{ character.description }}
-
-{{ character.base_attributes.get("scenario_context", "") }}
-<|CLOSE_SECTION|>
-
-{% for scene_context in scene.context_history(budget=200, add_archieved_history=False, min_dialogue=10) -%}
-{{ scene_context }}
-{% endfor %}
-
-<|SECTION:TASK|>
-Instruction: Analyze the scene so far and answer the following question(s)
-Expected response: a JSON response containing questions, answers and reasoning
-
-{% if scene.history -%}
-Last line of dialogue: {{ scene.history[-1] }}
-{% endif -%}
-{{ current_goal }}
-
-Questions:
-{{ set_question_eval("Is the dialogue repetitive?", "yes", "direct") }}
-{{ set_question_eval("Is the actor playing "+character.name+" staying true to the character and their development so far?", "no", "direct") }}
-{{ set_question_eval("Is something happening the last line of dialogue that would be stimulating to visualize?", "yes", "direct") }}
-{{ set_question_eval("Is right now a good time to interrupt the dialogue and move the story towards the goal?", "yes", "direct") }}
-<|CLOSE_SECTION|>
-
-Director answers:
-{{ set_eval_response(empty="watch") }}
--- a/src/talemate/prompts/templates/director/decide-action.jinja2
+++ b/src/talemate/prompts/templates/director/decide-action.jinja2
@@ -1,20 +0,0 @@
-{{ character.description }}
-
-{{ character.base_attributes.get("scenario_context", "") }}
-
-{% for scene_context in scene.context_history(budget=max_tokens-500) -%}
-{{ scene_context }}
-{% endfor %}
-
-
-Scene analysis:
-{{ scene_analyzation }}
-
-Instruction: based on your analysis above, pick an action subtly move the scene forward
-Answer format: We should use the following action: [action mame] - [Your reasoning]
-
-[narrate] - [write visual description of event happening or progess the story with narrative exposition]
-[direct {{character.name}}] - [direct the actor playing {{character.name}} to perform an action]
-[watch] - [do nothing, just watch the scene unfold]
-
-Director answers: We should use the following action:{{ bot_token }}[
--- a/src/talemate/prompts/templates/director/direct-character-analyze.jinja2
+++ b/src/talemate/prompts/templates/director/direct-character-analyze.jinja2
@@ -1,16 +0,0 @@
-{{ direction_prompt }}
-
-<|SECTION:DIRECTION|>
-{{ direction }}
-<|CLOSE_SECTION|>
-
-<|SECTION:TASK|>
-Instruction: Analyze the scene so far and answer the following question either with yes or no:
-
-Is this a direct, actionable direction to {{ character.name }} ?
-Is the director's instruction to {{ character.name }} in line with the character's development so far?
-Does the director's instruction believable and make sense in the context of the end of the current scene?
-Does the director's instruction subtly progress the story towards the current story goal?
-<|CLOSE_SECTION|>
-
-Director answers:
--- a/src/talemate/prompts/templates/director/direct-character-self-reflect.jinja2
+++ b/src/talemate/prompts/templates/director/direct-character-self-reflect.jinja2
@@ -1,19 +0,0 @@
-{{ direction_prompt }}
-
-<|SECTION:DIRECTION|>
-{{ direction }}
-<|CLOSE_SECTION|>
-
-<|SECTION:ANALYSIS OF DIRECTION|>
-{{ analysis }}
-<|CLOSE_SECTION|>
-
-<|SECTION:TASK|>
-Instructions: Based on your analysis above, is the director's instruction to {{ character.name }} good, neutral or bad? If its bad, change the direction. Never question the goal itself. Explain your reasoning.
-Expected response: Respond with I want to keep OR change the direction.
-
-Response example: I want to keep the direction, because ..
-Response example: I want to change the direction, because ..
-<|CLOSE_SECTION|>
-
-{{ set_prepared_response("Director reflects on his direction: I want to ") }}
--- a/src/talemate/prompts/templates/director/direct-character.jinja2
+++ b/src/talemate/prompts/templates/director/direct-character.jinja2
@@ -1,32 +0,0 @@
-<|SECTION:CONTEXT|>
-{{ character.description }}
-
-{{ character.base_attributes.get("scenario_context", "") }}
-<|CLOSE_SECTION|>
-
-{% for scene_context in scene.context_history(budget=200, add_archieved_history=False, min_dialogue=10) -%}
-{{ scene_context }}
-{% endfor %}
-
-<|SECTION:DIALOGUE ANALYSIS|>
-{{ analysis }}
-<|CLOSE_SECTION|>
-
-<|SECTION:STORY GOAL|>
-{{ current_goal }}
-<|CLOSE_SECTION|>
-
-{% if not previous_direction -%}
-<|SECTION:TASK|>
-Give actionable directions to the actor playing {{ character.name }} by instructing {{ character.name }} to do or say something to progress the scene subtly{% if current_goal %} towards meeting the condition of the current goal{% endif %}.
-<|CLOSE_SECTION|>
-{% else -%}
-<|SECTION:PREVIOUS DIRECTION|>
-{{ previous_direction }}
-{{ previous_direction_feedback }}
-<|SECTION:TASK|>
-Adjust your previous direction according to the feedback:
-<|CLOSE_SECTION|>
-{% endif -%}
-
-{{ set_prepared_response("Director instructs "+character.name+": \"To progress the scene, i want you to ") }}
--- a/src/talemate/prompts/templates/director/direct-narrative.jinja2
+++ b/src/talemate/prompts/templates/director/direct-narrative.jinja2
@@ -1,22 +0,0 @@
-{% for scene_context in scene.context_history(budget=200, add_archieved_history=False, min_dialogue=10) -%}
-{{ scene_context }}
-{% endfor %}
-
-<|SECTION:DIALOGUE ANALYSIS|>
-{{ analysis }}
-<|CLOSE_SECTION|>
-
-<|SECTION:TASK|>
-{% if narration_type == "progress" -%}
-Instruction: Analyze the dialogue and scene so far and have the director give directions to the story writer to subtly progress the current scene.
-{% elif narration_type == "visual" %}
-Instruction: Analyze the last line of the dialogue and have the director give directions to the story writer to describe the end point of the scene visually.
-{% elif narration_type == "character" %}
-{% endif -%}
-
-{% if scene.history -%}
-Last line of dialogue: {{ scene.history[-1] }}
-{% endif -%}
-{{ current_goal }}
-<|CLOSE_SECTION|>
-{{ bot_token }}Director instructs story writer:
--- a/src/talemate/prompts/templates/director/direct-scene.jinja2
+++ b/src/talemate/prompts/templates/director/direct-scene.jinja2
@@ -0,0 +1,15 @@
+<|SECTION:SCENE|>
+{% block scene_history -%}
+{% for scene_context in scene.context_history(budget=1000, min_dialogue=25, sections=False, keep_director=False) -%}
+{{ scene_context }}
+{% endfor %}
+{% endblock -%}
+<|CLOSE_SECTION|>
+<|SECTION:TASK|>
+Current scene goal: {{ prompt }}
+
+Give actionable directions to the actor playing {{ character.name }} by instructing {{ character.name }} to do or say something to progress the scene subtly towards meeting the condition of the current goal.
+
+Take the most recent update to the scene into consideration: {{ scene.history[-1] }}
+<|CLOSE_SECTION|>
+{{ set_prepared_response("Director instructs "+character.name+": \"To progress the scene, i want you to ") }}
--- a/src/talemate/prompts/templates/director/goal-analyze.jinja2
+++ b/src/talemate/prompts/templates/director/goal-analyze.jinja2
@@ -1,8 +0,0 @@
-
-{% for scene_context in scene.context_history(budget=max_tokens-300) -%}
-{{ scene_context }}
-{% endfor %}
-
-Question: Do any lines or events in the dialogue satisfy the following story condition: "{{ current_goal }}" - Explain your reasoning and then state 'satisfied' or 'NOT been satisfied'.
-
-{{ bot_token }}Director decides: The condition has
--- a/src/talemate/prompts/templates/director/old/decide-action-analyze.jinja2
+++ b/src/talemate/prompts/templates/director/old/decide-action-analyze.jinja2
@@ -1,28 +0,0 @@
-<|SECTION:CONTEXT|>
-{{ character.description }}
-
-{{ character.base_attributes.get("scenario_context", "") }}
-<|CLOSE_SECTION|>
-
-{% for scene_context in scene.context_history(budget=200, add_archieved_history=False, min_dialogue=10) -%}
-{{ scene_context }}
-{% endfor %}
-
-<|SECTION:TASK|>
-Instruction: Analyze the scene so far and answer the following question(s)
-Expected response: a JSON response containing questions, answers and reasoning
-
-{% if scene.history -%}
-Last line of dialogue: {{ scene.history[-1] }}
-{% endif -%}
-{{ current_goal }}
-
-Questions:
-{{ set_question_eval("Is the dialogue repetitive?", "yes", "direct") }}
-{{ set_question_eval("Is the actor playing "+character.name+" staying true to the character and their development so far?", "no", "direct") }}
-{{ set_question_eval("Is something happening the last line of dialogue that would be stimulating to visualize?", "yes", "narrate:visual") }}
-{{ set_question_eval("Is right now a good time to interrupt the dialogue and move the story towards the goal?", "yes", "direct") }}
-<|CLOSE_SECTION|>
-
-Director answers:
-{{ set_eval_response(empty="watch") }}
--- a/src/talemate/prompts/templates/director/old/decide-action.jinja2
+++ b/src/talemate/prompts/templates/director/old/decide-action.jinja2
@@ -1,20 +0,0 @@
-{{ character.description }}
-
-{{ character.base_attributes.get("scenario_context", "") }}
-
-{% for scene_context in scene.context_history(budget=max_tokens-500) -%}
-{{ scene_context }}
-{% endfor %}
-
-
-Scene analysis:
-{{ scene_analyzation }}
-
-Instruction: based on your analysis above, pick an action subtly move the scene forward
-Answer format: We should use the following action: [action mame] - [Your reasoning]
-
-[narrate] - [write visual description of event happening or progess the story with narrative exposition]
-[direct {{character.name}}] - [direct the actor playing {{character.name}} to perform an action]
-[watch] - [do nothing, just watch the scene unfold]
-
-Director answers: We should use the following action:{{ bot_token }}[
--- a/src/talemate/prompts/templates/director/old/direct-character-analyze.jinja2
+++ b/src/talemate/prompts/templates/director/old/direct-character-analyze.jinja2
@@ -1,16 +0,0 @@
-{{ direction_prompt }}
-
-<|SECTION:DIRECTION|>
-{{ direction }}
-<|CLOSE_SECTION|>
-
-<|SECTION:TASK|>
-Instruction: Analyze the scene so far and answer the following question either with yes or no:
-
-Is this a direct, actionable direction to {{ character.name }} ?
-Is the director's instruction to {{ character.name }} in line with the character's development so far?
-Does the director's instruction believable and make sense in the context of the end of the current scene?
-Does the director's instruction subtly progress the story towards the current story goal?
-<|CLOSE_SECTION|>
-
-Director answers:
--- a/src/talemate/prompts/templates/director/old/direct-character-self-reflect.jinja2
+++ b/src/talemate/prompts/templates/director/old/direct-character-self-reflect.jinja2
@@ -1,19 +0,0 @@
-{{ direction_prompt }}
-
-<|SECTION:DIRECTION|>
-{{ direction }}
-<|CLOSE_SECTION|>
-
-<|SECTION:ANALYSIS OF DIRECTION|>
-{{ analysis }}
-<|CLOSE_SECTION|>
-
-<|SECTION:TASK|>
-Instructions: Based on your analysis above, is the director's instruction to {{ character.name }} good, neutral or bad? If its bad, change the direction. Never question the goal itself. Explain your reasoning.
-Expected response: Respond with I want to keep OR change the direction.
-
-Response example: I want to keep the direction, because ..
-Response example: I want to change the direction, because ..
-<|CLOSE_SECTION|>
-
-{{ set_prepared_response("Director reflects on his direction: I want to ") }}
--- a/src/talemate/prompts/templates/director/old/direct-character.jinja2
+++ b/src/talemate/prompts/templates/director/old/direct-character.jinja2
@@ -1,32 +0,0 @@
-<|SECTION:CONTEXT|>
-{{ character.description }}
-
-{{ character.base_attributes.get("scenario_context", "") }}
-<|CLOSE_SECTION|>
-
-{% for scene_context in scene.context_history(budget=200, add_archieved_history=False, min_dialogue=10) -%}
-{{ scene_context }}
-{% endfor %}
-
-<|SECTION:DIALOGUE ANALYSIS|>
-{{ analysis }}
-<|CLOSE_SECTION|>
-
-<|SECTION:STORY GOAL|>
-{{ current_goal }}
-<|CLOSE_SECTION|>
-
-{% if not previous_direction -%}
-<|SECTION:TASK|>
-Give actionable directions to the actor playing {{ character.name }} by instructing {{ character.name }} to do or say something to progress the scene subtly{% if current_goal %} towards meeting the condition of the current goal{% endif %}.
-<|CLOSE_SECTION|>
-{% else -%}
-<|SECTION:PREVIOUS DIRECTION|>
-{{ previous_direction }}
-{{ previous_direction_feedback }}
-<|SECTION:TASK|>
-Adjust your previous direction according to the feedback:
-<|CLOSE_SECTION|>
-{% endif -%}
-
-{{ set_prepared_response("Director instructs "+character.name+": \"To progress the scene, i want you to ") }}
--- a/src/talemate/prompts/templates/director/old/direct-narrative.jinja2
+++ b/src/talemate/prompts/templates/director/old/direct-narrative.jinja2
@@ -1,22 +0,0 @@
-{% for scene_context in scene.context_history(budget=200, add_archieved_history=False, min_dialogue=10) -%}
-{{ scene_context }}
-{% endfor %}
-
-<|SECTION:DIALOGUE ANALYSIS|>
-{{ analysis }}
-<|CLOSE_SECTION|>
-
-<|SECTION:TASK|>
-{% if narration_type == "progress" -%}
-Instruction: Analyze the dialogue and scene so far and have the director give directions to the story writer to subtly progress the current scene.
-{% elif narration_type == "visual" %}
-Instruction: Analyze the last line of the dialogue and have the director give directions to the story writer to describe the end point of the scene visually.
-{% elif narration_type == "character" %}
-{% endif -%}
-
-{% if scene.history -%}
-Last line of dialogue: {{ scene.history[-1] }}
-{% endif -%}
-{{ current_goal }}
-<|CLOSE_SECTION|>
-{{ bot_token }}Director instructs story writer:
--- a/src/talemate/prompts/templates/director/old/goal-analyze.jinja2
+++ b/src/talemate/prompts/templates/director/old/goal-analyze.jinja2
@@ -1,8 +0,0 @@
-
-{% for scene_context in scene.context_history(budget=max_tokens-300) -%}
-{{ scene_context }}
-{% endfor %}
-
-Question: Do any lines or events in the dialogue satisfy the following story condition: "{{ current_goal }}" - Explain your reasoning and then state 'satisfied' or 'NOT been satisfied'.
-
-{{ bot_token }}Director decides: The condition has
--- a/src/talemate/prompts/templates/editor/add-detail.jinja2
+++ b/src/talemate/prompts/templates/editor/add-detail.jinja2
@@ -0,0 +1,28 @@
+<|SECTION:CHARACTERS|>
+{% for character in characters -%}
+{{ character.name }}: 
+{{ character.filtered_sheet(['name', 'age', 'gender']) }}
+{{ query_memory("what is "+character.name+"'s personality?", as_question_answer=False) }}
+
+{{ character.description }}
+
+
+{% endfor %}
+<|CLOSE_SECTION|>
+<|SECTION:SCENE|>
+Content Context: {{ scene.context }}
+
+{% for scene_context in scene.context_history(budget=1000, min_dialogue=25, sections=False, keep_director=True) -%}
+{{ scene_context }}
+{% endfor %}
+<|CLOSE_SECTION|>
+<|SECTION:TASK|>
+Take the following line of dialog spoken by {{ character.name }} and flesh it out by adding minor details and flourish to it.
+
+Spoken words should be in quotes.
+
+Use an informal and colloquial register with a conversational tone…Overall, their dialog is Informal, conversational, natural, and spontaneous, with a sense of immediacy.
+<|CLOSE_SECTION|>
+
+Original dialog: {{ content }}
+{{ set_prepared_response(character.name+":", prepend="Fleshed out dialog: ") }}
--- a/src/talemate/prompts/templates/editor/edit-dialogue.jinja2
+++ b/src/talemate/prompts/templates/editor/edit-dialogue.jinja2
@@ -0,0 +1,11 @@
+<|SECTION:{{ character.name }}'S WRITING STYLE|>
+{% for example in character.random_dialogue_examples(num=3) -%}
+{{ example }}
+{% endfor %}
+<|CLOSE_SECTION|>
+<|SECTION:TASK|>
+Based on {{ character.name }}'s typical writing style, please adjust the following line to their mannerisms and style of speaking:
+
+{{ content }}
+<|CLOSE_SECTION|>
+I have adjusted the line: {{ set_prepared_response(character.name+":") }}
--- a/src/talemate/prompts/templates/editor/fix-exposition.jinja2
+++ b/src/talemate/prompts/templates/editor/fix-exposition.jinja2
@@ -0,0 +1,29 @@
+<|SECTION:EXAMPLES|>{{ disable_dedupe() }}
+Input: {{ character.name }}: She whispered, Don't tell anyone. with a stern look.
+Output: {{ character.name }}: *She whispered,* "Don't tell anyone." *with a stern look.*
+
+Input: {{ character.name }}: Where are you going? he asked, looking puzzled. I thought we were staying in.
+Output: {{ character.name }}: "Where are you going?" *he asked, looking puzzled.* "I thought we were staying in."
+
+Input: {{ character.name }}: With a heavy sigh, she said, I just can't believe it. and walked away.
+Output: {{ character.name }}: *With a heavy sigh, she said,* "I just can't believe it." *and walked away.*
+
+Input: {{ character.name }}: It's quite simple, he explained. You just have to believe.
+Output: {{ character.name }}: "It's quite simple," *he explained.* "You just have to believe."
+
+Input: {{ character.name }}: She giggled, finding his antics amusing. You're such a clown!
+Output: {{ character.name }}: *She giggled, finding his antics amusing.* "You're such a clown!"
+
+Input: {{ character.name }}: He frowned, noticing the dark clouds gathering overhead. Looks like a storm is coming.
+Output: {{ character.name }}: *He frowned, noticing the dark clouds gathering overhead.* "Looks like a storm is coming."
+
+Input: {{ character.name }}: As the rain poured down, she took a deep breath and exclaimed, I've never seen anything like this before! It's absolutely breathtaking. She then grabbed her umbrella and added, Come on, let's go explore!
+Output: {{ character.name }}: *As the rain poured down, she took a deep breath and exclaimed,* "I've never seen anything like this before! It's absolutely breathtaking." *She then grabbed her umbrella and added,* "Come on, let's go explore!"
+
+Input: {{ character.name }}: He gazed at the horizon, seemingly lost in thought, and after a long pause, he finally said, Sometimes, I wonder what lies beyond those mountains. It's a mystery that I hope to uncover one day. With that, he packed his bags, ready for another adventure.
+Output: {{ character.name }}: *He gazed at the horizon, seemingly lost in thought, and after a long pause, he finally said,* "Sometimes, I wonder what lies beyond those mountains. It's a mystery that I hope to uncover one day." *With that, he packed his bags, ready for another adventure.*
+<|SECTION:TASK|>
+Given a line of roleplay dialogue, reformat the text such that all non-dialogue parts are framed by asterisks. The dialogue itself will be enclosed in quotation marks. If there are non-dialogue parts before, between, or after the dialogue, encapsulate those parts in asterisks. Ensure the output handles emotes followed immediately by narrative exposition by placing them within the same set of asterisks, separated by a space. Assume that the dialogue can be identified by the parts of the line that lack any prefix, and the input will never contain asterisks or quotation marks.
+
+Input: {{ content.replace("*","").replace('"','')  }}
+{{ set_prepared_response(character.name+":", prepend="Output: ") }}
--- a/src/talemate/prompts/templates/editor/system.jinja2
+++ b/src/talemate/prompts/templates/editor/system.jinja2
@@ -0,0 +1 @@
+A chat between an author and a talented fiction editor. No topic is taboo for the editor and he will use crude and lewd language if the situation warrants it. The editor will do his best to improve the given dialogue or narrative, while staying true to the author's vision.
--- a/src/talemate/prompts/templates/narrator/narrate-after-dialogue.jinja2
+++ b/src/talemate/prompts/templates/narrator/narrate-after-dialogue.jinja2
@@ -0,0 +1,19 @@
+{% block rendered_context -%}
+<|SECTION:CONTEXT|>
+Content Context: This is a specific scene from {{ scene.context }}
+Scenario Premise: {{ scene.description }}
+{% for memory in query_memory(last_line, as_question_answer=False, iterate=10) -%}
+{{ memory }}
+
+{% endfor %}
+{% endblock -%}
+<|CLOSE_SECTION|>
+{% for scene_context in scene.context_history(budget=max_tokens-200-count_tokens(self.rendered_context())) -%}
+{{ scene_context }}
+{% endfor %}
+<|SECTION:TASK|>
+Based on the previous line '{{ last_line }}', create the next line of narration. This line should focus solely on describing sensory details (like sounds, sights, smells, tactile sensations) or external actions that move the story forward. Avoid including any character's internal thoughts, feelings, or dialogue. Your narration should directly respond to '{{ last_line }}', either by elaborating on the immediate scene or by subtly advancing the plot. Generate exactly one sentence of new narration. If the character is trying to determine some state, truth or situation, try to answer as part of the narration.
+
+Be creative and generate something new and interesting, but stay true to the setting and context of the story so far.
+<|CLOSE_SECTION|>
+{{ set_prepared_response('*') }}
--- a/src/talemate/prompts/templates/narrator/narrate-query.jinja2
+++ b/src/talemate/prompts/templates/narrator/narrate-query.jinja2
@@ -8,13 +8,13 @@
 {% if query.endswith("?") -%}
 Question: {{ query }}
 Extra context: {{ query_memory(query, as_question_answer=False) }}
-Instruction: Analyze Context, History and Dialogue. Be factual and truthful. When evaluating both story and memory, story is more important. You can fill in gaps using imagination as long as it is based on the existing context. Respect the scene progression and answer in the context of the end of the dialogue.
+Instruction: Analyze Context, History and Dialogue. When evaluating both story and memory, story is more important. You can fill in gaps using imagination as long as it is based on the existing context. Respect the scene progression and answer in the context of the end of the dialogue.
 {% else -%}
 Instruction: {{ query }}
 Extra context: {{ query_memory(query, as_question_answer=False) }}
-Answer based on Context, History and Dialogue. Be factual and truthful. When evaluating both story and memory, story is more important. You can fill in gaps using imagination as long as it is based on the existing context. 
+Answer based on Context, History and Dialogue. When evaluating both story and memory, story is more important. You can fill in gaps using imagination as long as it is based on the existing context. 
 {% endif -%}
 Content Context: This is a specific scene from {{ scene.context }}
-Narration style: point and click adventure game from the 90s
+Your answer should be in the style of short narration that fits the context of the scene.
 <|CLOSE_SECTION|>
 Narrator answers: {% if at_the_end %}{{ bot_token }}At the end of the dialogue, {% endif %}
--- a/src/talemate/prompts/templates/narrator/narrate-scene.jinja2
+++ b/src/talemate/prompts/templates/narrator/narrate-scene.jinja2
@@ -1,15 +1,12 @@
-<|SECTION:CONTEXT|>
-Scenario Premise: {{ scene.description }}
-<|CLOSE_SECTION|>

 {% for scene_context in scene.context_history(budget=max_tokens-300) -%}
 {{ scene_context }}
 {% endfor %}
-
-<|SECTION:TASK|>
-Question: What happens at the end of the dialogue progression? Summarize into narrative description.
+<|SECTION:CONTEXT|>
 Content Context: This is a specific scene from {{ scene.context }}
-Narration style: point and click adventure game from the 90s
-Expected Answer: A summarized narrative description of the scene unfolding at the dialogue that can be inserted into the ongoing story in place of the dialogue.
+Scenario Premise: {{ scene.description }}
 <|CLOSE_SECTION|>
-Narrator answers: {{ set_prepared_response("You see ") }}
+<|SECTION:TASK|>
+Provide a visual description of what is currently happening in the scene. Don't progress the scene.
+<|CLOSE_SECTION|>
+{{ bot_token }}At the end of the scene we currently see:
--- a/src/talemate/prompts/templates/narrator/narrate-time-passage.jinja2
+++ b/src/talemate/prompts/templates/narrator/narrate-time-passage.jinja2
@@ -0,0 +1,16 @@
+<|SECTION:CONTEXT|>
+Scenario Premise: {{ scene.description }}
+NPCs: {{ scene.npc_character_names }}
+Player Character: {{ scene.get_player_character().name }}
+Content Context: {{ scene.context }}
+<|CLOSE_SECTION|>
+
+{% for scene_context in scene.context_history(budget=max_tokens-300) -%}
+{{ scene_context }}
+{% endfor %}
+
+<|SECTION:TASK|>
+Narrate the passage of time that just occured, subtly move the story forward, and set up the next scene.
+Write 1 to 3 sentences.
+<|CLOSE_SECTION|>
+{{ bot_token }}{{ narrative }}:
--- a/src/talemate/prompts/templates/summarizer/analyze-dialogue.jinja2
+++ b/src/talemate/prompts/templates/summarizer/analyze-dialogue.jinja2
@@ -0,0 +1,6 @@
+
+{{ dialogue }}
+<|SECTION:TASK|>
+Examine the dialogue from the beginning and find the last line that marks a scene change. Repeat the line back to me exactly as it is written.
+<|CLOSE_SECTION|>
+{{ bot_token }}The first line that marks a scene change is:
--- a/src/talemate/prompts/templates/summarizer/request-world-state-inline-items.jinja2
+++ b/src/talemate/prompts/templates/summarizer/request-world-state-inline-items.jinja2
@@ -1,11 +0,0 @@
-Instructions: Mark all tangible physical subjects in the sentence with brackets. For example, if the line of dialogue is "John: I am going to the store." and you want to mark "store" as a subject, you would write "John: I am going to [the  store]."
-
-Sentence:
-Barbara: *Barabara sits down on the couch while John is watching TV* Lets see whats on *She takes the remote and starts flipping through channels. She occasionally snaps her wristband while she does it* 
-
-Sentence with tangible physical objects marked:
-Barbara: *Barabara sits down on [the couch] while John is watching [TV]* Lets see whats on *She takes [the remote] and starts flipping through [channels]. She occasionally snaps [her wristband] while she does it*
-
-Sentence:
-{{ scene.history[-1] }}
-Sentence with tangible physical objects marked::{{ bot_token }}
--- a/src/talemate/prompts/templates/summarizer/summarize-dialogue.jinja2
+++ b/src/talemate/prompts/templates/summarizer/summarize-dialogue.jinja2
@@ -5,6 +5,5 @@
 <|SECTION:TASK|>
 Question: What happens within the dialogue? Summarize into narrative description.
 Content Context: This is a specific scene from {{ scene.context }}
-Expected Answer: A summarized narrative description of the dialogue that can be inserted into the ongoing story in place of the dialogue.
-<|CLOSE_SECTION|>
-Narrator answers: 
+Expected Answer: A summarized narrative description of the dialogue that can be inserted into the ongoing story in place of the dialogue. 
+<|CLOSE_SECTION|>
--- a/src/talemate/prompts/templates/world_state/analyze-text-and-answer-question.jinja2
+++ b/src/talemate/prompts/templates/world_state/analyze-text-and-answer-question.jinja2
@@ -0,0 +1,8 @@
+
+{{ text }}
+
+<|SECTION:TASK|>
+Analyze the text above and answer the question.
+
+Question: {{ query }}
+{{ bot_token }}Answer:
--- a/src/talemate/prompts/templates/world_state/analyze-text-and-extract-context.jinja2
+++ b/src/talemate/prompts/templates/world_state/analyze-text-and-extract-context.jinja2
@@ -0,0 +1,17 @@
+
+<|SECTION:CONTEXT|>
+{% for memory in query_memory(text, as_question_answer=False, max_tokens=max_tokens-500, iterate=20) -%}
+{{ memory }}
+
+{% endfor -%}
+<|CLOSE_SECTION|>
+<|SECTION:TASK|>
+Answer the following questions:
+
+{{ instruct_text("Ask the narrator three (3) questions to gather more context from the past for the continuation of this conversation. If a character is asking about a state, location or information about an item or another character, make sure to include question(s) that help gather context for this.", text) }}
+
+You answers should be precise, truthful and short. Pay close attention to timestamps when retrieving information from the context.
+
+<|CLOSE_SECTION|>
+<|SECTION:RELEVANT CONTEXT|>
+{{ bot_token }}Answers: 
--- a/src/talemate/prompts/templates/world_state/analyze-text-and-follow-instruction.jinja2
+++ b/src/talemate/prompts/templates/world_state/analyze-text-and-follow-instruction.jinja2
@@ -0,0 +1,5 @@
+
+{{ text }}
+
+<|SECTION:TASK|>
+{{ instruction }}
--- a/src/talemate/prompts/templates/world_state/analyze-time-passage.jinja2
+++ b/src/talemate/prompts/templates/world_state/analyze-time-passage.jinja2
@@ -0,0 +1,5 @@
+<|SECTION:SCENE|>
+{{ text }}
+<|SECTION:TASK|>
+Question: How much time has passed in the scene above?
+{{ bot_token }}Answer (ISO8601 duration): P
--- a/Show More
+++ b/Show More
Author	SHA1	Message	Date
FInalWombat	d250df8950	Prep 0.13.2 (#33 ) * fix issue with client removal * client type not editable after creation (keeps things simple) * fixes issue with openai client bugging out (api_url not set) * fix issues with edit client not reflecting changes to UX * 0.13.2	2023-11-19 20:43:15 +02:00
FInalWombat	816f950afe	Prep 0.13.1 (#29 ) * narrate after dialog constrained a bit more so it doesnt create something unrelated * fix issue where textgenwebui client would come back as status ok even though no model was loaded * 0.13.1	2023-11-19 18:58:40 +02:00
FInalWombat	8fb72fdbe9	Update README.md	2023-11-19 14:05:09 +02:00
FInalWombat	54297a4768	Update README.md	2023-11-18 12:20:02 +02:00
FInalWombat	d7e72d27c5	Prep 0.13.0 (#28 ) * requirements.txt file * windows installs from requirements.txt because of silly permission issues * relock * narrator - narrate on dialogue agent actions * add support for new textgenwebui api * world state auto regen trigger off of gameloop * funciton !rename command * ensure_dialog_format error handling * Cat, Nous-Capybara, dolphin-2.2.1 * narrate after dialog rerun fixes, template fixes * LMStudio client (experimental) * dolhpin yi * refactor client base * cruft * openai client to new base * more client refactor fixes * tweak context retrieval prompts * adjust nous capybara template * add Tess-Medium * 0.13.0 * switch back to poetry for windows as well * error on legacy textgenwebui api * runpod text gen api url fixed * fix windows install script * add fllow instruction template * Psyfighter2	2023-11-18 12:16:29 +02:00
FInalWombat	f9b23f8705	Update README.md	2023-11-14 11:06:52 +02:00
FInalWombat	37a5873330	Update README.md	2023-11-12 15:43:02 +02:00
FInalWombat	bc3f5d63c8	Add files via upload	2023-11-12 15:42:07 +02:00
FInalWombat	72202dee02	Prep 0.12.0 (#26 ) * no " or * just treat as spoken words * chromadb perist to db * collect name should contain embedding so switching between chromadb configurations doesn't brick your scenes * fix save-as long term memory transfer * add chroma * director agent refactor * tweak director command, prompt reset, ux display * tweak director message ux * allow clearing of prompt log * remove auto adding of quotes if neither quote or * are present * command to reset long term memory for the scene * improve summarization template as it would cause some llms to add extra details * rebuilding history will now also rebuild long term memory * direct scene template * fix scene time reset * dialogue template tweaks * better dialog format fixing * some dialogue template adjustments * adjust default values of director agent * keep track of scene saved/unsaved status and confirm loading a different scene if current scene is unsaved * prompt fixes * remove the collection on recommitting the seen to memory, as the embeddings may have changed * change to the official python api for the openai client and make it async * prompt tweaks * world state prompt parsing fixes * improve handling of json responses * 0 seconds ago changed to moments ago * move memory context closer to scene * token counts for openai client * narrator agent option: narrate passage of time * gitignore * remove memory id * refactor world state with persistence to chromadb (wip) * remove world state update instructions * dont display blank emotion in world state * openai gpt-4 turbo support * conversation agent extra instructions * track prompt response times * Yi and UtopiaXL * long term memory retrieval improvements during conversations * narrate scene tweaks * conversation ltm augment tweaks * hide subconfig if parent config isnt enabled * ai assisted memory recall during conversation default to off * openai json_object coersion only on model that supports it openai client emit prompt processing time * 0.12.0 * remove prompt number from prompt debug list * add prompt number back in but shift it to the upper row * narrate time passage hard content limit restriction for now as gpt-4 would just write a whole chapter. * relock	2023-11-10 22:45:50 +02:00
FInalWombat	91f228aa68	Update __init__.py	2023-10-29 15:45:56 +02:00
FInalWombat	27d6c5e7c2	Update pyproject.toml	2023-10-29 15:45:37 +02:00
FInalWombat	1f5cff4c6d	remove debug output	2023-10-29 15:44:40 +02:00
FinalWombat	77425935be	update version	2023-10-28 12:45:07 +03:00
FInalWombat	e6b21789d1	Prep 0.11.0 (#19 ) * dolphin mistral template * removate trailing \n before attaching the model response * improve prompt and validator for generated human age * fix issue where errors during character creation process would not be communicated to the ux and the character creator would appear stuck * add dolphin mistral to list * add talemate_env * poetry relock * add json schema for talemate scene files * fix issues with pydantic after version upgrade * add json extrac util functions * fix pydantic model * use extract json function * scene generator, better scene name prompt * OpenHermes-2-Mistral * alpaca base template Amethyst 20B template * character description is no longer part of the sheet and needs to be added separately * fix pydantic validation * fix issue where sometimes partial emote strings were kept at the end of dialogue * no need to commit character name to memory * dedupe prompts * clean up extra linebreaks in prompts * experimental editor agent agent signals first progress * take out hardcoded example * amethyst llm prompt template * editor agent disableable agent edit modal tweaks * world state agent agent action config schema * director agent disableable remove automatic actions config from ux (deprecated) * fix responsive update when toggling enable on or off in agent dialog * prompt adjustments fix divine intellect preset (mirostat values were way off) fix world state regenerating every turn regardless of setting * move templates for world state from summarizer to worldstate agent * conversation agent generation lenght setting * conversation agent jiggle attribute (randomize offset to certain inference parameters) * relabel * scene cover image set to cover as much space as it can * add character sheet to dialogue example generate prompt * character creator agent mixin use set_processing * add <\|im_end\|> to stopping strings * add random number gen to template functions * SynthIA and Tiefighter * create new persisted characters ouf of world state natural flow option for conversation agent to help guide multi character conversations * conversation agent natural flow improvements * fix bug with 1h time passage option * some templates * poetry relock * fix config validation * fix issues when detemrining scene history context length to stay within budget * fixes to world state json parsing fixes to conversation context length * remove unused import * update windows install scripts * zephyr * </s> stopping string * dialog cleanup utils improved * add agents and clients key to the config example	2023-10-28 11:33:51 +03:00
FInalWombat	89d7b9d6e3	Update README.md	2023-10-25 09:41:42 +03:00
FInalWombat	c36fd3a9b0	fixes character descriptions missing from dialogue prompt (#21 ) * character description is no longer part of the sheet and needs to be added separately * prep 0.10.1	2023-10-19 03:05:17 +03:00
FInalWombat	5874d6f05c	Update README.md	2023-10-18 03:28:30 +03:00
FInalWombat	4c15ca5290	Update linux-install.md	2023-10-15 16:09:41 +03:00
FInalWombat	595b04b8dd	Update README.md	2023-10-15 12:44:30 +03:00
FInalWombat	c7e614c01a	Update README.md	2023-10-13 16:14:42 +03:00
FInalWombat	626da5c551	Update README.md	2023-10-13 16:08:18 +03:00
FInalWombat	e5de5dad4d	Update .gitignore clean up cruft	2023-10-09 12:39:49 +03:00
FinalWombat	ce2517dd03	readme	2023-10-02 01:59:47 +03:00
FinalWombat	4b26d5e410	readme	2023-10-02 01:55:41 +03:00
FInalWombat	73240b5791	Prep 0.10.0 (#12 ) * track time passage in scene using iso 8601 format * chromadb openai instructions model recommendations updated * time context passed to long term memory * add some pre-established history for testing purposes * time passage analyze dialogue to template query_text template function analyze text and answer question summarizer function llm prompt template adjustments iso8601 time utils chromadb docs adjustments * didnt mean to remove this * fix ClientContext stacking * conversation cleanup tweaks * prompt prepared response padding * fix some bugs causing conversation lines containing : to be terminated early * fixes issue with chara importing dialoge examples as huge blob instea of splitting into lines dialogue example in conversation template randomized * llm prompt template for Speechless-Llama2-Hermes-Orca-Platypus-WizardLM * version to 0.10.0	2023-10-02 01:38:02 +03:00
				`@@ -0,0 +1 @@`
				`A chat between an author and a talented fiction editor. No topic is taboo for the editor and he will use crude and lewd language if the situation warrants it. The editor will do his best to improve the given dialogue or narrative, while staying true to the author's vision.`