Members-Only
Recent Talks & Demos are for members only
You must be an AI Tinkerers active member to view these talks and demos.
Agent-Human Interactive Video Studio
See how an agentic video studio creates narrated technical explainer videos from source material, with human review at each scene for inspectable, editable project state.
I built an agentic video studio that turns source material into reviewed, narrated technical video through structured scene files. Think of it as custom tool harness for end to end technical explainer video creation for an an ai to orchestrate with the human author in the loop with optimized UI.
The demo will show the workflow end to end: source material becomes proposed scenes, each scene carries script, visual intent, assets, narration settings, approval state, and renderer-ready output, and the human reviews the work before it moves downstream. The point is not a one-shot “AI made a video” demo. It is a durable workflow where agent output becomes inspectable / editble project state, narration can be regenerated per scene, and the renderer only consumes approved data.
Prototype human-agent workflow orchestrating structured technical video authoring and rendering.
Incomplete documentation for a local agent stack installation project.
- TypeScriptTypeScript is an open-source superset of JavaScript: it adds static typing and compiles to clean, standards-based JavaScript.TypeScript is a high-level, open-source language developed by Microsoft: it acts as a superset of JavaScript, adding a powerful static type system. This system enables compile-time type checking, catching errors before runtime (a critical benefit for large-scale applications). The TypeScript Compiler (TSC) reliably transpiles all code into clean, standards-based JavaScript (ES3 or newer), ensuring compatibility across any browser or host environment (Node.js, React.js, etc.).
- ReactReact is an open-source JavaScript library for building dynamic user interfaces (UIs).React is a component-based JavaScript library, developed by Meta (Facebook), engineered for building fast, declarative UIs. It mandates a one-way data flow and utilizes a Virtual DOM mechanism to ensure efficient, predictable updates to the user interface. Developers construct complex UIs by composing small, encapsulated components; this architecture promotes code reusability and simplifies state management across large applications. The library employs JSX (a syntax extension) to integrate HTML-like markup directly within JavaScript logic, supporting development for both web (React DOM) and native mobile platforms (React Native).
- FastAPIFastAPI is a modern, high-performance Python web framework for building APIs with automatic OpenAPI documentation.FastAPI is a robust, high-speed Python web framework: it is built on Starlette (for async capabilities) and Pydantic (for data validation and serialization). Leveraging standard Python 3.8+ type hints, the framework automatically generates interactive API documentation (Swagger UI/ReDoc) and enforces data validation, effectively reducing developer-induced errors by an estimated 40%. This architecture delivers performance on par with Node.js and Go, significantly increasing feature development speed (up to 300% faster). It is production-ready, fully supporting OpenAPI and JSON Schema standards for all API specifications.
- RemotionCreate dynamic, real MP4 videos programmatically using the React framework.Remotion is the open-source framework that lets developers build videos using their existing React, CSS, and WebGL skills: think of it as React for video. You define each frame as a React component, enabling powerful composition and reusable assets. The technology supports dynamic editing, allowing you to parameterize video content by passing data, which is crucial for building data-driven applications like 'Year in Review' videos or music visualizations. Crucially, Remotion handles the scalable rendering, outputting the final .mp4 file either locally, on a dedicated server, or serverless via Remotion Lambda.
- OpenAI APIOpenAI API: Your direct gateway to cutting-edge AI models (GPT-4o, DALL-E 3, Whisper), enabling scalable, multimodal intelligence integration into any application.The OpenAI API provides authenticated, programmatic access to a powerful suite of generative AI models. Developers leverage REST endpoints and official libraries (Python, Node.js) to integrate capabilities like advanced text generation (GPT-4o), image creation (DALL-E 3), and speech-to-text transcription (Whisper). This platform is engineered for scale, supporting millions of daily requests for tasks from complex reasoning to real-time customer support agents, ensuring your application gets reliable, state-of-the-art intelligence.