Python tool for capturing and logging human-computer interactions. Generate rich datasets for training multi-modal LLMs in autonomous computer control. Features screenshot, mouse, keyboard, and audio recording.
nlp machine-learning automation computer-vision screen-capture audio-recording dataset-generation human-computer-interaction computer-interaction ai-training ai-dataset autonomous-control multi-modal-llm input-logging
-
Updated
Sep 16, 2024 - Python