Open-Interface is an open-source project that enables users to control their computers using Large Language Models (LLMs). By leveraging LLMs like OpenAI's
GPT-4, Open-Interface interprets user requests, determines the necessary steps, and autonomously executes them through simulated keyboard and mouse inputs. This process includes real-time course correction by providing the LLM with updated screenshots to ensure accurate task completion.
The platform supports multiple operating systems, including Windows, macOS, and Linux. For Windows users, a pre-built executable is available for download from the latest release; simply unzip the folder and run the executable to get started. Alternatively, users can clone the repository, set up a Python virtual environment, install the required dependencies, and run the application script to utilize Open-Interface's capabilities.
To assist users in understanding its functionalities, Open-Interface provides demo videos showcasing tasks like writing a web application and navigating web pages. These resources are accessible in the MEDIA.md file within the repository. For detailed setup instructions and more information, users can refer to the README.md file in the repository.