🖱️ AI Cursor Helper

See in spanish/Ver en español

🖱️ AI Cursor Helper

AI Assistant that helps you localize and open programs using text input or voice.

📝 Technology Explanation

The program makes a screenshot of your current screen and sends it to Gemini with a prompt for it to identify the location of your request and (if checked) try to open it.

It is recommended to use the program with a plain background, where the program icons can be seen easily.

Currently, it uses Gemini Flash 2.5, the most recent and advanced version of the model, even so, it will often make mistakes, hopefully it gets sharper in newer versions.

Important

Be aware that Gemini AI doesn't actually know the programs installed in your device and can make mistakes, so enable the checkbox to let Gemini try to open only if you make sure nothing unexpected can happen.

📋 Prerequisites

You'll need two things to use this program: Gemini API KEY and the required Python libraries installed.

Obtain your Gemini API Key by visiting Google AI Studio. Ensure you are logged into your Google account, then press the blue button that says 'Create API key' and follow the steps to set up your Google Cloud Project and retrieve your API key. Make sure to save it in a safe place.
Google allows free use of this API without adding billing information, but there are some limitations.

In Google AI Studio, you can monitor the AI's usage by clicking 'View usage data' in the 'Plan' column where your projects are displayed. I recommend monitoring the 'Quota and system limits' tab and sorting by 'actual usage percentage,' as it provides further more detailed information.

Then, download or clone the Python script and run this command on the same folder:

python pip install pystray pillow speechrecognition google-generativeai pyaudio pyautogui

Or if it fails or you have a different Python version:

py -m pip install pystray pillow speechrecognition google-generativeai pyaudio pyautogui

💻 Technologies Used

Programming Language: Python
Libraries:
- tkinter
- pystray
- PIL
- io
- threading
- speech_recognition
- google.generativeai
- pyautogui
- re
Other:
- Gemini API Key
Recommended IDE: VS Code

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
AiCursorHelper.pyw		AiCursorHelper.pyw
LICENSE		LICENSE
README.es.md		README.es.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🖱️ AI Cursor Helper

📝 Technology Explanation

📋 Prerequisites

💻 Technologies Used

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🖱️ AI Cursor Helper

📝 Technology Explanation

📋 Prerequisites

💻 Technologies Used

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages