Use Other LLMs with "Transform and Replace Selection with ChatGPT"

Hi,

I have a question and a suggestion for the "Transform & Replace Selection With ChatGPT" function. Would it be feasible to support other LLMs in addition to the default ChatGPT from OpenAI?

The idea would be to connect to other LLM providers like Groq or OpenRouter through their OpenAI-compatible APIs.

It would mean being able to customize the API URL and key, and to specify the desired language model.

This would allow for a broader range of language models to choose from.

Thank you in advance for your response.

Best regards,

You can achieve this using Transform & Replace Selection With Java Script:

I already run local models and also openrouter.ai (it wraps many models behind a single API address). I use a custom endpoint and can tweak other things like the system prompt. Allowing other OpenAI-compatible APIs in the ChatGPT action would be straightforward and great.

As a next step, I think expanding support in "Transform and Replace Selection with ChatGPT" from OpenAI to other providers would be a separate feature request. Of course OpenRouter is neat, but it is still a single provider. Unfortunately, for native support each API is different (Anthropic vs. Google vs. OpenAI), and these APIs change relatively quickly. There are other differences, like the model names and parameters each API supports, so it will need some thought about how configurable this should be. That is an additional burden, so the cost of maintaining this feature will be higher than for other actions...
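
To illustrate the incompatibility: even a minimal chat request differs between providers, so supporting them natively is more than a URL swap. A hedged sketch (the model ids and required fields follow each provider's public docs at the time of writing and may have changed):

```javascript
// OpenAI-style request body (also what OpenAI-compatible endpoints accept):
const openAIBody = {
  model: "gpt-4o",
  messages: [{ role: "user", content: "Hello" }],
};

// Anthropic's Messages API differs: max_tokens is mandatory, and the
// system prompt is a top-level field rather than a "system" message:
const anthropicBody = {
  model: "claude-3-haiku-20240307",
  max_tokens: 1024,
  system: "You are a helpful assistant.",
  messages: [{ role: "user", content: "Hello" }],
};
```

So a native multi-provider feature would need one adapter per API shape, each kept up to date as the provider's API evolves.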

1 Like

@iandol Thank you for your feedback, this alternative solution is interesting. Would it be possible to share your preset with me?

1 Like

So I use hyperkey+1 to trigger an AI editor and hyperkey+2 to trigger a general question answerer. These are named triggers that run the JavaScript.

AI.bttpreset (29.9 KB)

Here you can see the settings, and then an example where I select some text and use hyperkey+1 to improve the text clarity. I am using LM Studio (API address is http://localhost:4891/v1/chat/completions) as a local server with the Phi3 model loaded.
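
Only the endpoint and the model name in the script need to change for a local server. A sketch of the changed pieces (the `buildRequest` helper is hypothetical, just to show the shape; use whatever model id LM Studio reports for the loaded model):

```javascript
// LM Studio's local OpenAI-compatible endpoint:
const API_URL = "http://localhost:4891/v1/chat/completions";
// Example model id as reported by LM Studio for the loaded model:
const MODEL = "lmstudio-community/Phi-3-mini-4k-instruct-BPE-fix-GGUF";

// Build the fetch options; note that no Authorization header is needed
// for a local server.
function buildRequest(messages) {
  return {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ model: MODEL, messages }),
  };
}
```

In the script you would then call `await fetch(API_URL, buildRequest(previousChat))`.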

I also tried using Clipboard Manager actions as a way to store the JS more easily, but I don't use this actively (it's included just for show).

For the editor and assistant the system prompts are:

// Editor:
const BEHAVIOR_DESCRIPTION = "You are an AI assistant that helps improve the clarity and grammar of written English texts. Please improve the following text: ";

// Assistant:
const BEHAVIOR_DESCRIPTION = "You are a helpful AI assistant who gives precise and detailed step-by-step answers.";

You will need to tweak the address and probably the model names for OpenRouter etc. Local LLMs are awesome enough that I rarely use online models anymore...
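
For OpenRouter, the equivalent tweaks would look roughly like this (the model id is an example only; check the current list on openrouter.ai, and unlike a local server an API key is required):

```javascript
const API_URL = "https://openrouter.ai/api/v1/chat/completions";
const MODEL = "meta-llama/llama-3-8b-instruct"; // example id from OpenRouter's model list
const OPENAI_API_KEY = "YOUR_OPENROUTER_KEY";   // sent as a Bearer token, as in the script
```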

1 Like

I'll soon add some configuration options to set custom URLs for the chatgpt actions in BTT - then at least it should work with GPT4All and the like. For now @iandol's solution is great!

1 Like

Thank you for sharing. I managed to use a Groq model with the ChatGPT script provided on the blog https://folivora.ai/blog. However, when I trigger the action, it doesn't replace the selection; it keeps the original text and appends the output after it. How can I replace the selected text?

Also, when I trigger the action, it does not display the HUD indicating that the action is in progress or completed.

async (clipboardContentString) => {
    // ------------------------------------------------
    // CONFIGURATION (API Key):
    // ------------------------------------------------
    const OPENAI_API_KEY = "API_KEY";

    // here you can pre-configure the AI with some configuration prompt:
    const BEHAVIOR_DESCRIPTION = "You are a helpful AI assistant.";

    // ------------------------------------------------
    // CODE starts here (no change required)
    // ------------------------------------------------

    let previousChat = [];
    showHUD("Transforming...", "brain", 100);
    try {
        if (BEHAVIOR_DESCRIPTION) {
            previousChat.push({ "role": "system", "content": BEHAVIOR_DESCRIPTION });
        }

        previousChat.push({ "role": "user", "content": clipboardContentString ? clipboardContentString : '' });

        const response = await fetch("https://api.groq.com/openai/v1/chat/completions", {
            method: "POST",
            headers: {
                "Content-Type": "application/json",
                Authorization: `Bearer ${OPENAI_API_KEY}`,
            },
            body: JSON.stringify({
                model: "llama3-8b-8192",
                messages: previousChat
            }),
        });

        showHUD("Success", "checkmark.circle", 1);

        const data = await response.json();
        const text = data.choices[0].message.content;
        return `${clipboardContentString} ${text}`;
    } catch (error) {
        showHUD("Error", "x.circle", 3);
        return "Error";
    }

    function showHUD(text, icon, duration) {
        const hudConfig = {
            BTTActionHUDHideWhenOtherHUDAppears: true,
            BTTActionHUDDetail: text,
            BTTIconConfigSFSymbolName: icon,
            BTTActionHUDDuration: duration,
            BTTActionHUDSlideDirection: 0,
            BTTIconConfigImageHeight: 50,
            BTTActionHUDTitle: "",
            BTTIconConfigIconType: 2,
        };

        const actionDefinition = {
            BTTWaitForReply: false,
            BTTPredefinedActionType: 254,
            BTTHUDActionConfiguration: JSON.stringify(hudConfig),
        };

        callBTT("trigger_action", { json: JSON.stringify(actionDefinition) });
    }
};

Change this

        return `${clipboardContentString} ${text}`;

To this

        return `${text}`;
1 Like

Thank you for the tip. Also, when I trigger the action, it does not show the HUD indicating that the action is in progress or completed, as seen in the demonstration video on the blog.

1 Like

That is weird, I just tried again but the HUD is showing up fine here.
Which version of macOS (and BTT) are you on?

1 Like

I use macOS 14.5 (23F79) and BetterTouchTool 4.587 (45870).

1 Like

Strange, I have no idea why that wouldn't work. I just tried on another system and it's also showing up fine there.

Is the action in general running correctly?

Anyhow, with 4.591 alpha you can now set a custom URL to the default actions:

(the custom URL will only be used if you also enter an API Key)

2 Likes

I tested the new feature with a custom URL, but when I trigger the action, nothing happens. I filled in the model name, API key, and URL with https://api.groq.com/openai/v1/chat/completions.

I also noticed that after I filled in the URL, the other actions using the native ChatGPT function without an API key broke and no longer work.

1 Like

Ah, unfortunately Groq is not yet API-compatible with the OpenAI chat/completions endpoint. They seem to be working on this. In that case you'll need to stick with the JavaScript approach for now.

Too bad, I thought it was possible as it worked well with the script. Thanks anyway for trying.

The script sends fewer options, so it doesn't run into the "unsupported" error.

1 Like

4.593 should work fine with groq! (uploading now)

1 Like

Perfect, it works perfectly, thank you for your promptness and for finding the solution.

I also just tested it, and it works perfectly with the OpenRouter API.

Have a good evening!

Great news!

For local LLMs no API key is used or needed, but the custom URL is. I can enter a dummy key as a workaround, but just to mention: requiring a key for the URL to take effect is not ideal for local AI tools...

Can confirm that with these settings a local model run using LM Studio works well with this action :star_struck:

One small issue: while the action itself works, for some reason "Test Execute" does not; it sends no request (I can check the LM Studio server log directly to confirm this)...

1 Like

One other small issue, this is what BTT sends:

[2024-06-21 08:20:04.696] [INFO] Received POST request to /v1/chat/completions with body: {
  "model": "lmstudio-community/Phi-3-mini-4k-instruct-BPE-fix-GGUF",
  "messages": [
    {
      "content": "You are a helpful assistant who interprets every input as raw text unless instructed otherwise. Your answers do not include a description unless prompted to do so.",
      "role": "system"
    },
    {
      "content": "You are an English editor who will correct the following text for clarity and precision of language use: : What is it goona a be huh?",
      "role": "user"
    }
  ]
}

Note there is a system prompt and a user prompt. It would be good if we could ALSO edit the system prompt, as it is important for defining the behaviour, which may depend on the model...

Or, only allow the system prompt to be set and use the selected text as the user prompt; that way you instruct the model via the system prompt and pass the raw text through untouched in the user prompt.
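
That scheme could be sketched like this (`buildMessages` is a hypothetical name for what BTT would do internally):

```javascript
// All behaviour instructions live in the system prompt; the selected
// text is passed through untouched as the user message.
function buildMessages(systemPrompt, selectedText) {
  return [
    { role: "system", content: systemPrompt },
    { role: "user", content: selectedText },
  ];
}
```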

1 Like