Skip to content

1.5.0

Compare
Choose a tag to compare
@Shackless Shackless released this 07 Aug 20:27
· 31 commits to main since this release

Changelog

Features

  • Autosave: You complained in our survey that the Wingman AI client was unresponsive and sometimes felt sluggish and as your favourite UX-driven development company, we listened. From now on, almost any action and config change you can make happens ad-hoc and no longer requires reloading the config you’re in. This brings a couple of benefits:
    • increased performance as loading a config is expensive
    • way less conversation history resets
    • no more “Save” buttons
    • more real estate in the UI as we don't need the ActionBar anymore
    • you can't forget to save
  • Config Migrations. Finally. Wingman AI 1.5.0 (and future versions) will now take your configs from previous versions and migrate them to the current format, meaning that you’ll keep almost all of your old secrets, settings, configs and Wingmen. You’ll find a new .migration file in your 1.5.0 config directory after the first start. This file is a log and as long it’s there, Wingman AI will not attempt to migrate again on start.
    • If you want to re-migrate your 1.4.0 configs for some reason, delete this .migration file (and the 1.5.0 configs), then restart Wingman AI. We only support migrating from 1.4.0 to 1.5.0 but future versions might also be able to migrate step-by-step starting from version 1.4.0.
    • If you are a developer and have built your own skills in older versions, you have to copy the files from /[old_version]/skills/[your-skill] to /[new_version]/skills/[your-skill] manually because we cannot migrate unknown skills reliably. Restart Wingman AI afterwards.
  • Our beloved TTS provider whispercpp is now bundled (and auto-started) with Wingman AI and is the new default for everything so that everyone can use it. Check out the new section the Settings view to set it up. If you have a NVIDIA RTX GPU, make sure to check the “use CUDA” option. If you’re using custom models (other than ggml-base.bin), just copy them into the new whispercpp-models directory in your Wingman AI installation directory and the UI will find them.
  • Edit default config: In the Settings, you’ll find a new button and view that lets you change the defaults config (defaults.yaml). This includes the highest level system_prompt that Wingman AI uses. Be careful if you modify this and please only do so if you know what you’re doing. Yes, editing the default config also has autosave.
  • Added volume slider to Wingman configuration. If it’s set to zero, TTS processing will be skipped entirely.
  • added OpenAI’s much cheaper and faster gpt-4o-mini model and made it the new default. The larger gpt-4o model is still available in case you need it.
  • added Google Gemini as LLM provider. Function calling doesn’t work (yet) but you can use it for chit-chat and roleplaying. It’s very fast and pretty good - try it! Unfortunately, we can’t provide it with Wingman Pro, so you’ll need your own API key. The smaller model is currently free.
    • Groq and Elevenlabs models are now fetched using their API so that new models will be available in Wingman AI immediately and without an update. Check out Groq’s new llama3.1 models with function calls and Elevenlabs’ brand-new eleven-turbo-2.5 model finally supporting multiple languages.
  • We removed the summarize_provider so that tool calls can now be chained, meaning that the response to a function call can call another function and so on. This doesn’t sound like much but is actually huge. You (and skills) can now basically give commands like “Use the XYZ skill and then do ABC with the result”.
  • We split settings and configuration for XVASynth and improved its UI support meaning that it has nice dropdown for all your downloaded models and voices now. If you want to use XVASynth, install it via Steam, then enable and configure it in the Wingman AI Settings view. After that, it will appear in the list of TTS providers. We cannot migrate your old XVA settings, so please do that once, even if you were already using XVASynth.
  • Wingman AI Core can now open directories and files using Windows Explorer or OSX Finder. We added a new useful buttons like “open logs” or “open config directory” to the client.
  • We added new custom property types "VoiceSelection" and "Slider" so that skills can display our fancy UI components for them. No more writing JSON into text boxes to select voices...
  • We removed DirectSound drivers for audio devices because it caused more problems than it solved. The only available driver is MME now. Therefore, we have to reset your audio device settings during the migration, sorry. Make sure you check the “Settings” view after upgrading.

Bugfixes

  • Links in LLM responses now have target="_blank" set so that they’ll open in a new browser Window
  • Skills are now unloaded correctly when removed. No more infinite zombie timers. We also added some new hooks for developers.
  • fixed an issue with our AudioPlayer that sometimes resulted in a crash when using the RADIO_LOW or RADIO_MEDIUM sound effects
  • fixed an issue with mic selection while using VA
  • fixed an error occurring when using a default audio device
  • fixed renamed configs being recreated on startup (again)
  • fixed not being able to un-default renamed Wingmen
  • fixed Wingmen being disabled when a secret from their configuration was missing on startup
  • fixed Elevenlabs get_available_voices API endpoint changing for no apparent reason and without warning us ;) Thanks again to @lugia19 who fixed this very quickly in his great elevenlabslib
  • “Request a feature” aka canny.io is now opened in a new browser window because integrating it could resolve in a “blank screen” error.
  • If Wingman AI Core fails to start, we now show better trouble-shooting infos pointing to our Discord #support channel in the client.

New Skills

  • APIRequest: This one is a powerhouse. Point it to a docs page or (OpenAPI) spec for an API of your choice and it will understand the endpoints and then be able to call them - all on-thy-fly! We already gave it the Wingman AI Core API spec and well.. it was able to “send text” to another Wingman immediately without us ever implementing that. Wingmanception! You can also feed it any public online API like Pokedex or whatever. Try it out!
  • RadioChatter plays customizable random AI-generated chatter over time. You can even answer to these messages if you want to. It also uses our new custom property types now (in case you were already using the Discord versions).
  • Auto-Screenshot: Takes a screenshot if you request it or in moments where your voice input suggests interesting, scary or funny moments. Example: “Aww look, a spaceship!” => takes a screenshot
  • ATSTelemetry: Retrieve game state information from American Truck Simulator or Euro Truck Simulator 2
  • NMSAssistant: fetch information about No Man's Sky items, elements, crafting, cooking, expeditions, community missions, game news, and patch notes. Powered by NMS Assistant API.

Skill changes

  • ControlWindows and FileManager got some new capabilities like new supported file formats, text-to-clipboard and appending text to existing files
  • Timer skill can now loop and works more efficiently
  • VoiceChanger now makes use of the new custom property types and offers a nice voice selection in the UI
  • Vision now outputs the direct LLM responses in debug_mode

Where is it?

The release is now too large to add it to GitHub because we bundled whispercpp. Please download it from https://wingman-ai.com