• QuadratureSurfer@lemmy.world
      6 months ago

      Similar use cases to what I’m doing right now: running LLMs like Mixtral 8x7B (or something better by the time we start seeing these), Whisper (STT), or Stable Diffusion.

      I use a fine-tuned version of Mixtral (dolphin-Mixtral) for coding purposes.

      Transcribing live audio for notes/search, or translating audio from other languages using Whisper (particularly useful for verifying claimed translations of Russian/Ukrainian/Hebrew/Arabic, given all of the fake information being thrown around).

      Combine the two models above with a text-to-speech (TTS) system, a vision model like LLaVA, and some animatronics, and then I’ll have my own personal GLaDOS: https://github.com/dnhkng/GlaDOS

      And then there’s Stable Diffusion for generating images for DnD recaps, concept art, or even just avatar images.

      • Alphane Moon@lemmy.ml
        6 months ago

        Thank you! I currently use my 3080 dGPU for Stable Diffusion. I wonder how usable NPUs will be with Stable Diffusion XL.