Russ

Russ@bitforged.space · 5 days ago

It depends on who you’re referring to as a casual user. My mother for example would certainly have a hard time with it, then figuring out the key to bring up the boot menu (and being faced with a scary dialog that they’ve never seen), then selecting the right device, then likely being faced with GRUB which would also look scary to her, and by then she’d be overwhelmed before even getting to the install portion.

Russ@bitforged.space · 6 days ago

I personally use Sleep as Android which comes with a bunch of options to help ensure you’ve actually woken up. I utilize the “captcha” option in which when I go to turn off the alarm, it displays a screen full of sheep and all of them but one are sleeping - you have to click the one that is “awake” in order to dismiss the alarm. I guess the process wakes up my brain just enough so that I don’t go back to sleep, whereas with a regular alarm that has just a simple dismiss button I’ll absolutely either hit dismiss or one of the volume buttons to turn off the alarm before I’ve fully woken up.

I also have it set to buzz on my watch for 90 seconds before playing a sound on my phone (which escalates in volume) - I’ve not had a problem waking up with this in the years that I’ve been using it.

There are other options too, such as answering math questions, scanning a QR code, pressing your phone to an NFC tag, heavily shaking the phone, one called “Say cheese!” that makes you smile as hard as you can and uses the camera to detect it, and one that you have to “laugh out loud”.

Russ@bitforged.space · 8 days ago

Hmm, gotcha. I just tried out a fresh copy of text-gen-webui and it seems like the latest version is borked with ROCM (I get the CUDA error: invalid device function error).

My next recommendation then would be LM Studio which to my knowledge can still output an OpenAI compatible API endpoint to be used in SillyTavern - I’ve used it in the past before and I didn’t even need to run it within Distrobox (I have all of the ROCM stuff installed locally, but I generally run most of the AI stuff in distrobox since it tends to require an older version of Python than Arch is currently using) - it seems they’ve recently started supporting running GGUF models via Vulkan, which I assume probably doesn’t require the ROCM stuff to be installed perhaps?

Might be worth a shot, I just downloaded the latest version (the UI has definitely changed a bit since I last used it) and just grabbed a copy of the Gemma model and ran it, and it seemed to work without an issue for me directly on the host.

The advanced configuration settings no longer seem to directly mention GPU acceleration like it used to, however I can see it utilizing GPU resources in nvtop currently, and the speed it was generating at (the one in my screenshot was 83 tokens a second) couldn’t have possibly been done on the CPU so it seems to be fine on my side.

Russ@bitforged.space · 9 days ago

Yeah, I definitely am not a fan of how AMD handles rocm - there’s so many weird cases of “Well this card should work with rocm, but… [insert some weird quirk that you have to do, like the one I mentioned, or what you’ve run into]”.

Userspace/consumer side I enjoy AMD, but I fully understand why a lot of devs don’t make use of rocm and why Nvidia has such a tight hold on things in the GPU compute world with CUDA.

Russ@bitforged.space · 9 days ago

Ah, strange. I don’t suppose you specifically need a Fedora container? If not, I’ve been using this Ubuntu based distrobox container recipe for anything that requires ROCM and it has worked flawless for me.

If that still doesn’t work (I haven’t actually tried out kobolcpp yet), and you’re willing to try something other than kobolcpp, then I’d recommend the text-generation-webui project which supports a wide array of model types, including the GGUF types that Kobolcpp utilizes. Then if you really want to get deep into it, you can even pair it with SillyTavern (it is purely a frontend for a bunch of different LLM backends, text-generation-webui is one of the supported ones)!

Russ@bitforged.space · 9 days ago

What card do you use? I have a 6700XT and getting anything with ROCM running for me requires that I pass the HSA_OVERRIDE_GFX_VERSION=10.3.0 environmental variable to the related process, otherwise it just refuses to run properly. I wonder if it might be something similar for you too?

Russ@bitforged.space · 9 days ago

I did the same move for similar reasons! Although I still keep windows around on another SSS - and even the Windows Nvidia drivers were being funky for me.

Nvidia shares a lot of logic between their Windows and Linux driver as far as I’m aware, so I suppose it makes sense.

Russ@bitforged.space · 20 days ago

Good god, I was finally prescribed Ambien for the first time recently, and I definitely now realize why it has the reputation that it does.