I think I’m the type of person who gets into things after everyone else. In that regard AI is no different, and for a long time I considered LLMs a toy - this was truer of older models, such as the original ChatGPT models that came out in 2022-2023.
The discourse has understandably evolved over time and it’s clear that AI is not going anywhere. It’s like quadcopters in warfare, or so many other new technologies before it. As much as we’d like them not to exist or be used, they will be. To refuse to adopt new advancements is to be left behind, to give yourself a disadvantage on purpose.
Ultimately the problems around AI stem from capitalism. Yes, there are excesses. But this is true of humans too.
AI - especially LLMs, which I have more experience with - is great at some tasks and absolutely abysmal at others. Just like some people are good at their job and others don’t know the first thing about it. I used to get an ad on Twitter about some guy’s weird messianic book, and in it he showed two pages. It was the most meaningless AI bullshit, just faffing on and on while saying nothing, written in the most eye-rolling way.
That’s because LLMs currently aren’t great at writing prose for you. Maybe if you prompt them just right they can be, but that’s a skill in itself. So there is bottom-of-the-barrel quality, and there is better quality, and that’s true with or without AI. I think the over-reliance on AI to do everything regardless of output will eventually be pushed out, and people who do it will stop finding success (if they even found it in the first place - don’t readily believe people when they boast about their own success).
I use AI to code, for example. It’s mostly simpler stuff, but:
1- I would have to learn entire programming languages to do it myself, which takes years. AI can do it in 30 minutes, and better than I could in years, because it knows things I don’t. Take security, for example: would a hobbyist programmer know how to write secure web code? I don’t think so.
2- You don’t always have a coder friend available. In fact, the reason I started using AI to code my solutions is because try as we might to find coders to help, we just never could. So it was either don’t implement cool features that people will like, or do it with AI.
And it works great! I’m not saying it’s the top-tier quality I mentioned, but it’s a task that AI is very good at. Recently I even gave DeepSeek all the JS code it had previously written for me (plus some handwritten code) and asked it to refactor the entire file, and it did. We went from a 40 KB file to 20 KB after refactoring, and 10 KB after minifying. It’s not a huge file of course, but it’s something AI can do for you.
There is of course the environmental cost. To that I want to say that everything has an environmental cost. I don’t necessarily deny that AI is a water hog, just that under capitalism everything contributes to climate change and droughts. Moreover, to be honest, I’ve never seen actual numbers and studies; everyone just says “generating this image emptied a whole bottle of water.” It’s one of those things people repeat idly, like so many others; and without facts, we cannot find truth.
Therefore the problem is not so much with AI but with the mode of production, as expected.
Nowadays it’s possible to run models on consumer hardware that doesn’t need to cost $10,000 (though you might have seen that post about the $2,000 rig that can run the full DeepSeek model). DeepSeek itself is very efficient, and even more efficient models are being made, to the point that soon it will be more costly (and resource-intensive) to meter API usage than to give it out for free.
I think the place you have as a user is finding where AI can help you individually. People also like to say AI fries your brain, that it incentivizes you to shut your brain off and just accept the output. I think that’s a mistake, and it’s up to you not to do that. I’ve learned a lot about how Linux works, how to manage a VPS, and how to work on MediaWiki with AI help. Just as you should eat your vegetables and not too many sweets, you should be able to say “this is bad for me” and stop yourself from doing it.
If you’re a professional coder and work better with handwritten code, then continue with that! When it comes to students relying on AI for everything, schools need to find other methods. Right now they’re going backwards to pen-and-paper tests. Maybe we should rethink the entire testing method? When I was in school, years before AI, my schoolmates and I could already tell that rote memorization was torture and a 19th-century way of teaching. I think AI is just the nail in the coffin for a very, very outdated method of teaching. Why do kids use AI to do their homework for them? That is a much more important question than how they are using it.
As a designer I’ve used AI to help get me started on some projects, because this is my weakness. Once I get the ball rolling it becomes very easy for me, but getting it moving in the first place is the hard part. If you’re able to prompt it right (which is definitely something I lament - it feels like you have to say the right magic words, and they don’t always work), it can help with that, and then I can do my thing.
Personally, part of my unwillingness to get into AI initially came from the evangelists who like to say that literally every new tech thing is the future. Segways were the future, crypto was the future, VR was the future, NFTs were the future, Google Glass was the future… They make money saying these things, so of course they have an incentive to say them. It still bothers me that they exist, if you were wondering (if they bother you too lol), but ultimately you have to ignore them and focus on your own thing.
Another part of it, I think, is how much mysticism there is around it, with companies and, let’s say, AI power users who are unwilling to share their methods or explain how LLMs actually work. They keep information for themselves, or lead people to think it’s magic and does everything.
Is AI coming for your job? Yes, probably. But burying our heads in the sand won’t help. I see a lot of translators talking about the soul of their art - everything has a soul and is art now (I even saw a programmer call it that to explain why they don’t use AI in their work); we’ve gone full circle back to base idealism to “explain” how human work is different from AI work. AI already handles some translation work very well, and professionals are already losing work to it. Saying “refuse to use AI” is not materially sound; it is not going to save their client base. Under socialism getting your job automated is desirable - not under capitalism, of course. But this is not new either: machines have replaced human workers for centuries, as far back as the printing press, to name just one. Yet nobody today is saying “return to scribing monks”.
I think it would be very useful to have an AI guide written for communists by communists. Something that everyone can understand, written from a proletarian perspective - not the philosophy of it, but more like how the tech works, how to use it, etc. I can put it up on the ProleWiki essays space if someone wants to write it; we’ve put up guides before - for example, there’s a nutrition and fitness guide written from a communist perspective.
I very much agree with all that. This is already a very useful tool, and it can save you a lot of time once you learn how to apply it effectively. As with any tool, it takes time to develop an intuition for where it works well and how to use it to get the results you want. I get the impression that a lot of people try LLMs out of spite, already biased toward thinking the tool is useless; then they naturally fail to produce good results on the first try and declare it to be so.
As you point out, it’s an excellent tool for learning to work with new languages, discovering tricks for system configuration, and so on. I’ve been doing software development professionally for over 20 years now, and I know some languages well and others not so much. With LLMs, I can basically use any language like an expert. For example, I recently had to work on a JS project, and I hadn’t touched the language in years. I wasn’t familiar with the ecosystem, current best practices, or popular libraries. Using an LLM allowed me to get caught up on that very quickly.
I’m also not too worried about the loss of skill or thinking capacity because the really useful skills lie in understanding the problem you’re trying to solve conceptually and designing a solution that will solve it. High level architecture tends to be the really important skill, and I find that’s basically where the focus is working with agents. The LLM can focus on the nitty gritty aspects of writing the code, while I focus on the structure and the logic flow. One approach I’ve found very effective is to stub out the functions myself, and have the agent fill in the blanks for me. This helps focus the LLM and prevent it from going off into the weeds.
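To make the stub-and-fill approach concrete, here’s a rough sketch of what I mean - all the function names are hypothetical, just for illustration:

```javascript
// I write the signatures and doc comments myself; the agent fills in the bodies.
// Stubbing it out this way keeps the agent focused on one blank at a time.

/** Parse one CSV line into an array of fields. */
function parseCsvLine(line) {
  throw new Error('TODO: agent fills this in');
}

/** Turn an array of parsed rows into a lookup table keyed by the first field. */
function indexRows(rows) {
  throw new Error('TODO: agent fills this in');
}
```

The structure and the logic flow stay in my hands; the agent only gets to decide what goes inside each body.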
Another trick I’ve found handy is to ask the agent to first write a plan for the solution. Then I can review the plan and tell the agent to adjust it as needed before implementing. Agents are also pretty good at writing tests, and tests are much easier to evaluate for correctness because good tests are just independent functions that do one thing and don’t have a deep call stack. My current approach is to get the LLM to write the plan, add tests, and then focus on making sure I understand the tests and that they pass. At that point I have a fairly high degree of confidence that the code is indeed doing what’s needed. The tests act as a contract for the agent to fill.
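As a toy illustration of tests acting as a contract (a made-up slugify function, not from any real project): the tests pin down the behavior, and whatever implementation the agent produces just has to pass them.

```javascript
// The implementation the agent might produce:
function slugify(title) {
  return title
    .toLowerCase()
    .trim()
    .replace(/[^a-z0-9]+/g, '-')
    .replace(/^-+|-+$/g, '');
}

// The "contract": each test is an independent function that checks one thing,
// with no deep call stack to trace through.
function testLowercase() { console.assert(slugify('Hello') === 'hello'); }
function testPunctuation() { console.assert(slugify('Hello, World!') === 'hello-world'); }
function testWhitespace() { console.assert(slugify('  spaced  out  ') === 'spaced-out'); }

[testLowercase, testPunctuation, testWhitespace].forEach((t) => t());
```

Evaluating those three tests for correctness takes seconds; evaluating an arbitrary implementation line by line takes much longer.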
I suspect that programming languages might start shifting in the direction of contracts in general. I can see something like this becoming the norm, where you simply specify the signature for the function. You could also specify parameters like computational complexity and memory usage. The agent could then try to figure out how to fulfill the contract you’ve defined. It would be akin to a genetic algorithm approach, where the agent converges on a solution over time. If that’s the direction things move in, then current skills could be akin to being able to write assembly by hand: useful in some niche situations, but not necessary the vast majority of the time.
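Purely as a sketch of that speculative direction - using JSDoc as stand-in syntax, since nothing like this is standard - a contract might be a signature plus constraints, with the agent free to choose any implementation that satisfies them:

```javascript
/**
 * Hypothetical contract: the agent may implement this however it likes,
 * as long as the signature and the stated constraints hold.
 * @param {number[]} xs - input array, must not be mutated
 * @returns {number[]} - new array sorted in ascending order
 * constraints: O(n log n) time, O(n) extra space
 */
function sortNumbers(xs) {
  // One implementation the agent could converge on:
  return [...xs].sort((a, b) => a - b);
}
```

The human specifies the contract; whether the body uses a built-in sort, merge sort, or something else becomes an implementation detail, like machine code emitted by a compiler.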
Finally, it’s very helpful to structure things using small components that can be tested independently and composed together to build bigger things. As long as a component functions in the intended way, I don’t necessarily care about the quality of the code internally. I can treat it as a black box as long as it’s doing what’s expected. This is already the approach we take with libraries: we don’t audit every line of code in a library we include in a project, we just look at its surface-level API.
Incidentally, I’m noticing that a functional style seems to work really well here. Having an assembly line of pure functions naturally breaks a problem into small building blocks that you can reason about in isolation. It’s kind of like putting Lego blocks together. The advantage over something like microservices is that you don’t have to deal with the complexity of orchestration and communication between services.
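A minimal sketch of that assembly-line style (a toy word-count pipeline, nothing from a real codebase):

```javascript
// Each step is a small pure function you can test in isolation.
const trim = (s) => s.trim();
const words = (s) => s.split(/\s+/).filter(Boolean);
const count = (ws) => ws.length;

// Generic left-to-right composition: pipe(f, g, h)(x) === h(g(f(x))).
const pipe = (...fns) => (x) => fns.reduce((acc, f) => f(acc), x);

const wordCount = pipe(trim, words, count);
console.log(wordCount('  the quick   brown fox  ')); // → 4
```

Each block is trivially verifiable on its own, and the composition adds no coordination overhead - which is exactly the contrast with microservice orchestration.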
This is exactly how I use LLMs to code too. I’m good at laying out the steps to solving a problem, not so good a coder (I basically hand-code HTML and CSS because it’s faster for me than using an LLM, but I never learned JS and never felt like learning it, even before AI was a thing).
I also have it create constituent components. E.g. for the reading mode on ProleWiki, I had it trigger on a “0” keypress and apply a class to <html>, which I then customize myself in the CSS file; after that I had it write other functions for a page progress bar and a hoverline. The hoverline was actually an LLM’s idea, to keep track of which line you’re on. Finally, just recently I gave DeepSeek these three functions and told it to refactor them and optimize for efficiency, and it did just that. It doesn’t do everything in one step yet, but if you know even passably well what it’s capable of, you can have it do things in several steps.
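In case anyone wants to try something similar, here’s a rough sketch of those pieces - the class name and key binding are my guesses at how a setup like this might look, not ProleWiki’s actual code:

```javascript
// Toggle a reading-mode class on <html>; the class itself is styled in CSS.
function toggleReadingMode(doc) {
  doc.documentElement.classList.toggle('reading-mode');
}

// Pure helper for a page progress bar: fraction of the page scrolled, 0..1.
function scrollProgress(scrollTop, scrollHeight, clientHeight) {
  const max = scrollHeight - clientHeight;
  return max <= 0 ? 0 : Math.min(1, scrollTop / max);
}

// Browser wiring (commented out so the sketch runs anywhere):
// document.addEventListener('keydown', (e) => {
//   if (e.key === '0') toggleReadingMode(document);
// });
```

Keeping the math in a pure helper like `scrollProgress` is also what makes the later “refactor these three functions” step easy for the model.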
edit - and of course just asking the AI to answer questions about itself. “Write this as a Midjourney prompt,” for example. That’s why I think it would be important to have a proletarian guide to AI, so that everyone can start somewhere, because a lot of the knowledge is gatekept individually.
What do you use for agents? I downloaded agent0 and it runs but gets stuck on ‘checking memory’ every time. I’m not on a great rig to be running local models right now but apparently this is a problem several people are facing.
If you’ve just been using the web UI for DeepSeek, I highly recommend checking out tools that let you run models against the actual codebase you’re working with. It’s a much better experience because the model has a lot more context to work with.
There are two broad categories of tools. One is a REPL-style interface where you start a chat in the terminal and the agent manages all the code changes while you prompt it with what you want to do. You don’t have as much control here, but these agents tend to do a pretty good job of analyzing the codebase holistically. The two main ones to look at are Aider and Plandex.
The other approach is editor integration, as seen with Cursor. Here you’re doing most of the driving and high-level planning, and then use the agent contextually to add code, like writing individual functions. You have much more granular control over what the agent is doing this way. It’s worth noting that you have a chat mode here as well, and you can get the agent to analyze the code, find things in the project, etc. That’s another aspect I find underappreciated: you can use the LLM to find the relevant code you need to change. A couple of projects to look at are Continue and Roo-Code.
All these projects work with Ollama locally, but I’ve found DeepSeek API access is pretty cheap and you do tend to get better results that way. The obvious caveat is that you’re sending your code to their servers.
I’d like to know your opinion on https://opencode.ai/ and/or https://charm.land/ if you have tried them - FOSS competitors to tools like Claude Code.
Oh, I haven’t tried either; it looks like they’re in the Aider/Plandex style but with a bit more UI. I’m guessing these projects all end up pretty similar in the end, since ultimately it’s the model that really matters. These tools largely just manage MCP interaction and communication with the model.
Thanks I’ll take a look at the links! Lots of stuff to check out and try lol but that’s true of everything.
yeah it can be a bit overwhelming :)