This article is within the scope of WikiProject Artificial Intelligence, a collaborative effort to improve the coverage of Artificial intelligence on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.Artificial IntelligenceWikipedia:WikiProject Artificial IntelligenceTemplate:WikiProject Artificial IntelligenceArtificial Intelligence
Please add the quality rating to the {{WikiProject banner shell}} template instead of this project banner. See WP:PIQA for details.
This article is within the scope of WikiProject Computing, a collaborative effort to improve the coverage of computers, computing, and information technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.ComputingWikipedia:WikiProject ComputingTemplate:WikiProject ComputingComputing
This article is within the scope of WikiProject Technology, a collaborative effort to improve the coverage of technology on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.TechnologyWikipedia:WikiProject TechnologyTemplate:WikiProject TechnologyTechnology
This article has been rated as Start-class on Wikipedia's content assessment scale.
Did you know nomination
The following is an archived discussion of the DYK nomination of the article below. Please do not modify this page. Subsequent comments should be made on the appropriate discussion page (such as this nomination's talk page, the article's talk page or Wikipedia talk:Did you know), unless there is consensus to re-open the discussion at this page. No further edits should be made to this page.
Sandizer - we've gone back and forth on this a couple times so maybe it's better to discuss here. I don't believe the sources support the unqualified claim that AutoGPT can meaningfully improve itself. If I'm missing something from the sources I apologize, though perhaps that warrants a more in-depth explanation with citations. My main concern with describing the self-improvement is that it can be misunderstood as suggesting AutoGPT is capable of a singularity-style self-improvement cycle. While I'm sure AutoGPT can non-trivially edit its Python harness code, it certainly cannot update the GPT model that provides all the seriously non-trivial power of the program. For context (I know this is original research), Auto-GPT is about 4000 lines of Python code, compared to presumably several terabytes behind the GPT model. Self-improvement to the python code is a very different thing than self-improvement to the model. StereoFolic (talk) 18:00, 17 April 2023 (UTC)[reply]
I would go a bit further and state that at present there are no examples of it understanding or changing those 4000 lines of Python, and it can't in any meaningful way. Those files aren't available unless manually copied into its workspace. That is why I changed "rewrite" to "write". What is meant, and what the Toronto Sun headline and narrative in that and other sources actually mean, is that it can write, store, and reuse programs from scratch, and debug them when they don't work correctly. An example which works is "Write a program to print the first N prime numbers. Modify that program to only print odd prime numbers. Then use the modified program to print the first 20 odd primes." I agree we should add a sentence clarifying the meaning of "its own code" as not including the Auto-GPT base system or anything having to do with the GPT LLMs. Sandizer (talk) 18:14, 17 April 2023 (UTC)[reply]
I see the confusion now around the phrase "its own". At least the Toronto Sun article seems to mean "its own" as Auto-GPT itself: "Perhaps most significant, the program boasts the ability to rewrite and improve on its own code, allowing it to “recursively debug, develop and self-improve,” according to Significant Gravitas. How effective these self-updates are remains to be seen." (emphasis mine). At least that's how I read it. I'll take a crack at clarifying these distinctions later if you don't beat me to it. Thank you! StereoFolic (talk) 18:19, 17 April 2023 (UTC)[reply]
I added this sentence which is a bit clunky and certainly welcome any further improvements. I am not entirely sure that there are no examples so far of it submitting any pull requests to its own repository, but on the other hand I am sure that someone will eventually manage that as a worthy challenging exercise. I will try to see whether there are any such PRs so far and report back if I find them.
One side note which I think is the source of some of its issues, is that (some of?) the "short-term memory" statements in its database do get included in its context window, but as far as I can tell, the "long-term memory" files in the workspace aren't, so it's generally unaware of them unless references to them are in the "short-term" database or something causes to look at its workspace. I am not sure whether the file catalog of names (and descriptions?) is always in the context window, though, or what all can cause it to get observed. I wouldn't be surprised if it's generally nondeterministic, and is certainly very much in flux among the (hundreds of!) developers. Sandizer (talk) 18:33, 17 April 2023 (UTC)[reply]