Talk:AutoGPT: Difference between revisions

From Wikipedia, the free encyclopedia
::::I can confidently say that it hasn't submitted any pull requests for anything yet, as [https://github.com/Significant-Gravitas/Auto-GPT/blob/master/autogpt/commands/git_operations.py its git operations are limited to cloning repositories] at present. The [https://github.com/Significant-Gravitas/Auto-GPT/blob/master/autogpt/prompt.py prompt.py prompt generation routine] shows its basic capabilities, which for files are limited to its workspace, which does not include its base system code. There do not seem to be any pending PRs which would add the ability to modify its base system, perform any kind of writes, or submit pull requests to repos. Having said that, who knows whether it can figure out how to use Selenium to operate the GitHub web interface. And if anyone has copied its base system into its workspace and asked it to do anything with it, including running it, I can't find any mention of that. [[User:Sandizer|<span style="font-family:Garamond;">'''''Sandizer'''''</span>]] ([[User talk:Sandizer|talk]]) 19:13, 17 April 2023 (UTC)
:::::I've [https://en.wikipedia.org/enwiki/w/index.php?title=Auto-GPT&diff=prev&oldid=1150498976&diffmode=visual taken a crack at rephrasing this] to simplify and clarify it, and also moved it to a separate paragraph since that one was getting long. I don't think the details about its GitHub API abilities are necessary, and they seem to be original (though accurate) research. [[User:StereoFolic|StereoFolic]] ([[User talk:StereoFolic|talk]]) 14:54, 18 April 2023 (UTC)
::::::Looks good. I've moved on to [[LangChain]] and hope you might want to review that article. [[User:Sandizer|<span style="font-family:Garamond;">'''''Sandizer'''''</span>]] ([[User talk:Sandizer|talk]]) 15:20, 18 April 2023 (UTC)

Revision as of 15:20, 18 April 2023

Did you know nomination

The following is an archived discussion of the DYK nomination of the article below. Please do not modify this page. Subsequent comments should be made on the appropriate discussion page (such as this nomination's talk page, the article's talk page or Wikipedia talk:Did you know), unless there is consensus to re-open the discussion at this page. No further edits should be made to this page.

The result was: promoted by BorgQueen (talk) 02:10, 19 April 2023 (UTC)

Created by Sandizer (talk). Self-nominated at 21:06, 16 April 2023 (UTC). Post-promotion hook changes for this nom will be logged at Template talk:Did you know nominations/Auto-GPT; consider watching this nomination, if it is successful, until the hook appears on the Main Page.

General: Article is new enough and long enough
Policy: Article is sourced, neutral, and free of copyright problems
Hook: Hook has been verified by provided inline citation
QPQ: None required.
Overall: Looks good! 28bytes (talk) 12:44, 17 April 2023 (UTC)

Self improvement

Sandizer - we've gone back and forth on this a couple of times, so maybe it's better to discuss here. I don't believe the sources support the unqualified claim that AutoGPT can meaningfully improve itself. If I'm missing something from the sources I apologize, though perhaps that warrants a more in-depth explanation with citations. My main concern with describing the self-improvement is that it can be misunderstood as suggesting AutoGPT is capable of a singularity-style self-improvement cycle. While I'm sure AutoGPT can non-trivially edit its Python harness code, it certainly cannot update the GPT model that provides all the seriously non-trivial power of the program. For context (I know this is original research), Auto-GPT is about 4000 lines of Python code, compared to presumably several terabytes behind the GPT model. Self-improvement to the Python code is a very different thing from self-improvement to the model. StereoFolic (talk) 18:00, 17 April 2023 (UTC)

I would go a bit further and state that at present there are no examples of it understanding or changing those 4000 lines of Python, and it can't in any meaningful way. Those files aren't available unless manually copied into its workspace. That is why I changed "rewrite" to "write". What is meant, and what the Toronto Sun headline and narrative in that and other sources actually mean, is that it can write, store, and reuse programs from scratch, and debug them when they don't work correctly. An example which works is "Write a program to print the first N prime numbers. Modify that program to only print odd prime numbers. Then use the modified program to print the first 20 odd primes." I agree we should add a sentence clarifying the meaning of "its own code" as not including the Auto-GPT base system or anything having to do with the GPT LLMs. Sandizer (talk) 18:14, 17 April 2023 (UTC)
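For readers unfamiliar with the kind of task meant by the prime-number prompt above, here is a minimal sketch of what the three steps could look like in Python. It is purely illustrative: the function names (is_prime, first_primes, first_odd_primes) are hypothetical and are not taken from Auto-GPT or from any of its recorded outputs.

```python
from itertools import count, islice


def is_prime(n: int) -> bool:
    """Trial division; adequate for the small values used here."""
    if n < 2:
        return False
    if n % 2 == 0:
        return n == 2
    d = 3
    while d * d <= n:
        if n % d == 0:
            return False
        d += 2
    return True


def first_primes(n: int) -> list[int]:
    """Step 1 of the prompt: the first n prime numbers."""
    return list(islice((k for k in count(2) if is_prime(k)), n))


def first_odd_primes(n: int) -> list[int]:
    """Step 2: the modified program, which skips 2 (the only even prime)."""
    return list(islice((k for k in count(3, 2) if is_prime(k)), n))


if __name__ == "__main__":
    # Step 3: reuse the modified program for the first 20 odd primes.
    print(first_odd_primes(20))
```

Nothing in this sketch touches the Auto-GPT base system; it only shows a program written from scratch, modified, and then reused.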
I see the confusion now around the phrase "its own". At least the Toronto Sun article seems to mean "its own" as Auto-GPT itself: "Perhaps most significant, the program boasts the ability to rewrite and improve on its own code, allowing it to “recursively debug, develop and self-improve,” according to Significant Gravitas. How effective these self-updates are remains to be seen." (emphasis mine). At least that's how I read it. I'll take a crack at clarifying these distinctions later if you don't beat me to it. Thank you! StereoFolic (talk) 18:19, 17 April 2023 (UTC)
I added this sentence, which is a bit clunky, and I certainly welcome any further improvements. I am not entirely sure that there are no examples so far of it submitting any pull requests to its own repository, but on the other hand I am sure that someone will eventually manage that as a worthy, challenging exercise. I will try to see whether there are any such PRs so far and report back if I find them.
One side note, which I think is the source of some of its issues, is that (some of?) the "short-term memory" statements in its database do get included in its context window, but as far as I can tell, the "long-term memory" files in the workspace aren't, so it's generally unaware of them unless references to them are in the "short-term" database or something causes it to look at its workspace. I am not sure whether the file catalog of names (and descriptions?) is always in the context window, though, or what all can cause it to get observed. I wouldn't be surprised if it's generally nondeterministic, and it is certainly very much in flux among the (hundreds of!) developers. Sandizer (talk) 18:33, 17 April 2023 (UTC)
I can confidently say that it hasn't submitted any pull requests for anything yet, as its git operations are limited to cloning repositories at present. The prompt.py prompt generation routine shows its basic capabilities, which for files are limited to its workspace, which does not include its base system code. There do not seem to be any pending PRs which would add the ability to modify its base system, perform any kind of writes, or submit pull requests to repos. Having said that, who knows whether it can figure out how to use Selenium to operate the GitHub web interface. And if anyone has copied its base system into its workspace and asked it to do anything with it, including running it, I can't find any mention of that. Sandizer (talk) 19:13, 17 April 2023 (UTC)
I've taken a crack at rephrasing this to simplify and clarify it, and also moved it to a separate paragraph since that one was getting long. I don't think the details about its GitHub API abilities are necessary, and they seem to be original (though accurate) research. StereoFolic (talk) 14:54, 18 April 2023 (UTC)
Looks good. I've moved on to LangChain and hope you might want to review that article. Sandizer (talk) 15:20, 18 April 2023 (UTC)