Jump to content

Talk:Folding@home/GA2

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

GA Review

GA toolbox
Reviewing

Article (edit | visual edit | history) · Article talk (edit | history) · Watch

Reviewer: Czarkoff (talk · contribs) 22:04, 5 April 2012 (UTC)[reply]

Status

This section is supposed to be edited only by reviewer(s); please, leave your comments in the Discussion section below.

Rate Attribute Review Comment
1. Well-written:
1a. the prose is clear, concise, and understandable to an appropriately broad audience; spelling and grammar are correct.
  1. In third paragraph of Project significance the sentence "The conformational states and the short simulations between them are then compiled into statistical Markov state models (MSMs), which essentially serve as a map of the protein's energy landscape and kinetic and equilibrium thermodynamics properties and illustrate folding pathways" (italics added) is problematic. The part I emphasized should be either split out or otherwise disconnected from the preceding long object: it requires rereading to be properly comprehended.  Done
  2. The last sentence of the same paragraph starts with "The Pande lab has used these MSMs to", as does the first sentence of next (fourth) paragraph, which significantly breaks the text flow. Done
  3. The first sentence of the fourth paragraph of the same section reads "The Pande lab has used these MSMs to parallelize simulations that overall require more than 10 million CPU hours." Different amount of CPU hours is required depending on the properties of CPU this statement bases off (eg. 10m CPU hours of i486sx are ways less work then 10m CPU hours of modern server CPU). Either a note about CPU this phrase refers to should be given or the phrase itself should be generalize to solve ambiguity.  Done
  4. What is "NTL9 protein"? (in next sentence)? If the exact protein is important for the topic, wikilink should be here.  Done
  5. In the second sentence of the first paragraph of Biomedical research the phrase "therapeutic intervention can be the next step, which take the form of molecules that alter the production of a certain protein" implies that intervention takes form of molecules; the sentence should be slightly reworded to avoid this impression. BTW, "takes".  Done
  6. In third sentence of the third (last) paragraph of Cancer the sentence "IL-2 binds to these pulmonary cells differently than it does to T cells, a key approach for IL-2 research" should be reworded, as the second phrase is logically disconnected.  Done
  7. In fourth sentence of Osteogenesis imperfecta the sentence "The Pande lab has produced a publication which uses quantum mechanical techniques to improve upon previous simulations of collagen, which may be useful for future computational studies of collagen" should be rephrased: publication can describe the use, but it doesn't actually use quantum mechanical techniques.  Done
  8. The seventh sentence of the first paragraph of Viruses is incomplete: "limited to several orders of magnitude shorter" then what?  Done
  9. In Participation the whole native/x86 FLOPS issue should be explained. Though it is somehow done in last paragraph of PetaFLOPS milestones, the reader should be able to understand whether what are the "native FLOPS" native for and does summing them up gives a reasonable figure. Though I didn't dive in, I get the impression that the adequateness issue with the total of differently counted FLOPS was the reason of the project's introduction of x86 FLOPS; if so, "native" FLOPS should probably be omitted for all the data available in x86 FLOPS and a note explaining the issue should be added.  Done
  10. The last sentence of the Participation section should be expanded to explain that users were asked to donate CPU time. At least I got the current text as a suggestion to donate money. Also either text or (preferably) the footnote for this sentence (via |at=) should specify the exact update.  Done
  11. The sentence in the Software (that single one, which doesn't belong to subsections) gives the impression that of exclusive list of components. Probably the "consists of three components" should be changed to something like "involves three primary concepts".  Done
  12. The article refers to GROMACS as "GROMACS" and "Gromacs". One case should be used.  Done
  13. The second sentence of Multi-core processing client section has the phrase "These cores work together to complete a single WU significantly faster". The word "core" needs to be disambiguated between the "core" in "multi-core processor" and program's component. If the former is meant, some explanation is required to avoid leaving impression that the effect is reached without specific code optimization.  Done
1b. it complies with the Manual of Style guidelines for lead sections, layout, words to watch, fiction, and list incorporation. I would note that contrary to Wikipedia's practice several articles are linked from this one more then once. Though I would ordinarily require that second, third and later occasions be unlinked, this article imposes a substantial load on the reader, so repeated links may actually facilitate reading it. Please don't get this comment as a call to further add duplicate links though.
2. Verifiable with no original research:
2a. it contains a list of all references (sources of information), presented in accordance with the layout style guideline.
2b. reliable sources are cited inline. All content that could reasonably be challenged, except for plot summaries and that which summarizes cited content elsewhere in the article, must be cited no later than the end of the paragraph (or line if the content is not in prose).
  1. Citation templates require more work: the authors' first and last names are ordered inconsistently, the |work= and |publisher= are used (and wikilinked) inconsistently,  Done the dates should be in MDY format.
2c. it contains no original research.
3. Broad in its coverage:
3a. it addresses the main aspects of the topic.
3b. it stays focused on the topic without going into unnecessary detail (see summary style).
4. Neutral: it represents viewpoints fairly and without editorial bias, giving due weight to each.
5. Stable: it does not change significantly from day to day because of an ongoing edit war or content dispute.
6. Illustrated, if possible, by media such as images, video, or audio:
6a. media are tagged with their copyright statuses, and valid non-free use rationales are provided for non-free content.
  1. The image in infobox is reported to be distributed under CC0 conditions, but no proof is demonstrated. The fact that uploader's name looks like the name of author (as written on description page) doesn't prove the licensing information on its own.  Done
6b. media are relevant to the topic, and have suitable captions.
  1. The caption of the image in Participation section includes unnecessary details: "shown, by device type, in teraFLOPS as recorded semi-daily from November 2006 until September 2007."  Done
7. Overall assessment.

Discussion

Please refer to the issues in the table above by their numbers (eg. 1a1 for first issue with "prose" criterion).

  1. 1a1: I think I've addressed this now. Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  2. 1a2: This should be addressed as well. Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  3. 1a3: The exact phrase in the publication is "We created MSMs based on molecular dynamics simulations of a fast-folding double-norleucine HP35 mutant, extending prior calculations to include more than 1 ms of simulation and requiring more than 10 million CPU hours of computation." Since it also says "Folding@Home donors provided computer resources", I think its reasonable to assume that the calculations were run on a broad range of hardware and that measurement is simply the sum of everyone's CPU time. Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
    • The problem is that this article will still be there in, say, 2050; I think we may safely assume that by then 10 million CPU hours would be an amount of computation that would be ways greater then the author of this quote was thinking of. Thus at least year should be specified. Eg. "that overall required more than 10 million CPU hours as of 2011". — Dmitrij D. Czarkoff (talk) 21:00, 6 April 2012 (UTC)[reply]
      That does make sense. But the year is already in the sentence, and that should take care of that clarification. Jesse V. (talk) 21:40, 6 April 2012 (UTC)[reply]
  4. 1a4: I think its important to specify which protein. An expert reading this article will probably be familiar which the chemical properties of that protein, but I don't. There's no Wikipedia article on NTL9. The name is so small IMO there's hardly a readability difference between "the NTL9 protein" and "a protein" but I can remove the name if you think that's important. Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  5. 1a5: I added the qualifier "can", which I think also fixed the grammar issue. Molecular interventions are things like drugs, antibiotics, etc. Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  6. 1a6: Better? Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  7. 1a7: Thanks for catching that. Fixed. Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  8. 1a8: Clarified. Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  9. 1a9: I made an initial stab at clarifying the difference, and put the explanation above any mention of "native FLOPS". Removing this native FLOP measurement would obliterate the info in the PetaFLOPS Milestone section, almost all of which is based on native FLOPS before x86 FLOPS were displayed. I've seen nothing to indicate that summing the native FLOPS is particular bad, only that there's a difference between the two measurements. I personally don't see this as a big deal, but please let me know if there's something further I need to fix/clarify here. Jesse V. (talk) 01:25, 7 April 2012 (UTC)[reply]
    • Well, I dismiss this issue, as the accurate information is unverifiable. Hope some improvements can come in future. For the record: when the text says total of native petaFLOPS is 5.7, and in x86 petaFLOPS it is 8, this means that replacing all x86 units with PS unit with equal native petaFLOPS performance will result in no changes in native petaFLOPS and large increase in x86 petaFLOPS. That specifically means that one of this measurement system, namely native petaFLOPS is widely inaccurate. That's why "[b]y reporting both, Folding@home attempts to even out these hardware differences." — Dmitrij D. Czarkoff (talk) 10:02, 7 April 2012 (UTC)[reply]
  10. 1a10: The Pande lab does accept monetary donations, did you know that? :P In any case, I've clarified it. I'm very unfamiliar with the "at = " field. Can you explain? Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  11. 1a11: I like your suggestion. So changed. Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]
  12. 1a12: Addressed. Jesse V. (talk) 21:40, 6 April 2012 (UTC)[reply]
  13. 1a13: It now says "CPU cores". Since it states that they work together, isn't that a sufficient explanation? The section then goes into details as to the specific techniques used to make them work together. Jesse V. (talk) 21:40, 6 April 2012 (UTC)[reply]
  14. 2b1: I've inserted "work =" and "publisher =" information where it was missing, (diff) does that address that problem? If an article exists for an author/publisher I try to wikilink to it. Please specify where I'm "inconsistent" in that regard. Also, per this page the current format of "FirstName LastName" should be fine, no? Or do I need to do "LastName, FirstName" using first1 and last1 and all that? If you were referring to in-text dates, then I believe I've fixed this with this edit. Jesse V. (talk) 02:56, 6 April 2012 (UTC)[reply]
  15. 6a1: I see your point. However, I know for a fact that that user is who he says he is, and that statement is accurate. He is a Pande lab member, and this is his foldingforum.org account. I've also written to him via email, and he sent me the GIMP files for that image. Please advise as to how he can confirm his identity. Jesse V. (talk) 02:56, 6 April 2012 (UTC)[reply]
    • Though ideal solution is a link to the official website page stating usage rights for this image, e-mail message from the project (that they don't claim copyright or release it under CC0) would be OK. See, if this image was produced while the author was working for the copyright owner of this software, the copyright may belong to the project without author realizing it. Similarly, the way he submitted it may have transferred the rights. — Dmitrij D. Czarkoff (talk) 07:26, 6 April 2012 (UTC)[reply]
      If you emailed Dr. Pande and he replied back with clarification on this, would that be satisfactory? You can find his email address on this page. Jesse V. (talk) 15:45, 6 April 2012 (UTC)[reply]
      Dr. Pande confirmed the licensing status. I'll take care of noting this on the file's page. — Dmitrij D. Czarkoff (talk) 22:26, 6 April 2012 (UTC)[reply]
  16. 6b1: I believe I've addressed that issue with this edit. Jesse V. (talk) 02:56, 6 April 2012 (UTC)[reply]

Notice: I understand that there are certain things that you are looking for and some improvement suggestions would be outside the scope of this GA review. Therefore if you or anyone else feel that there's something specific that needs to be done to further improve the quality of this article, especially if if can be brought really close to FA standards, feel free to open a topic in the Talk page. I would be eager to hear such suggestions and if I couldn't take care of it myself I'm sure others could. Thanks, Jesse V. (talk) 04:50, 6 April 2012 (UTC)[reply]

The article frequently uses the constructions similar to "Pande lab using Folding@home". Could they be generalized to omit Pande lab, or there is a reason to constantly mention it? — Dmitrij D. Czarkoff (talk) 09:20, 6 April 2012 (UTC)[reply]
Thanks I'll think more about that, and examine Rosetta@home for some ideas. I guess I couldn't think of better terms, but you should have read the article before I made this edit! :D Jesse V. (talk) 15:37, 6 April 2012 (UTC)[reply]


Regarding 1a10 and |at=
Markup Renders as
{{cite news |url=http://www.stanford.edu/group/pandegroup/genome/new.html |title=Genome@home Updates |first1=Vijay |last1=Pande |first2=Stefan |last2=Larson |date=2002-03-04 |accessdate=2012-03-17 }}

Pande, Vijay; Larson, Stefan (2002-03-04). "Genome@home Updates". Retrieved 2012-03-17.

{{cite news |url=http://www.stanford.edu/group/pandegroup/genome/new.html |title=Genome@home Updates |at=April 15, 2004 Update |first1=Vijay |last1=Pande |first2=Stefan |last2=Larson |date=2002-03-04 |accessdate=2012-03-17 }}

Pande, Vijay; Larson, Stefan (2002-03-04). "Genome@home Updates". April 15, 2004 Update. Retrieved 2012-03-17.