Jump to content

The Art of Computer Programming

From Wikipedia, the free encyclopedia
The printable version is no longer supported and may have rendering errors. Please update your browser bookmarks and please use the default browser print function instead.

The Art of Computer Programming
The Art of Computer Programming, Volume 1: Fundamental Algorithms
AuthorDonald Knuth
LanguageEnglish
GenreNon-fiction
Monograph
PublisherAddison-Wesley
Publication date
1968– (the book is still incomplete)
Publication placeUnited States
Media typePrint (Hardcover)
ISBN0-201-03801-3
519
LC ClassQA76.75

The Art of Computer Programming (TAOCP) is a comprehensive monograph written by the computer scientist Donald Knuth presenting programming algorithms and their analysis. Volumes 1–5 are intended to represent the central core of computer programming for sequential machines.

When Knuth began the project in 1962, he originally conceived of it as a single book with twelve chapters. The first three volumes of what was then expected to be a seven-volume set were published in 1968, 1969, and 1973. Work began in earnest on Volume 4 in 1973, but was suspended in 1977 for work on typesetting prompted by the second edition of Volume 2. Writing of the final copy of Volume 4A began in longhand in 2001, and the first online pre-fascicle, 2A, appeared later in 2001.[1] The first published installment of Volume 4 appeared in paperback as Fascicle 2 in 2005. The hardback Volume 4A, combining Volume 4, Fascicles 0–4, was published in 2011. Volume 4, Fascicle 6 ("Satisfiability") was released in December 2015; Volume 4, Fascicle 5 ("Mathematical Preliminaries Redux; Backtracking; Dancing Links") was released in November 2019.

Volume 4B consists of material evolved from Fascicles 5 and 6.[2] The manuscript was sent to the publisher on August 1, 2022, and the volume was published in September 2022.[3] Fascicle 7, planned for Volume 4C, was the subject of Knuth's talk on August 3, 2022.[4]

History

Donald Knuth in 2005

After winning a Westinghouse Talent Search scholarship, Knuth enrolled at the Case Institute of Technology (now Case Western Reserve University), where his performance was so outstanding that the faculty voted to award him a master of science upon his completion of the bachelor's degree. During his summer vacations, Knuth was hired by the Burroughs Corporation to write compilers, earning more in his summer months than full professors did for an entire year.[5] Such exploits made Knuth a topic of discussion among the mathematics department, which included Richard S. Varga.

In January 1962, when he was a graduate student in the mathematics department at Caltech, Knuth was approached by Addison-Wesley to write a book about compiler design, and he proposed a larger scope. He came up with a list of twelve chapter titles the same day. In the summer of 1962 he worked on a FORTRAN compiler for UNIVAC, considering that he had "sold my soul to the devil" to develop a FORTRAN compiler[6]: 15  after ALGOL developments with Burroughs. He remained as a consultant to Burroughs over the period 1960 to 1968 while writing Volume 1 'Fundamental Algorithms'.

During this time, he also developed a mathematical analysis of linear probing, which convinced him to present the material with a quantitative approach. After receiving his Ph.D. in June 1963, he began working on his manuscript, of which he finished his first draft in June 1965, at 3000 hand-written pages.[7] He had assumed that about five hand-written pages would translate into one printed page, but his publisher said instead that about 1+12 hand-written pages translated to one printed page. This meant he had approximately 2000 printed pages of material, which closely matches the size of the first three published volumes.

The first volume of ‘The Art of Computer Programming’, ‘Fundamental Algorithms’, took five years to complete between 1963 and 1968 while working at both Caltech and Burroughs.

Knuth's dedication in Volume 1 reads:

This series of books is affectionately dedicated
to the Type 650 computer once installed at
Case Institute of Technology,
in remembrance of many pleasant evenings.[a]

In the preface, he thanks first his wife Jill, then Burroughs for the use of B220 and B5500 computers in testing most of the programs, and Caltech, the National Science Foundation, and the Office of Naval Research.[8]: xii 

Section 2.5 of ‘Fundamental Algorithms’ is on Dynamic Storage Allocation. Parts of this are used in the Burroughs approach to memory management. Knuth claims credit for “The “boundary-tag” method, introduced in Section 2.5, was designed by the author in 1962 for use in a control program for the B5000 computer.”[8]: 460 

Knuth received support from Richard S. Varga, who was the scientific adviser to the publisher. Varga was visiting Olga Taussky-Todd and John Todd at Caltech. With Varga's enthusiastic endorsement, the publisher accepted Knuth's expanded plans. In its expanded version, the book would be published in seven volumes, each with just one or two chapters.[9] Due to the growth in Chapter 7, which was fewer than 100 pages of the 1965 manuscript, per Vol. 4A p. vi, the plan for Volume 4 has since expanded to include Volumes 4A, 4B, 4C, 4D, and possibly more.

In 1976, Knuth prepared a second edition of Volume 2, requiring it to be typeset again, but the style of type used in the first edition (called hot type) was no longer available. In 1977, he decided to spend some time creating something more suitable. Eight years later, he returned with TEX, which is currently used for all volumes.

Another characteristic of the volumes is the variation in the difficulty of the exercises including a numerical rating varying from 0 to 50, where 0 is trivial, and 50 is an open question in contemporary research.

Bounty for finding errors

The offer of a so-called Knuth reward check worth "one hexadecimal dollar" (100HEX base 16 cents, in decimal, is $2.56) for any errors found, and the correction of these errors in subsequent printings, has contributed to the highly polished and still-authoritative nature of the work, long after its first publication.

Assembly language in the book

All examples in the books use a hypothetical language called "MIX assembly language" (MIXAL), which runs on "a mythical computer called MIX". Currently,[when?] the MIX computer is being replaced by the MMIX computer, which is a RISC version. The conversion from MIX to MMIX was a large ongoing project for which Knuth solicited volunteers for help. Software such as GNU MDK[10] exists to provide emulation of the MIX architecture. Knuth considers the use of assembly language necessary for the speed and memory usage of algorithms to be judged.

MIX was much like any computer then in existence, but nicer. The name ‘MIX’ is 1009 in Roman numerals and this is given by a formula involving series numbers of several computers of the time: (360 + 650 + 709 + U3 + SS80 + 1107 + 1604 + G2- + B220 + S2000 + 920 + 601 + H800 + PDP-4 + 11)/16 = 1009 or MIX. The name MMIX is 2009 in Roman numerals and Knuth claims MMIX is even nicer than MIX.

Critical response

Knuth was awarded the 1974 Turing Award "for his major contributions to the analysis of algorithms […], and in particular for his contributions to the 'art of computer programming' through his well-known books in a continuous series by this title."[11] American Scientist has included this work among "100 or so Books that shaped a Century of Science", referring to the twentieth century.[12] Covers of the third edition of Volume 1 quote Bill Gates as saying, "If you think you're a really good programmer… read (Knuth's) Art of Computer Programming… You should definitely send me a résumé if you can read the whole thing."[13] The New York Times referred to it as "the profession's defining treatise".[14]

Volumes

Completed

Planned

  • Volume 4C, 4D, ...  Combinatorial Algorithms (chapters 7 & 8 released in several subvolumes)
    • Chapter 7 – Combinatorial searching (continued)
    • Chapter 8 – Recursion
  • Volume 5 – Syntactic Algorithms
  • Volume 6 – The Theory of Context-Free Languages
    • Chapter 11 – Mathematical Linguistics
  • Volume 7 – Compiler Techniques
    • Chapter 12 – Programming Language Translation

Chapter outlines

Completed

Volume 1 – Fundamental Algorithms

Volume 2 – Seminumerical Algorithms

Volume 3 – Sorting and Searching

  • Chapter 5 – Sorting
    • 5.1. Combinatorial Properties of Permutations
      • 5.1.1. Inversions
      • 5.1.2. Permutations of a Multiset
      • 5.1.3. Runs
      • 5.1.4. Tableaux and Involutions
    • 5.2. Internal sorting
      • 5.2.1. Sorting by Insertion
      • 5.2.2. Sorting by Exchanging
      • 5.2.3. Sorting by Selection
      • 5.2.4. Sorting by Merging
      • 5.2.5. Sorting by Distribution
    • 5.3. Optimum Sorting
      • 5.3.1. Minimum-Comparison Sorting
      • 5.3.2. Minimum-Comparison Merging
      • 5.3.3. Minimum-Comparison Selection
      • 5.3.4. Networks for Sorting
    • 5.4. External Sorting
      • 5.4.1. Multiway Merging and Replacement Selection
      • 5.4.2. The Polyphase Merge
      • 5.4.3. The Cascade Merge
      • 5.4.4. Reading Tape Backwards
      • 5.4.5. The Oscillating Sort
      • 5.4.6. Practical Considerations for Tape Merging
      • 5.4.7. External Radix Sorting
      • 5.4.8. Two-Tape Sorting
      • 5.4.9. Disks and Drums
    • 5.5. Summary, History, and Bibliography
  • Chapter 6 – Searching
    • 6.1. Sequential Searching
    • 6.2. Searching by Comparison of Keys
      • 6.2.1. Searching an Ordered Table
      • 6.2.2. Binary Tree Searching
      • 6.2.3. Balanced Trees
      • 6.2.4. Multiway Trees
    • 6.3. Digital Searching
    • 6.4. Hashing
    • 6.5. Retrieval on Secondary Keys

Volume 4A – Combinatorial Algorithms, Part 1

Volume 4B – Combinatorial Algorithms, Part 2

Planned

Volumes 4C, 4D, 4E, 4F – Combinatorial Algorithms[15]

Volume 5 – Syntactic Algorithms

Volume 6 – The Theory of Context-free Languages[16]

  • Chapter 11 – Mathematical Linguistics[17]

Volume 7 – Compiler Techniques

  • Chapter 12 – Programming Language Translation[17]

English editions

Current editions

These are the current editions in order by volume number:

  • The Art of Computer Programming, Volumes 1-4B Boxed Set. (Reading, Massachusetts: Addison-Wesley, 2023), 3904pp. ISBN 978-0-13-793510-9, 0-13-793510-2
    • Volume 1: Fundamental Algorithms. Third Edition (Reading, Massachusetts: Addison-Wesley, 1997), xx+650pp. ISBN 978-0-201-89683-1, 0-201-89683-4. Errata: [1] (2011-01-08), [2] (2022, 49th printing). Addenda: [3] (2011).
    • Volume 2: Seminumerical Algorithms. Third Edition (Reading, Massachusetts: Addison-Wesley, 1997), xiv+762pp. ISBN 978-0-201-89684-8, 0-201-89684-2. Errata: [4] (2011-01-08), [5] (2022, 45th printing). Addenda: [6] (2011).
    • Volume 3: Sorting and Searching. Second Edition (Reading, Massachusetts: Addison-Wesley, 1998), xiv+780pp.+foldout. ISBN 978-0-201-89685-5, 0-201-89685-0. Errata: [7] (2011-01-08), [8] (2022, 45th printing). Addenda: [9] (2011).
    • Volume 4A: Combinatorial Algorithms, Part 1. First Edition (Upper Saddle River, New Jersey: Addison-Wesley, 2011), xv+883pp. ISBN 978-0-201-03804-0, 0-201-03804-8. Errata: [10] (2011), [11] (2022, 22nd printing).
    • Volume 4B: Combinatorial Algorithms, Part 2. First Edition (Upper Saddle River, New Jersey: Addison-Wesley, 2023), xviii+714pp. ISBN 978-0-201-03806-4, 0-201-03806-4. Errata: [12] (2023, 1st printing).
  • Volume 1, Fascicle 1: MMIX – A RISC Computer for the New Millennium. (Addison-Wesley, 2005-02-14), 144pp. ISBN 0-201-85392-2. Errata: [13] (2024-05-14) (will be in the fourth edition of volume 1)

Previous editions

Complete volumes

These volumes were superseded by newer editions and are in order by date.

Fascicles

Volume 4, Fascicles 0–4 were revised and published as Volume 4A.

  • Volume 4, Fascicle 0: Introduction to Combinatorial Algorithms and Boolean Functions. (Addison-Wesley Professional, 2008-04-28) vi+240pp, ISBN 0-321-53496-4. Errata: [17] (2011-01-01).
  • Volume 4, Fascicle 1: Bitwise Tricks & Techniques; Binary Decision Diagrams. (Addison-Wesley Professional, 2009-03-27) viii+260pp, ISBN 0-321-58050-8. Errata: [18] (2011-01-01).
  • Volume 4, Fascicle 2: Generating All Tuples and Permutations. (Addison-Wesley, 2005-02-14) v+127pp, ISBN 0-201-85393-0. Errata: [19] (2011-01-01).
  • Volume 4, Fascicle 3: Generating All Combinations and Partitions. (Addison-Wesley, 2005-07-26) vi+150pp, ISBN 0-201-85394-9. Errata: [20] (2011-01-01).
  • Volume 4, Fascicle 4: Generating All Trees; History of Combinatorial Generation. (Addison-Wesley, 2006-02-06) vi+120pp, ISBN 0-321-33570-8. Errata: [21] (2011-01-01).

Volume 4, Fascicles 5–6 were revised and published as Volume 4B.

  • Volume 4, Fascicle 5: Mathematical Preliminaries Redux; Backtracking; Dancing Links. (Addison-Wesley, 2019-11-22) xiii+382pp, ISBN 978-0-13-467179-6. Errata: [22] (2020-03-27)
  • Volume 4, Fascicle 6: Satisfiability. (Addison-Wesley, 2015-12-08) xiii+310pp, ISBN 978-0-13-439760-3. Errata: [23] (2020-03-26)

Pre-fascicles

Volume 1

  • Pre-fascicle 1 was revised and published as Volume 1, fascicle 1.

Volume 4

  • Pre-fascicles 0A, 0B, and 0C were revised and published as Volume 4, fascicle 0.
  • Pre-fascicles 1A and 1B were revised and published as Volume 4, fascicle 1.
  • Pre-fascicles 2A and 2B were revised and published as Volume 4, fascicle 2.
  • Pre-fascicles 3A and 3B were revised and published as Volume 4, fascicle 3.
  • Pre-fascicles 4A and 4B were revised * and published as Volume 4, fascicle 4.
  • Pre-fascicles 5A, 5B, and 5C were revised and published as Volume 4, fascicle 5.
  • Pre-fascicle 6A was revised and published as Volume 4, fascicle 6.

The remaining pre-fascicles contain draft material that is set to appear in future fascicles and volumes.

See also

References

Notes

  1. ^ The dedication was worded slightly differently in the first edition.

Citations

  1. ^ "note for box 3, folder 1".
  2. ^ Pearson InformIT webpage book Content tab. Addison-Wesley Professional. 2022-09-28. ISBN 9780201038064.
  3. ^ Pearson InformIT webpage. Addison-Wesley Professional. 2022-09-28. ISBN 9780201038064.
  4. ^ "CP 2022 All Questions Answered, July 31–August 5, 2022, Haifa, Israel".
  5. ^ Frana, Philip L. (2001-11-08). "An Interview with Donald E. Knuth". hdl:11299/107413.
  6. ^ Feigenbaum, Edward (2007). "Oral History of Donald Knuth" (PDF). Computer History Museum. Archived (PDF) from the original on 2008-12-09. Retrieved 2020-09-17.
  7. ^ Knuth, Donald E. (1993-08-23). "This Week's Citation Classic" (PDF). Current Contents. p. 8.
  8. ^ a b Knuth, Donald Ervin (2019-08-03). "The Art of Computer Programming (TAOCP) 2nd Edition, 1973". Archived from the original on 2019-08-03. Retrieved 2018-02-06.
  9. ^ Albers, Donald J. (2008). "Donald Knuth". In Albers, Donald J.; Alexanderson, Gerald L. (eds.). Mathematical People: Profiles and Interviews (2 ed.). A. K. Peters. ISBN 978-1-56881-340-0.
  10. ^ "GNU MDK - GNU Project - Free Software Foundation". www.gnu.org. Retrieved 2022-10-23.
  11. ^ "Donald E. Knuth – A. M. Turing Award Winner". AM Turing. Retrieved 2017-01-25.
  12. ^ Morrison, Philip; Morrison, Phylis (November–December 1999). "100 or so Books that shaped a Century of Science". American Scientist. 87 (6). Sigma Xi, The Scientific Research Society. Archived from the original on 2008-08-20. Retrieved 2008-01-11.
  13. ^ Weinberger, Matt. "Bill Gates once said 'definitely send me a résumé' if you finish this fiendishly difficult book". Business Insider. Retrieved 2016-06-13.
  14. ^ Lohr, Steve (2001-12-17). "Frances E. Holberton, 84, Early Computer Programmer". The New York Times. Retrieved 2010-05-17.
  15. ^ D'Agostino, Susan (2020-04-16). "The Computer Scientist Who Can't Stop Telling Stories". Quanta Magazine. Retrieved 2023-06-26. Now 82, he's hard at work on part B of volume 4, and he anticipates that the book will have at least parts A through F.
  16. ^ "TAOCP – Future plans".
  17. ^ a b "TAOCP – Brochure" (PDF).
  18. ^ a b Wells, Mark B. (1973). "Review: The Art of Computer Programming, Volume 1. Fundamental Algorithms and Volume 2. Seminumerical Algorithms by Donald E. Knuth" (PDF). Bulletin of the American Mathematical Society. 79 (3): 501–509. doi:10.1090/s0002-9904-1973-13173-8.

Sources