Galactic Algorithm

A galactic algorithm is one that runs faster than any other algorithm for problems that are sufficiently large, but where "sufficiently large" is so big that the algorithm is never used in practice. Galactic algorithms were so named by Richard Lipton and Ken Regan,[1] as they will never be used on any of the merely terrestrial data sets we find here on Earth.

An example of a galactic algorithm is the fastest known way to multiply two numbers,[2] which is based on a 1729-dimensional Fourier transform.[3] This means it will not reach its stated efficiency until the numbers have at least 21729 digits, (vastly) more digits than there are atoms in the universe. So this algorithm is never used in practice.

Despite the fact that they will never be used, galactic algorithms may still contribute to computer science:

  • An algorithm, even if impractical, may show new techniques that may eventually be used to create practical algorithms.
  • Computer sizes may catch up to the crossover point, so that a previously impractical algorithm becomes practical.
  • An impractical algorithm can still demonstrate that conjectured bounds can be achieved, or alternatively show that conjectured bounds are wrong. As Lipton says "This alone could be important and often is a great reason for finding such algorithms. For an example, if there were tomorrow a discovery that showed there is a factoring algorithm with a huge but provably polynomial time bound that would change our beliefs about factoring. The algorithm might never be used, but would certainly shape the future research into factoring." Similarly, a O(N2100) algorithm for the Boolean satisfiability problem, although unusable in practice, would settle the P versus NP problem, the most important open problem in computer science.[4] One immediate practical effect would be to earn the discoverer a million dollar prize from the Clay Mathematics Institute.

Examples

There are several well-known algorithms with world-beating asymptotic behavior, but only on impractically large problems:

  • Matrix multiplication: The first improvement over brute-force multiplication, O(N3) is the Strassen algorithm, a recursive algorithm that is O(N2.807). This algorithm is not galactic and used in practice. Further extensions of this, using sophisticated group theory, are the Coppersmith–Winograd algorithm and its slightly better successors, delivering O(N2.373). These are galactic – "We nevertheless stress that such improvements are only of theoretical interest, since the huge constants involved in the complexity of fast matrix multiplication usually make these algorithms impractical."[5]
  • Claude Shannon showed a simple but impractical code that could reach the capacity of a channel. It requires assigning a random code word to every possible N bit message, then decoding by finding the closest code word. If N is chosen large enough, this beats any existing code and can get arbitrarily close to the capacity of the channel. Unfortunately, any N big enough to beat existing codes is also completely impractical.[6] These codes, though never used, inspired decades of research into more practical algorithms that today can come quite close to channel capacity.
  • The problem of deciding whether a graph G contains H as a minor is NP-complete in general, but where H is fixed, it can be solved in polynomial time. The running time for testing whether H is a minor of G in this case is O(n2),[7] where n is the number of vertices in G and the big O notation hides a constant that depends superexponentially on H. The constant is greater than c>2↑↑(2↑↑(2↑↑(h/2))){\displaystyle c>2\uparrow \uparrow (2\uparrow \uparrow (2\uparrow \uparrow (h/2)))}{\displaystyle c>2\uparrow \uparrow (2\uparrow \uparrow (2\uparrow \uparrow (h/2)))} (using Knuth's up-arrow notation), and where h is the number of vertices in H.[8]
  • For cryptographers, a cryptographic "break" is anything faster than a brute-force attack – i.e., performing one trial decryption for each possible key in sequence. In many cases, even though they are the best known methods, they are still infeasible with current technology. One example is the best attack known against 128 bit AES, which takes only 2126 operations.[9] Despite being impractical, theoretical breaks can sometimes provide insight into vulnerability patterns.

References

  1. ^ Lipton, Richard J., and Kenneth W. Regan (2013). "David Johnson: Galactic Algorithms". People, Problems, and Proofs. Heidelberg: Springer Berlin. pp. 109–112.CS1 maint: multiple names: authors list (link)
  2. ^ David Harvey and Joris Van Der Hoeven (March 2019). "Integer multiplication in time O(n log n)".
  3. ^ David Harvey (Apr 10, 2019). "We've found a quicker way to multiply really big numbers". Phys.org.
  4. ^ Fortnow, L. (2009). "The status of the P versus NP problem" (PDF). Communications of the ACM. 52 (9): 78–86.
  5. ^ Le Gall, F. (2012), "Faster algorithms for rectangular matrix multiplication", Proceedings of the 53rd Annual IEEE Symposium on Foundations of Computer Science (FOCS 2012), pp. 514–523, arXiv:1204.1111, doi:10.1109/FOCS.2012.80.
  6. ^ Larry Hardesty (Jan 19, 2010). "Explained: The Shannon limit". MIT News Office.
  7. ^ Kawarabayashi, K. I., Kobayashi, Y., & Reed, B. (2012). "The disjoint paths problem in quadratic time". Journal of Combinatorial Theory, Series B. 102 (2): 424–435.CS1 maint: multiple names: authors list (link)
  8. ^ Johnson, David S. (1987). "The NP-completeness column: An ongoing guide (edition 19)". Journal of Algorithms. 8 (2): 285–303. CiteSeerX 10.1.1.114.3864. doi:10.1016/0196-6774(87)90043-5.
  9. ^ Biaoshuai Tao & Hongjun Wu (2015). Information Security and Privacy. Lecture Notes in Computer Science. 9144. pp. 39–56. doi:10.1007/978-3-319-19962-7_3. ISBN 978-3-319-19961-0.


from Hacker News https://ift.tt/2FAxwZh