| www.Michael-Maniscalco.com : The MSufSort Algorithm | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Home Blog Compression Sorting Papers References Links History Sudoku About me Contact | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
MSufSort 3.1.1 beta is available! Results will be posted soon.
MSufSort.dll is also available. It exports the BWT, reverse BWT and suffix array functionality of 3.1 beta.
MSufSort 3.1.1 is a high speed and light weight suffix sorting algorithm. The comparisons below show the speed of this suffix sorting algorithm vs. several other leading suffix sorting algorithms including Archon 4r0 (Dima Malyshev), Deep-Shallow (Manzini & Ferragina), QSufSort (Larsson & Sadakane) and DivSufSort 1.2.2 (Mori). The first table is the timing results for each of the algorithms when applied to the files of Manzini's large file corpus. The machine used during testing was a Toshiba A75 S2112 laptop with 1.5GB and 3.08GHz clock speed. Processor speed was set to maximum with silent mode selected.
This second table shows the results for each suffix sorting algorithm on the new universal robustness corpus called "The Gauntlet." This test set is comprised of files which are typically difficult to sort and is intended to test the algorithms for both robustness and seek out catastrophic performance cases. This test set may be added to by anyone provided the file to be added demonstrates that any one of the suffix sorting algorithms listed demonstrates clear catastrophic performance. The file needs to be at least large enough to measure the algorithms overall performance. Smaller files can show times which are more reflective of a program's initialization time etc rather than actual algorithm robustness. The machine used during testing was a Toshiba A75 S2112 laptop with 1.5GB and 3.08GHz clock speed. Processor speed was set to maximum with silent mode selected.
Gauntlet corpus contributors: abba (10,500,600 bytes) - Michael Mansicalco book1x20 (15,375,420 bytes) - Michael Maniscalco fib_s14930352 (14,930,352 bytes) - Simon Puglisi fss9 (2,851,443 bytes) - Simon Puglisi fss10 (12,078,908 bytes) - Simon Puglisi houston (3,840,000 bytes) - Graham Houston paper5x80 (981,924 bytes) - Michael Maniscalco abac (200,000 bytes) - Yuta Mori test1 (2,097,152 bytes) - Yuta Mori test2 (2,097,152 bytes) - Yuta Mori test3 (2,097,152 bytes) - Yuta Mori |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||