Friday, January 29, 2021

Having fun with bzip2

  bzip2, an standard compression utility in Linux, Mac and friends has a very efficient compression ratio for low entropy files. I just tested to compress a big 89Gigabytes "all-white-spaces" file. The final compressed size was about 0.7Megabtyes, with a compression ratio of about x130.000. This is not very impressive from a mathematical point of view, since all the information could be packed in just 6 few bytes (a single byte for the white space plus 5 more bytes for the total size or around 16.000 million compression ratio). But the important thing is that bzip2 packs data in an standard format, and by just looking at the bz2 extensions, many software knows what to do with it.


 Now the funny part. 89 Gigabytes is much more that the storage capacity of most old/unattended servers and low end cloud servers, that curiously, are the main target of worms infecting the Internet. I have a personal web server and it is a bit frustrating checking that around 99% of visits are (were) attacks trying to find holes in some WordPress, php services that could be installed on the machine or just trying good luck searching for passwords files and crypto wallets.


 I was wondering what would happen if I uploaded my compressed 0.7Megabyte bz2 file, renamed it as "password.bz2", and redirected visits like /login, /wp-content, /backup, /administrator, /back/wallet.dat to it. The preliminary results looks to indicate that Internet worms like to download the bz2 file, but then they have some problems to "digest" the uncompressed 89 GB file. One day after installing the file, "non-polite" visits decreased to just 60% (from 95%+ one day before). Two days later, they just represented 40%. With some more tuning for forward-rules I guess results can improve.