Google’s robots.txt Parser is Now Open Source

While this is an important step, it means extra work for developers who parse robots.txt files.

We’re here to help: we open sourced the C++ library that our production systems use for parsing and matching rules in robots.txt files. Since then, the library evolved; we learned a lot about how webmasters write robots.txt files and about the corner cases we had to cover, and, where it made sense, we added what we learned over the years to the internet draft.

We also included a testing tool in the open source package to help you test a few rules. Once built, its usage is very straightforward.

If you want to check out the library, head over to our GitHub repository for the robots.txt parser.
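To make concrete what "parsing and matching rules" in a robots.txt file involves, here is a minimal sketch. It uses Python's standard-library `urllib.robotparser`, not Google's C++ library, and the two parsers can disagree on corner cases. The robots.txt body, the `ExampleBot` and `OtherBot` user agents, the URLs, and the `is_allowed` helper are all made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, invented for this example.
ROBOTS_TXT = """\
User-agent: ExampleBot
Disallow: /private/

User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Parse robots.txt text and report whether user_agent may fetch url.

    Illustrative helper built on the Python standard library; it is not
    part of Google's open source parser.
    """
    parser = RobotFileParser()
    # parse() takes an iterable of lines rather than one string.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # ExampleBot has its own group, so only /private/ is off limits to it.
    print(is_allowed(ROBOTS_TXT, "ExampleBot", "https://example.com/public/page.html"))   # True
    print(is_allowed(ROBOTS_TXT, "ExampleBot", "https://example.com/private/data.html"))  # False
    # Any other crawler falls back to the "*" group, which disallows everything.
    print(is_allowed(ROBOTS_TXT, "OtherBot", "https://example.com/public/page.html"))     # False
```

The three inputs here (robots.txt content, a user agent, and a URL) are the same pieces of information any robots.txt matcher needs in order to answer whether a crawl is permitted.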

Source: opensource.googleblog.com