Features

Offers multiple pipelines for natural language extraction at granular level.
Provides comprehensive contextual information for source code comments, more particularly extracts the preceding and succeding source code for each comment apart from other location based attributes.
Treats multiple consecutive comments as single comment.
Serializes mined data in highly interchangeable data format JSON and offers explicit loading mechanism of mined data enabling easier transfer and reuse.
Available as commandline executable and pip installable package (for use in other applications).