Natural Language Processing libraries
Blazingly Fast ( < 1ms )
Offline, On device

C++ NLP library - You will get a header file and a .a file (binary static library) that you can use with your C++ program.

Python NLP library - You will get a .so file (binary) that you can import in your Python code.

Java NLP library - Coming soon!

Native, Offline libraries

Our commercial natural language processing library is a high-performance tool for analyzing your users' search queries. Use it as a text analyzer to better understand natural language queries and deliver better search results (e.g. by converting natural language to SQL queries).
It is implemented as a C and C++ library, so it gives you native efficiency, and it can be integrated with Java backends or other high-level languages such as Python, PHP and Perl.

Real time language processing

Fast analysis of natural language, with an average time of 1 ms for 10-word queries. This is an order of magnitude faster than some of the big-name hosted (PaaS, IaaS) services. It is ideal for real-time NLP applications involving text processing.
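A figure like this is easy to check on your own hardware. The following is a minimal Python sketch of a latency harness. The `analyze` function is a hypothetical stand-in (plain whitespace tokenization), not ThatNeedle's API; swap in whichever native call you are evaluating.

```python
import statistics
import time


def analyze(query):
    # Hypothetical placeholder for a native analysis call.
    # It only lowercases and splits on whitespace.
    return query.lower().split()


def mean_latency_ms(fn, query, runs=1000):
    """Return the mean wall-clock time of fn(query) in milliseconds."""
    fn(query)  # warm-up call so one-time setup cost is not measured
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn(query)
        samples.append((time.perf_counter() - start) * 1000.0)
    return statistics.mean(samples)


# A 10-word query, matching the query length quoted above.
query = "show me red running shoes under fifty dollars size ten"
print(f"mean latency: {mean_latency_ms(analyze, query):.4f} ms")
```

Measuring over many runs after a warm-up call gives a steadier number than timing a single call.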

No GPU required!

ThatNeedle's NLP libraries do not depend on GPUs or other hardware accelerators.
Our libraries run on CPUs, which makes for cost-efficient operations.
Many other frameworks, by contrast, require a GPU or hardware accelerator just to get the system running.


Most of our NLP libraries are less than 4 MB on disk, which makes them ideal for embedding in edge devices and for offline, on-device NLP applications.
They can also be configured to run as on-premise microservices.

Custom Needs?

We can incorporate your custom language processing needs into a custom library. It will deliver the same high performance ThatNeedle is known for. We charge only a reasonable customization fee, a fraction of what in-house engineering would cost.
You should definitely consider the ThatNeedle library as an alternative to Stanford NLP, NLTK or other open-source frameworks, especially for real-time, offline NLP applications or wherever speed and size matter. It also outperforms commercial alternatives such as Watson, LUIS and Google NLP, as well as open-source alternatives such as NLTK and spaCy. Please get in touch with your requirements if you are looking for an alternative to the above vendors.

Ready to be plugged in

Integrating the ThatNeedle library into your application is easy. All it takes is a RESTful call or a native function call to harness the power of many years of intense NLP research.
Apart from the RESTful API, which lets you run NLP offline (on your premises, without internet access), we also provide Python, C++ and other interfaces to the NLP library.

Some of ThatNeedle's proprietary binary NLP libraries

Custom Entity Recognition

Real-time detection of custom, niche entities in unstructured text (custom NER). Supports compound-word and multiword recognition.
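To illustrate what multiword recognition involves, here is a minimal Python sketch of a dictionary (gazetteer) lookup that prefers the longest match. It is an illustrative technique only, not ThatNeedle's implementation; the `find_entities` function, the `GAZETTEER` entries and the labels are all hypothetical.

```python
def find_entities(text, gazetteer, max_len=4):
    """Return (phrase, label) pairs found in text, longest match first."""
    tokens = text.lower().split()
    found = []
    i = 0
    while i < len(tokens):
        matched_len = 0
        # Try the longest candidate span first so multiword names
        # win over any shorter entries they contain.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            phrase = " ".join(tokens[i:i + n])
            if phrase in gazetteer:
                found.append((phrase, gazetteer[phrase]))
                matched_len = n
                break
        # Skip past the matched span, or advance one token if no match.
        i += matched_len if matched_len else 1
    return found


# Hypothetical custom entities for illustration.
GAZETTEER = {
    "new york": "CITY",
    "york": "CITY",
    "machine learning": "TOPIC",
}

print(find_entities("machine learning jobs in new york", GAZETTEER))
```

Here "new york" is reported as one entity rather than as the shorter entry "york", because longer spans are tried first.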

Voice Commands

Add custom vocabulary to your default speech-to-text engine. Improve the accuracy of custom voice command recognition.

Auto Suggest / Autocomplete

Show instant suggestions as the user types (typeahead). Fast enough for real-time use.
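As a rough picture of how typeahead lookup can work, the Python sketch below keeps the phrases in a sorted list and uses binary search to jump to the first phrase that could match a prefix. This is a generic technique, not a description of ThatNeedle's library; `build_index`, `suggest` and the sample phrases are hypothetical.

```python
import bisect


def build_index(phrases):
    """Sort the lowercased phrases once so prefix lookups can use binary search."""
    return sorted(p.lower() for p in phrases)


def suggest(index, prefix, limit=5):
    """Return up to `limit` phrases that start with `prefix`."""
    prefix = prefix.lower()
    # All phrases sharing a prefix are adjacent in a sorted list,
    # so find the first possible match and scan forward.
    start = bisect.bisect_left(index, prefix)
    results = []
    for phrase in index[start:]:
        if len(results) == limit or not phrase.startswith(prefix):
            break
        results.append(phrase)
    return results


index = build_index(["running shoes", "rain jacket", "running shorts", "red dress"])
print(suggest(index, "run"))
```

Each lookup costs one binary search plus a scan over the matches, so it stays fast as the phrase list grows.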

Name Gender Prediction

Predict whether a person is male or female from their name. Useful for marketing analytics and insights.

Topic Extraction

Extract technology topics and other niche topics from raw text.

Homophones correction

Library to automatically correct homophone errors in English transcriptions (e.g. to/too/two).

Fast Tokenizer

Fast text tokenization capability for better performance than the default tokenizers.

Niche word prediction

Predict missing words in a sentence based on training data from large text corpora.

Text diff library

Detect text edits and corrections. This is an advanced text diff tool that handles 1-N, N-1 and N-M edits in addition to 1-1 text edits.
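To make the edit types concrete, here is a small Python sketch that uses the standard library's `difflib.SequenceMatcher` on word lists. It is not ThatNeedle's diff library, only an illustration of what a 1-N edit looks like; the `word_edits` helper is hypothetical.

```python
import difflib


def word_edits(before, after):
    """Return (tag, old_words, new_words) for every non-matching span."""
    a, b = before.split(), after.split()
    matcher = difflib.SequenceMatcher(a=a, b=b, autojunk=False)
    edits = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            edits.append((tag, a[i1:i2], b[j1:j2]))
    return edits


# One word on the left becomes two words on the right: a 1-N edit.
print(word_edits("i want to go to newyork today",
                 "i want to go to new york today"))
```

The single reported edit replaces one word ("newyork") with two ("new", "york"). An N-1 edit is the same span with the sides swapped, and an N-M edit has several words on both sides.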


Library to convert Natural language to SQL for NLP database queries.

Numbers NLP

Natural language number handling. Handles numbers, ranges, times, prices, etc.
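As a toy illustration of what range and price handling means, the Python sketch below pulls a numeric range or an upper bound out of a query with regular expressions. The patterns and the `parse_range` function are hypothetical and cover only a few digit-based phrasings; they do not reflect how ThatNeedle's library works.

```python
import re

# An optional dollar sign followed by an integer or decimal number.
NUMBER = r"\$?(\d+(?:\.\d+)?)"
RANGE_RE = re.compile(
    rf"(?:between\s+)?{NUMBER}\s*(?:-|to|and)\s*{NUMBER}", re.IGNORECASE
)
UPPER_RE = re.compile(rf"(?:under|below|less than)\s+{NUMBER}", re.IGNORECASE)


def parse_range(query):
    """Return (low, high) as floats, with None for an open end, or None if no range is found."""
    m = RANGE_RE.search(query)
    if m:
        low, high = sorted((float(m.group(1)), float(m.group(2))))
        return (low, high)
    m = UPPER_RE.search(query)
    if m:
        return (None, float(m.group(1)))
    return None


print(parse_range("laptops between $500 and $800"))
print(parse_range("phones under 300"))
```

Spelled-out numbers ("fifty dollars"), times and units need far more than these two patterns, which is the gap a dedicated library is meant to fill.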

Text Classification

Semantic text classifiers for various verticals. They bring in niche domain knowledge for better, more relevant categorization.

Tell us about your business for custom optimized real time libraries:

contact us

Benefits of Offline, On-device NLP

1. Zero Network latency

On-device NLP means that you do not have to go over the network to analyse the intent and look up information for the query. This easily saves you about 100 milliseconds in round-trip time.

2. Data Security and Privacy

Working with your data on device or on premises keeps it safe from illegitimate network access, which is otherwise a risk whenever data is in motion. This leads to better data security.