Python SDK for NLP tasks, built on Spark NLP
Top 40.0% on sourcepulse
NLU (Natural Language Understanding) is a Python library designed to simplify the application of state-of-the-art NLP models. It acts as a facade for Spark NLP, offering over 1000 pre-trained models across 200+ languages, accessible with a single line of code. This library is ideal for data scientists and developers looking for a fast, accurate, and scalable solution for text analysis tasks.
How It Works
NLU leverages the power of Spark NLP, a distributed NLP library built on Apache Spark ML. It provides a unified API that abstracts away the complexities of Spark NLP, allowing users to load and apply models directly on various data structures like Pandas DataFrames, Spark DataFrames, and NumPy arrays. This approach enables efficient processing of large datasets and seamless integration into existing data science workflows.
Quick Start & Requirements
pip install nlu pyspark==3.0.2
Highlighted Details
Maintenance & Community
Licensing & Compatibility
Limitations & Caveats
pyspark==3.0.2
as a requirement, which might limit compatibility with newer Spark versions.6 months ago
Inactive