John Snow Labs Spark NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment.
Before you start, install anaconda-client and conda-build:
conda install anaconda-client conda-buildMake sure you are logged in as JohnSnowLabs
conda loginPurge the previous builds:
conda build purgeTurn off auto-upload:
conda config --set anaconda_upload noBuild spark-nlp from the latest PyPI tar:
conda build . --python=3.6 && conda build . --python=3.7Example of uploading Conda package to Anaconda Cloud:
anaconda upload /anaconda3/conda-bld/noarch/spark-nlp-version-py36_0.tar.bz2Install spark-nlp by using conda:
conda install -c johnsnowlabs spark-nlphttps://github.com/JohnSnowLabs/spark-nlp
Take a look at our official spark-nlp page: http://nlp.johnsnowlabs.com/ for user documentation and examples
Apache Licence 2.0