Models and Pipelines for the Spark NLP library
- 
            Updated
            Aug 12, 2021 
- Jupyter Notebook
Models and Pipelines for the Spark NLP library
Tutorial for Topic Modelling using PySpark and Spark NLP
Miscellaneous codes and writings for MLOps
Deep Learning notes and practical implementation with Tensorflow and keras. Text Analytics and practical application implementation with NLTK, Spacy and Gensim.
A real-time data cleaning pipeline for medical and healthcare data using Apache Spark, SparkNLP, Spark Streaming, and Kafka.
Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"
Python scripts to process, and analyze log files using PySpark.
Compilation of NLP notebooks from various sources that address several technical challenges.
NLP functions with John Snow's Spark NLP in the Java language
contains notebooks on topic modeling, spark and pandas implementation
SparkNLP and Healthcare SparkNLP based analysis of scientific literature on equine colic.
Final Project for Harvard's Scala for Big Data Systems course
Testing and benchmarking some of the existing NLP libraries in Apache Spark
A scalable and real-time data pipeline for processing, analyzing, and visualizing Twitter data.
A repository of notebooks and data sources for data engineers, data analysts and data scientists, chiefly proof of concept level
Final project of "Big Data Analytics and Business Intelligence" course.
Add a description, image, and links to the spark-nlp topic page so that developers can more easily learn about it.
To associate your repository with the spark-nlp topic, visit your repo's landing page and select "manage topics."