Building Complete State-of-the-art
Natural Language Processing Projects
with Free Software
David Talby
CTO, John Snow Labs
Agenda
1. Introduction
2. Python’s NLU Library
3. The Annotation Lab
4. The NLP Server
NLP is Growing Into a Foundational Enterprise Technology
60% of Technical Leaders stated that their NLP budget was at least 10% higher vs. 2020
33% of Technical Leaders stated that their NLP budget was at least 30% higher vs. 2020
NLP Industry Survey 2021, Gradient Flow, July-August 2021
“Half of Technical Leaders are using technologies for
data integration, NLP, business intelligence, and data warehousing.”
Healthcare AI Survey 2021, Gradient Flow, January-March 2021
SO, HOW DO WE BEST HELP THE
COMMUNITY & INDUSTRY SUCCEED?
BY GIVING YOU MORE FREE SOFTWARE
(THAT IS PROVEN, ROBUST, SCALABLE, AND SECURE)
BUILDING ON SPARK NLP
>10 million downloads. Most popular NLP tool in the enterprise (O’Reilly Media)
31% share of NLP practitioners use Spark NLP (Gradient Flow)
24x growth in downloads of the library during 2020 (PyPI Download Stats)
INTRODUCING THE NLU LIBRARY
The Simplicity of Python, the Power of Spark NLP
EXAMPLE: TOKENIZATION & SPELL CHECKING
SENTIMENT ANALYSIS
LANGUAGE DETECTION
CALCULATING EMBEDDINGS
NAMED ENTITY RECOGNITION
UNSUPERVISED KEYWORD EXTRACTION
MULTI-CLASS TOXIC CONTENT CLASSIFIER
SUMMARIZATION
CLOSED-BOOK QUESTION ANSWERING
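The code screenshots for the tasks above did not survive in this transcript. As a minimal sketch of the one-liner pattern they illustrate, assuming the `nlu` and `pyspark` packages are installed: the `TASK_SPELLS` table and `run_task` helper are hypothetical names introduced here, and the model references are assumptions taken from the public NLU documentation, so check them against your installed version.

```python
# Hypothetical lookup table: slide task -> NLU model reference ("spell").
# The references are assumptions based on the public NLU docs, not on the slides.
TASK_SPELLS = {
    "tokenization": "tokenize",
    "spell_checking": "spell",
    "sentiment": "sentiment",
    "language_detection": "lang",
    "embeddings": "bert",
    "named_entity_recognition": "ner",
    "keyword_extraction": "yake",
    "toxic_content": "toxic",
}


def run_task(task, text):
    """Load the pipeline for `task` and return its predictions.

    `nlu.load(...).predict(...)` returns a pandas DataFrame. The import is
    done lazily because it starts a Spark session and may download models.
    """
    import nlu

    pipeline = nlu.load(TASK_SPELLS[task])
    return pipeline.predict(text)
```

For example, `run_task("sentiment", "I love free software!")` would be the equivalent of the one-liner `nlu.load("sentiment").predict("I love free software!")`.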
NLU ALSO HAS ONE-LINER VISUALIZATIONS
DEPENDENCY PARSER VISUALIZATION
NLU EVEN HAS ONE-LINER STREAMLIT APPS
NLU STREAMLIT APPS CHEAT SHEET
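The visualization and Streamlit one-liners named above can be sketched as follows. This is an illustration under assumptions, not the code from the slides: the helper names are hypothetical, `dep.typed` is assumed to be the reference for the typed dependency parser, and `viz` and `viz_streamlit` are used as described in the public NLU documentation.

```python
# Assumed model reference for the typed dependency parser (from the NLU docs).
DEPENDENCY_SPELL = "dep.typed"


def show_dependency_tree(text):
    """Render a dependency tree for `text`; intended for a notebook cell."""
    import nlu  # lazy import: starts Spark and may download models

    nlu.load(DEPENDENCY_SPELL).viz(text)


def sentiment_app(examples):
    """Build a small Streamlit page around a sentiment pipeline.

    Call this from a script launched with `streamlit run your_script.py`;
    it is not meant to be called from a plain Python interpreter.
    """
    import nlu

    nlu.load("sentiment").viz_streamlit(examples)
```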
Getting Started with the NLU Library
ONE LINE INSTALLATION:
DOCUMENTATION:
https://nlu.johnsnowlabs.com/docs
COMMUNITY SUPPORT:
https://spark-nlp.slack.com @ #nlu
JOHN SNOW LABS’ ANNOTATION LAB IS NOW FREE
Free
Unlimited users, documents,
projects, tasks, models, …
Share nothing:
Self-hosted on your infrastructure
Robust
2 years in production
Now in its 23rd version
Actively developed & supported
Capable
Active learning, Transfer learning
Model training & deployment
Projects & Teams
Roles & Workflows
Versioning & Audit
Analytics: Productivity,
Agreement, Bias, Teamwork
High Productivity Annotation
Label entities, relations, assertions, classes, values, sequences, …
Keyboard Shortcuts
Versioned & Auditable Completions
Start from model predictions
Shared Guidelines
One-click edits
Teamwork
Projects & Roles
Pre-Annotate: Don’t Start from Scratch
Filter Selected Tasks
Choose Model & Embedding
Filter Selected Model Labels
Asynchronous Model Server
(Runs on Kubernetes)
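The three pre-annotation choices above (which tasks, which model, which of the model's labels) amount to a simple filter around a prediction call. A minimal sketch of that flow, assuming tasks are plain dictionaries and the model is any callable that returns labelled spans; every name here is hypothetical and none of it is the Annotation Lab's actual code or API.

```python
def pre_annotate(tasks, model, selected_task_ids, selected_labels):
    """Return model predictions as draft annotations for the chosen tasks."""
    drafts = {}
    for task in tasks:
        if task["id"] not in selected_task_ids:
            continue  # step 1: only the tasks the user selected
        predictions = model(task["text"])  # step 2: the chosen model
        # step 3: keep only the labels the user asked for
        drafts[task["id"]] = [p for p in predictions if p["label"] in selected_labels]
    return drafts


def toy_model(text):
    """Hypothetical stand-in for an NER model: tags title-cased words as PERSON."""
    return [
        {"text": word, "label": "PERSON" if word.istitle() else "OTHER"}
        for word in text.split()
    ]
```

Annotators then correct these drafts instead of labelling each task from scratch.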
Automated Model Training & Active Learning
Async Training Container
Realtime Training Log
Filter Completions by State or Tag
Edit Hyper-Parameters
Choose Embeddings
Train & Deploy
Or Fully Automate
Download, Upload, and Publish Models
Analytics
Project Tracking Productivity
Bias in Dataset or Guidelines
Inter-Annotator Agreement
Agreement between annotators,
or between human & model
Drill-down to specific cases
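The slides do not say which agreement statistic the Annotation Lab reports. As an illustration only, Cohen's kappa is one common way to score agreement between two annotators (or between a human and a model) while correcting for chance, and listing the disagreeing positions gives a simple drill-down. Both function names below are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equally long sequences of labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label sequences of equal length")
    n = len(labels_a)
    # Share of items on which both annotators chose the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each annotator's label frequencies.
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    expected = sum(count_a[label] * count_b[label] for label in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1 - expected)


def disagreements(labels_a, labels_b):
    """Indices where the two annotators differ, for drilling into specific cases."""
    return [i for i, (a, b) in enumerate(zip(labels_a, labels_b)) if a != b]
```

With annotator A giving `["PER", "ORG", "PER", "LOC"]` and annotator B giving `["PER", "ORG", "LOC", "LOC"]`, observed agreement is 3/4, chance agreement is 5/16, and kappa is 7/11 (about 0.64); the only disagreement is at index 2.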
Getting Started with the Annotation Lab
ONE LINE INSTALLATION:
DOCUMENTATION:
https://nlp.johnsnowlabs.com/docs/en/alab/quickstart
COMMUNITY SUPPORT:
https://spark-nlp.slack.com @ #annotation-lab
JOHN SNOW LABS’ NLP SERVER IS NOW FREE
Free
Unlimited models, calls, servers
Share nothing:
Self-hosted on your infrastructure
Early Stage
Started in 2021
Now in its 3rd version
Actively developed & supported
Capable
No coding required
3,000+ models & growing
UI + REST API
The User Interface
1. Search spells by name or description
2. Type text to analyze or upload a CSV
3. Hit the ‘Preview’ button
Viewing Results
1. Models are downloaded to local storage
2. Models are loaded & cached in memory
3. Model inference is executed
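The three steps above follow a familiar download-once, cache-in-memory, then infer pattern. A minimal sketch of that pattern using only the standard library; the function names, the path, and the stub model are all hypothetical and do not reflect how the NLP Server is actually implemented.

```python
from functools import lru_cache

DOWNLOADS = []  # records which models were "downloaded", for illustration only


def download_model(name):
    """Stand-in for step 1: fetch model files to local storage."""
    DOWNLOADS.append(name)
    return f"/tmp/models/{name}"  # hypothetical local path


@lru_cache(maxsize=8)
def load_model(name):
    """Step 2: load the model once and keep it cached in memory."""
    path = download_model(name)
    # A real server would deserialize the model here; this stub upper-cases text.
    return lambda text: {"model": name, "path": path, "result": text.upper()}


def infer(name, text):
    """Step 3: run inference with the cached model."""
    return load_model(name)(text)
```

Calling `infer` twice with the same model name triggers a single download; only the first request pays the loading cost.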
Downloading Results
The REST API
Same functionality as the user interface, delivered as a single Docker container
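A sketch of what a client for such a REST API could look like, using only the standard library. The base URL, port, endpoint path, and the `spell` and `data` parameter names are assumptions made for illustration and are not confirmed by the slides; consult the NLP Server documentation for the real interface.

```python
import json
from urllib import parse, request

BASE_URL = "http://localhost:5000"  # assumed host and port of the container
RESULTS_PATH = "/api/results"       # assumed endpoint path


def build_request(spell, text, base_url=BASE_URL):
    """Build (but do not send) a POST request asking the server to run `spell` on `text`."""
    query = parse.urlencode({"spell": spell, "data": text})  # assumed parameter names
    return request.Request(f"{base_url}{RESULTS_PATH}?{query}", method="POST")


def analyze(spell, text, base_url=BASE_URL):
    """Send the request and decode the JSON response body."""
    with request.urlopen(build_request(spell, text, base_url), timeout=60) as response:
        return json.loads(response.read().decode("utf-8"))
```

Separating `build_request` from `analyze` makes it possible to inspect the outgoing URL without a running server.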
NLP Server Roadmap
Model training & auto-tuning
Serve your custom trained NLP models
Run in air-gapped enterprise environments
Model publishing from the Annotation Lab
Run Healthcare NLP & Spark OCR models
Getting Started with the NLP Server
ONE LINE INSTALLATION:
DOCUMENTATION:
https://nlp.johnsnowlabs.com/docs/en/nlp_server/nlp_server
COMMUNITY SUPPORT:
https://spark-nlp.slack.com @ #nlp-server
Please put it all to good use!
https://nlp.johnsnowlabs.com/docs
Thank you!
© 2015-2021 John Snow Labs Inc. All rights reserved. The John Snow Labs logo is a trademark of John Snow Labs Inc. The included information is for informational purposes only and represents the current view of John Snow Labs as of the date of this presentation. Since John Snow Labs must respond to changing market conditions, it should not be interpreted to be a commitment on its part, and John Snow Labs cannot guarantee the accuracy of any information provided after the date of this presentation. John Snow Labs makes no warranties, express or statutory, as to the information in this presentation.
david@JohnSnowLabs.com
@davidtalby
in/davidtalby

Building State-of-the-art Natural Language Processing Projects with Free Software - Scale by the Bay 2021