Libraries Used in the Notebook
1. spacy
spaCy is a library for Natural Language Processing (NLP). It lets you analyze and
understand text in Python.
In this notebook, spaCy is used to:
• Split a sentence into individual words or punctuation marks (called tokens).
• Figure out what grammatical role each word plays, such as noun, verb, or adjective
(this is called part-of-speech, or POS, tagging).
The line used to load the English model:
nlp = spacy.load('en_core_web_sm')
loads a small English model that comes with vocabulary, grammar rules, and statistical
patterns.
Note: If you haven’t downloaded this model before, you'll need to run this in your terminal:
python -m spacy download en_core_web_sm
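As a quick illustration of both tasks (a minimal sketch; the example sentence and variable names here are illustrative, not taken from the notebook):

import spacy

nlp = spacy.load('en_core_web_sm')
doc = nlp("The quick brown fox jumps over the lazy dog.")
for token in doc:
    # token.text is the word itself, token.pos_ is its part-of-speech tag
    print(token.text, token.pos_)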
2. pandas
A popular Python library for working with structured data.
In this notebook, it’s used to organize and display the POS tagging results in a readable format
called a DataFrame, which looks like a table with rows and columns.
Example of creating an empty DataFrame:
pos_df = pd.DataFrame(columns=['token', 'pos_tag'])
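To get a feel for what a DataFrame looks like, here is a small standalone sketch (the sample rows are made up for illustration, not output from the notebook):

import pandas as pd

# Each dictionary becomes one row of the table
example_df = pd.DataFrame([
    {'token': 'emma', 'pos_tag': 'PROPN'},
    {'token': 'woodhouse', 'pos_tag': 'PROPN'},
    {'token': 'handsome', 'pos_tag': 'ADJ'},
])
print(example_df)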
What the Code Does
Step 1: Load the NLP Model
nlp = spacy.load('en_core_web_sm')
This line prepares spaCy to process English text.
The model understands grammar and can label each word with its role in the sentence.
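If you want to check what the loaded model can do, you can inspect its processing pipeline (an optional sketch, not part of the notebook itself):

import spacy

nlp = spacy.load('en_core_web_sm')
# Lists the pipeline components the model applies, e.g. a tagger and a parser
print(nlp.pipe_names)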
Step 2: Add a Text Sample
emma_ja = "emma woodhouse handsome clever and rich..."
This is a paragraph from Jane Austen’s Emma.
The text is already cleaned: it’s all lowercase and doesn’t contain punctuation.
This makes it simpler to analyze.
Step 3: Process the Text
spacy_doc = nlp(emma_ja)
The text is passed through the NLP model.
The result is a Doc object, which contains all the individual words and information about them
(like POS tags).
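To see what the Doc object holds, you can index into it or measure its length (a minimal sketch, assuming spacy_doc was created as in the line above):

# Each item in the Doc is a Token with attributes such as .text and .pos_
print(len(spacy_doc))            # number of tokens
first_token = spacy_doc[0]
print(first_token.text, first_token.pos_)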
Step 4: Set Up a Data Table
pos_df = pd.DataFrame(columns=['token', 'pos_tag'])
This creates a table structure where each word and its part-of-speech tag will be added.
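The notebook presumably fills this table from the Doc in a later step; one common way to do it looks like the following (a hedged sketch, assuming pandas is imported as pd and spacy_doc exists, not necessarily the exact code in the notebook):

# Build one row per token from the Doc, then show the first few rows
pos_df = pd.DataFrame(
    [{'token': token.text, 'pos_tag': token.pos_} for token in spacy_doc],
    columns=['token', 'pos_tag']
)
print(pos_df.head())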