Python Facebook Crawler Guide

The document outlines requirements for developing an open Facebook crawler in Python. It specifies details like the crawler needing a GUI to input login credentials and a page URL, crawling posts and comments from the given page, and saving the extracted data to an Excel file with fields like brand, post ID, date, content, reactions.

Uploaded by

Asim Anayat

OFFICIAL (CLOSED) \ NON-SENSITIVE

Open Facebook Crawler based on Python

Requirements and Deliverables: Implement and deliver an Open Facebook Crawler
in Python that automatically collects open Facebook posts and comments from a
given Open Facebook page, according to two specifications: 1) the Open
Facebook Crawler UI and 2) the Open Facebook Python Crawler. The crawler must
be able to run on any Windows notebook.

The delivery will include the Python source code, installers for the necessary
Python libraries, and user instructions for setting up and running the crawler
on a Windows notebook.

Specifications of the Open Facebook Crawler GUI

• There will be a Graphical User Interface (GUI) where users enter their
Facebook personal information (User_ID and Password) and the Open Facebook
Page URL to configure the crawl, as shown in Figure 1.

• A SAVE button to save the information. Create an Excel file in the same
folder as the application and save the username, password, and page URL in
that file.

• A START button to start the crawler.

• A STOP button to provide a hard stop for the crawler.

• The status of the crawler will be shown as either “Scraper is Running” or
“Scraper is not Running”.

• The progress of the crawl will be updated every 5-10 seconds with the total
number of posts and comments crawled so far.

• An incorrect User_ID and/or Password will invoke a warning prompt asking
the user to check and re-enter their personal Facebook information, as shown
in Figure 2.

• An incorrect Open Facebook Page URL will also invoke a warning prompt asking
the user to check and re-enter the URL, as shown in Figure 2.
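The START, STOP, and status requirements above could be backed by a small shared state object that the GUI reads and the crawler updates. The following is only a sketch under stated assumptions: the class name `CrawlState` and all of its method names are hypothetical, and the crawler worker is assumed to check `is_running()` between items so that STOP takes effect promptly.

```python
import threading


class CrawlState:
    """Shared state between the GUI and the crawler worker (hypothetical design)."""

    def __init__(self) -> None:
        self._running = threading.Event()  # set while the crawler should run
        self._lock = threading.Lock()      # protects the two counters
        self.posts = 0
        self.comments = 0

    def start(self) -> None:
        """Called by the START button."""
        self._running.set()

    def stop(self) -> None:
        """Called by the STOP button; the worker must poll is_running()."""
        self._running.clear()

    def is_running(self) -> bool:
        return self._running.is_set()

    def record(self, posts: int = 0, comments: int = 0) -> None:
        """Called by the worker after each batch of crawled items."""
        with self._lock:
            self.posts += posts
            self.comments += comments

    def status_text(self) -> str:
        # Wording taken from the specification.
        return "Scraper is Running" if self.is_running() else "Scraper is not Running"

    def progress_text(self) -> str:
        # The exact progress wording is an assumption; the spec only asks for totals.
        with self._lock:
            return f"Posts crawled: {self.posts} | Comments crawled: {self.comments}"
```

A GUI timer firing every 5-10 seconds (for example, Tkinter's `after` method) could call `status_text()` and `progress_text()` and refresh the corresponding labels.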

Figure 1: GUI for configuring the Open Facebook Crawl

Figure 2: Prompt dialog boxes shown when information is not entered correctly
in the GUI for configuring the Open Facebook Crawl
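Some of the checks behind the warning prompts can run before any network access. The sketch below is illustrative only: the function name `validate_inputs`, the set of accepted host names, and the exact warning wording are assumptions. It checks only the form of the inputs; wrong credentials or an unpublished page can be detected only when the crawler actually tries to log in or load the page.

```python
from __future__ import annotations

from urllib.parse import urlparse

# Assumed set of host names that count as a Facebook page URL.
FACEBOOK_HOSTS = {"facebook.com", "www.facebook.com", "m.facebook.com"}


def validate_inputs(user_id: str, password: str, page_url: str) -> list[str]:
    """Return warning messages for the prompt; an empty list means no problems found."""
    warnings = []

    # Both credential fields must be filled in.
    if not user_id.strip() or not password:
        warnings.append("Please check and re-enter your User_ID and Password.")

    # The URL must be http(s), point at a Facebook host, and name a page in its path.
    parsed = urlparse(page_url.strip())
    host = (parsed.hostname or "").lower()
    has_page = bool(parsed.path.strip("/"))
    if parsed.scheme not in ("http", "https") or host not in FACEBOOK_HOSTS or not has_page:
        warnings.append("Please check and re-enter the Open Facebook Page URL.")

    return warnings
```

The GUI could show one dialog box per returned message, or join them into a single prompt.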

Specifications of the Open Facebook Python Crawler

• The Open Facebook Python Crawler must be able to crawl all posts and
comments found on the given Facebook page.

• The following items are to be extracted from all the posts and comments
found on the Open Facebook Page and placed in an output Excel file, as shown
in Figure 3 and Table 1:

Facebook Posts: Brand, Post ID, Date, Content, No. Likes, No. Shares,
No. Comments
Facebook Comments: User Name, Comment, Post ID

Table 1: Typical items to be crawled

Figure 3: A typical Open Facebook Post and all the items to be crawled
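The items in Table 1 map naturally onto two record types, one for posts and one for comments. The sketch below is a possible in-memory layout, not the format of the sample output file: the class names, field names, column order, and the helper `to_rows` are all assumptions, and the values in the usage example are placeholders.

```python
from dataclasses import astuple, dataclass

# Column headers follow Table 1; their order in the real sample file is not known here.
POST_COLUMNS = ["Brand", "Post ID", "Date", "Content",
                "No. Likes", "No. Shares", "No. Comments"]
COMMENT_COLUMNS = ["User Name", "Comment", "Post ID"]


@dataclass
class Post:
    brand: str
    post_id: str
    date: str
    content: str
    likes: int
    shares: int
    comments: int


@dataclass
class Comment:
    user_name: str
    comment: str
    post_id: str  # links the comment back to its post


def to_rows(columns, records):
    """Return a header row followed by one row per record, ready for a sheet writer."""
    return [list(columns)] + [list(astuple(record)) for record in records]


# Placeholder data, for illustration only.
example_posts = [Post("ExampleBrand", "123", "2020-01-01", "Hello", 10, 2, 1)]
example_rows = to_rows(POST_COLUMNS, example_posts)
```

The resulting rows could then be written to one worksheet for posts and one for comments with whichever Excel library the implementation uses.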

How to get post ID

To get the post ID, click on the post's date. The post ID will be visible in
the URL.
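Once the post URL is known, the ID can be pulled out of it programmatically. The sketch below assumes two URL shapes, a `/posts/<id>` path segment and a `story_fbid` query parameter; these shapes and the function name `extract_post_id` are assumptions for illustration, and the URLs Facebook actually serves may differ.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlparse


def extract_post_id(post_url: str) -> Optional[str]:
    """Return the post ID found in a post URL, or None if no known shape matches."""
    parsed = urlparse(post_url)

    # Assumed shape 1: .../permalink.php?story_fbid=<id>&id=<page id>
    query = parse_qs(parsed.query)
    if "story_fbid" in query:
        return query["story_fbid"][0]

    # Assumed shape 2: .../<page>/posts/<id>
    match = re.search(r"/posts/([^/?#]+)", parsed.path)
    return match.group(1) if match else None
```

Returning `None` for an unrecognized URL lets the caller decide whether to skip the post or log it for manual review.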

Location of output Excel file

The output file should be saved in the same folder as the application. The
output Excel file should be named after the crawled Facebook page.
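The naming rule can be sketched as a small path helper. This is an illustration under assumptions: it treats the first path segment of the page URL as the page name, uses the `.xlsx` extension, and locates the application folder from `sys.argv[0]`; the function names are hypothetical, and a packaged executable may need a different way to find its own folder.

```python
import re
import sys
from pathlib import Path
from urllib.parse import urlparse


def page_name_from_url(page_url: str) -> str:
    """Use the first path segment of the page URL as the file name stem (assumption)."""
    path = urlparse(page_url).path.strip("/")
    name = path.split("/")[0] if path else "facebook_page"
    # Replace characters that are not allowed in Windows file names.
    return re.sub(r'[<>:"/\\|?*]', "_", name)


def output_path(page_url: str, app_dir=None) -> Path:
    """Return <application folder>/<page name>.xlsx."""
    base = Path(app_dir) if app_dir else Path(sys.argv[0]).resolve().parent
    return base / f"{page_name_from_url(page_url)}.xlsx"
```

Passing `app_dir` explicitly makes the helper easy to test and lets the GUI override the folder if the application is launched from elsewhere.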

How the output file should be formatted

A sample output file is provided; follow its format.

Meaning of open Facebook pages

Open pages are those that are published and visible on Facebook. If a page is
unpublished and cannot be accessed, show an error (as discussed above).

Note

• A company Facebook page may have, say, 300 posts and perhaps 3,000
comments. You must ensure your tool scrapes as many of the posts and
comments as possible.
• Scrape data in a way that does not cause Facebook to block the account
because of scraping activity.

Final Delivery

As discussed, please follow the specifications and the example output file,
and deliver the following: 1) the source code of the Python Facebook posts and
comments scraper, 2) a user guide on installing and running the code on the
Windows platform, and 3) installers for the necessary Python libraries, so
that we can run it on our end. Thanks.

Example lists of Typical Open Facebook Pages for Potential Crawling

The following are some examples of open Facebook pages:

No. Open Facebook Page

1 [Link]
2 [Link]
3 [Link]
4 [Link]
5 [Link]
6 [Link]
7 [Link]
