Parler Parser

The Parler parser parses Parler HTML posts and user profile pages. Parler post dumps can be found here.

Parsed Entities:

Refer to here

Example Post Parser:

import glob

from parler.parser.postParser import PostParser
from parler.dataType.post import Post

# Collect every saved post file in the posts/ directory.
files = glob.glob('posts/*')

data = []
for file in files:
    post = PostParser(file).parse()
    # Skip files for which parse() returns None.
    if post is not None:
        data.append(post.convert())

print(data)

Example Profile Page Parser:

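from parler.parser.profilePageParser import ProfilePageParser

# Path to a saved profile page (Windows-style path; adjust for your system).
file = r".\profile\00KimPossible00\posts\index.html"
# Timestamp passed to the parser alongside the file.
timestamp = 20201124075219

profilePage = ProfilePageParser(file, timestamp)

# parse() returns the profile's user and the posts found on the page.
user, posts = profilePage.parse()

print(user.convert())
print()

for post in posts:
    print(post.convert())
    print()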
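
The timestamp in the example above looks like a 14-digit YYYYMMDDhhmmss value. That reading is an assumption based on its digits, not something the parser documents here. Under that assumption, a small helper (the name `parse_capture_timestamp` is hypothetical and not part of the parler package) can turn it into a `datetime` for inspection:

```python
from datetime import datetime


def parse_capture_timestamp(timestamp: int) -> datetime:
    """Interpret a 14-digit integer as YYYYMMDDhhmmss (assumed format)."""
    return datetime.strptime(str(timestamp), "%Y%m%d%H%M%S")


print(parse_capture_timestamp(20201124075219))  # 2020-11-24 07:52:19
```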

Sample Output

You should get the same results as shown in sample_output.
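
To compare your results with the sample output, it can help to write the parsed data to a JSON file first. The sketch below assumes that `convert()` returns a JSON-serializable dictionary, which is not confirmed here. The helper names `save_parsed` and `load_parsed` are hypothetical, and the record shown is a placeholder rather than real parser output:

```python
import json
import os
import tempfile


def save_parsed(data, path):
    """Write a list of converted posts to a JSON file."""
    with open(path, "w", encoding="utf-8") as f:
        # default=str is a fallback for values json cannot encode directly.
        json.dump(data, f, indent=2, ensure_ascii=False, default=str)


def load_parsed(path):
    """Read a list of converted posts back from a JSON file."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


# Placeholder record; real records would come from post.convert().
example = [{"comment_count": 0, "echo_count": 0, "upvote_count": 0}]

with tempfile.TemporaryDirectory() as tmp:
    out_path = os.path.join(tmp, "parsed_posts.json")
    save_parsed(example, out_path)
    print(load_parsed(out_path) == example)  # True
```

In practice, `example` would be replaced by the `data` list built in the post parser example.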

Parsing Logic

Determine which type of post we are dealing with:
- New Post
- Echoed Post
- Echoed Post with Reply
- Echoed Post with Root Echo and No Reply
- Echoed Post with Root Echo and Reply
If it is a new post, parse the only post as the main post; otherwise, parse the reply post as the main post.
If it is not a new post, also parse the echoed post.
If it is an Echoed Post or an Echoed Post with Root Echo and No Reply:
- Use the "Echoed by ..." line to fill in the main post's user and created_at.
- Take the username from the meta information stored in the header.
- No profile badge is available in the post in this case.
- The comment_count, echo_count, and upvote_count belong to the echoed post.
Otherwise:
- The comment_count, echo_count, and upvote_count belong to the main post.
If it is an Echoed Post with Root Echo and No Reply or an Echoed Post with Root Echo and Reply:
- Parse the first post as the root echo.
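
The rules above can be summarized as a small decision table. The following is a minimal sketch of that logic only, not the code used in the parler package. The `PostType` enum, the `plan_parse` function, and the dictionary keys are hypothetical names introduced for illustration:

```python
from enum import Enum, auto


class PostType(Enum):
    NEW_POST = auto()
    ECHOED_POST = auto()
    ECHOED_POST_WITH_REPLY = auto()
    ECHOED_POST_WITH_ROOT_ECHO_NO_REPLY = auto()
    ECHOED_POST_WITH_ROOT_ECHO_AND_REPLY = auto()


# Echo types where the main post is described only by the "Echoed by ..." line.
NO_REPLY_ECHOES = {
    PostType.ECHOED_POST,
    PostType.ECHOED_POST_WITH_ROOT_ECHO_NO_REPLY,
}

# Echo types that also contain a root echo.
ROOT_ECHOES = {
    PostType.ECHOED_POST_WITH_ROOT_ECHO_NO_REPLY,
    PostType.ECHOED_POST_WITH_ROOT_ECHO_AND_REPLY,
}


def plan_parse(post_type: PostType) -> dict:
    """Return which parts to parse and which post owns the counters."""
    is_new = post_type is PostType.NEW_POST
    no_reply_echo = post_type in NO_REPLY_ECHOES
    return {
        "main_post_source": "only_post" if is_new else "reply_post",
        "parse_echoed_post": not is_new,
        "fill_main_from_echoed_by_line": no_reply_echo,
        "counts_owner": "echoed_post" if no_reply_echo else "main_post",
        "parse_root_echo": post_type in ROOT_ECHOES,
    }


print(plan_parse(PostType.ECHOED_POST)["counts_owner"])  # echoed_post
```

Keeping the decisions in one function makes it easy to check each of the five post types against the rules without touching any HTML.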
