CSV Wattpad 270M Cleaned / Parsed
by stealth167 - April 23, 2021 at 11:01 PM
#1
[attachment=1691]This is parsed CSV from the original 117GB SQL dump which you can get from here
https://raidforums.com/Thread-Wattpad-Da...d-Download

Original file was hard to work with because it has 100.000 lines with 2500 records per line
i used php custom script to parse it to 1 record per line csv,
i also excluded columns that i not need
End up with 3.8GB file1 (13.2 uncompressed) with 160M records that have RealName included
and 2.4GB file2 (9.7GB uncompressed) with 100M records (only username, no realname)
parsed fields are
$ecsv=array(
'NAME'=>"",
'EMAIL'=>"",
'LOGINDATE'=>"",
'ACTIVE'=>"",
'VERIFIED'=>"",
'REALNAME'=>"",
'LOCATION'=>"",
'COUNTRY'=>"",
'WEBSITE'=>"",
'PHONE'=>"",
'SUBSCRIBE'=>"",
'DOB'=>"",
'GENDER'=>"",
'FBID'=>"",
'TWITTERID'=>"",
'AVATAR_TIMESTAMP'=>"",
'BACKGROUND_TIMESTAMP'=>"",
'EMAILBOUNCEDATE'=>"",
);

password hash is not included in csv, i don't need it, i'm sure you can figure out how to pull it from original csv
all timestamps are year only, all NULLs are blank

Considering the total uncompressed size is 23GB vs 117GB original, i think some of you might find it handy
sample csv attached

mega.nz links Hidden Content
You must register or login to view this content.
Reply
#2
(April 23, 2021 at 11:01 PM)stealth167 Wrote: This is parsed CSV from the original 117GB SQL dump which you can get from here
https://raidforums.com/Thread-Wattpad-Da...d-Download

Original file was hard to work with because it has 100.000 lines with 2500 records per line
i used php custom script to parse it to 1 record per line csv,
i also excluded columns that i not need
End up with 3.8GB file1 (13.2 uncompressed) with 160M records that have RealName included
and 2.4GB file2 (9.7GB uncompressed) with 100M records (only username, no realname)
parsed fields are
$ecsv=array(
'NAME'=>"",
'EMAIL'=>"",
'LOGINDATE'=>"",
'ACTIVE'=>"",
'VERIFIED'=>"",
'REALNAME'=>"",
'LOCATION'=>"",
'COUNTRY'=>"",
'WEBSITE'=>"",
'PHONE'=>"",
'SUBSCRIBE'=>"",
'DOB'=>"",
'GENDER'=>"",
'FBID'=>"",
'TWITTERID'=>"",
'AVATAR_TIMESTAMP'=>"",
'BACKGROUND_TIMESTAMP'=>"",
'EMAILBOUNCEDATE'=>"",
);

password hash is not included in csv, i don't need it, i'm sure you can figure out how to pull it from original csv
all timestamps are year only, all NULLs are blank

Considering the total uncompressed size is 23GB vs 117GB original, i think some of you might find it handy
sample csv attached

[Hidden Content]

Do u have Russian , Belarusian and Ukrainian lines there?
Reply
#3
(April 23, 2021 at 11:51 PM)ZedStore Wrote: Do u have Russian , Belarusian and Ukrainian lines there?
strings are parsed as is in the original sql, except for replacing all ' and "  with spaces ( used for enclosure in the csv output) 
so if you meant cyrilyc letters, all should be there as it was written in the sql
Reply
#4
Thanks a lot for adding to mega (fastest), and cleaning up the DB. appreciated Heart
This forum account is currently banned. Ban Length: Permanent (N/A).
Ban Reason: Posted beastiality in SB.
Reply
#5
Thanks for this. can you tell how many us records are there.
Reply
#6
Does it contain a password?
Reply
#7
Awesome bro. Also, can't you upload in another service?
Reply
#8
(April 24, 2021 at 01:06 PM)egaquzu144 Wrote: Does it contain a password?
Password hashes are not included, Read the post properly OP mentioned it clearly.
Reply
#9
What is the unzip password?
Reply
#10
(April 26, 2021 at 10:59 AM)0w0 Wrote: What is the unzip password?
No password
Maybe your file got corrupted during download
Reply
#11
(April 27, 2021 at 12:42 AM)stealth167 Wrote:
(April 26, 2021 at 10:59 AM)0w0 Wrote: What is the unzip password?
No password
Maybe your file got corrupted during download
Sorry, I downloaded the wrong link. Your link works fine and smooth
Reply

Possibly Related Threads…
Thread Author Replies Views Last Post
CSV Raychat users CSV cleaned 3.5kk Tonderba 5 582 2 hours ago
Last Post: Tonderba
Wedmegood.com members+ CLEANED louisthomas82 2 405 9 hours ago
Last Post: louisthomas82
CSV Apollo.io v5 - 61M Records - Parsed LeakedInsta 49 11,986 May 04, 2021 at 03:38 PM
Last Post: torospeed

 Users browsing this thread: 2 Guest(s)