Processed Linkedin data from 700 *.gz files
by andyleung0927 - October 05, 2021 at 11:33 AM
#1
Hello RaidForums Community,
Thanks to RaidForums. The original LinkedIn data came as 700 *.gz files. I filtered it on the "countries" data field and obtained LinkedIn user data for 9 countries or regions: usa (99,847,021 lines), india (29,407,650 lines), japan (793,658 lines), philippines (3,489,898 lines), south korea (191,545 lines), vietnam (1,085,354 lines), hong kong (880,127 lines), macau (39,090 lines), taiwan (613,016 lines).
2021.12.26 update content: indonesia, singapore, malaysia.
Compromised data: id,full_name,first_name,middle_initial,middle_name,last_name,gender,birth_year,birth_date,linkedin_url,linkedin_username,linkedin_id,facebook_url,facebook_username,facebook_id,twitter_url,twitter_username,github_url,github_username,work_email,mobile_phone,industry,job_title,job_title_role,job_title_sub_role,job_title_levels,job_company_id,job_company_name,job_company_website,job_company_size,job_company_founded,job_company_industry,job_company_linkedin_url,job_company_linkedin_id,job_company_facebook_url,job_company_twitter_url,job_company_location_name,job_company_location_locality,job_company_location_metro,job_company_location_region,job_company_location_geo,job_company_location_street_address,job_company_location_address_line_2,job_company_location_postal_code,job_company_location_country,job_company_location_continent,job_last_updated,job_start_date,job_summary,location_name,location_locality,location_metro,location_region,location_country,location_continent,location_street_address,location_address_line_2,location_postal_code,location_geo,location_last_updated,linkedin_connections,inferred_salary,inferred_years_experience,summary,phone_numbers,emails,interests,skills,location_names,regions,countries,street_addresses,experience,education,profiles,certifications,languages,version_status
contained lines: 44.1GB (563.9 GB Compressed)
Sample:https://pastebin.com/LpyXt2ci
The data was classified by country
hongkong:https://gofile.io/d/YihA8I
#2
Is the processed one in CSV or JSON?
This forum account is currently banned. Ban Length: Permanent (N/A).
Ban Reason: stealing samples from other threads.
#3
Did you obtain Pakistan's data from it? If so, I'll definitely unlock this.
#4
(October 05, 2021 at 12:43 PM)amdryzen7 Wrote: Is the processed one in CSV or JSON?
It is in JSON format. That worked well for me.
#5
This same shit is being posted over and over. It comes from the torrent magnet link, which is now public data.
#6
thanks lets see what we got here
#7
are the files sorted by country? If so which files are canadian? any help is appreciated!
This forum account is currently banned. Ban Length: Permanent (N/A).
Ban Reason: Mass Leeching/Spammer
#8
(October 05, 2021 at 04:44 PM)Snakecharmer Wrote: are the files sorted by country? If so which files are canadian? any help is appreciated!
As he said, there are 9 files in the thread.
#9
u dumb idiot, give a list of the files. is it really that hard?
#10
(October 06, 2021 at 09:20 AM)kateido Wrote: u dumb idiot, give a list of the files. is it really that hard?

usa (99,847,021 lines), india (29,407,650 lines), japan (793,658 lines), philippines (3,489,898 lines), south korea (191,545 lines), vietnam (1,085,354 lines), hong kong (880,127 lines), macau (39,090 lines), taiwan (613,016 lines).
#11
The data is helpful. I can do some work based on the processed data.
#12
Could somebody remind me what this database is? Could I have seen it before?
