Notes and Sources

This resource aggregates data on:

The data in Silk is organized in "collections", which group pages with similar types of data. In this specific case, there are three collections: "Companies", "Companies - Crowdsourced data" and "Women Graduates, Bachelors'". Details about the sources and structure of each are detailed in the following sections.

Data collection "Companies"

Contains data from each company's lates diversity report. We aggregated the information from each and normalize the format.



Page Title
Source
Data Refers to:
Airbnbhttp://www.theverge.com/2015/1/20/7856467/airbnb-diversity-report-gender-race-interactive-chart2014
Alcatelhttps://www.alcatel-lucent.com/sustainability/social-indicators2014
Amazonhttp://www.amazon.com/b?node=100800920112014
Applehttp://www.apple.com/diversity/2015
Ciscohttp://www.cisco.com/web/about/ac49/ac55/workforce_diversity.html2014
eBayhttps://www.ebayinc.com/stories/news/building-a-more-diverse-ebay-and-paypal/2015
Facebookhttp://newsroom.fb.com/news/2014/06/building-a-more-diverse-facebook/2015
Googlehttp://www.google.com/diversity/index.html2015
HPhttp://h20195.www2.hp.com/V2/GetDocument.aspx?docname=c046739362014
Indiegogohttps://go.indiegogo.com/blog/2014/08/diversity-matters-always.html2014
Intelhttp://www.intel.com/content/www/us/en/diversity/diversity-in-technology-intel-2015-midyear-progress-report.html2015
LinkedInhttp://www.slideshare.net/linkedin/linked-in-2015-workforce-diversity?ref=http://blog.linkedin.com/2015/06/08/linkedins-2015-workforce-diversity/2015
Microsofthttp://www.microsoft.com/en-us/diversity/inside-microsoft/default.aspx#fbid=TboWYRJiHG-?epgDivFocusArea2015
NVIDIAhttp://www.nvidia.com/object/fy15-workforce-performance.html2015
Pandorahttp://www.pandora.com/careers/#diversity2014
Pinteresthttps://blog.pinterest.com/en/our-plan-more-diverse-pinterest2015
Qualcommhttps://www.qualcomm.com/documents/inclusion-and-diversity-creating-company-reflects-world2014
Salesforcehttp://www.salesforce.com/company/careers/diversity-numbers.jsp2014
SanDiskhttp://www.sandisk.com/about-sandisk/corporate/diversity/?
Symantechttps://www.symantec.com/corporate_responsibility/topic.jsp?id=diversity_inclusion2015
Telstrahttp://www.telstra.com.au/aboutus/investors/governance-at-telstra/diversity-inclusion/2014
Twitterhttps://blog.twitter.com/2014/building-a-twitter-we-can-be-proud-of2014
Verizonhttp://www.verizon.com/about/sites/default/files/2014_Verizon_Corporate_Social_Responsibility_Report.pdf2014
Yahoo!http://yahoo.tumblr.com/post/89085398949/workforce-diversity-at-yahoo2014
Yelphttp://officialblog.yelp.com/2014/08/workforce-diversity-at-yelp.html2014



Data Collection "Companies - Crowdsourced Data"

Tracy Chou, Software Engineer at Pinterest, has been collecting data on women software engineers in different tech companies, and on their number as a percentage of the whole engineering team. 



She is "counting 'female engineers' as women who are writing or architecting software, and are in full-time roles. This generally does not include people just writing HTML/CSS (depending on the level of sophistication of the CSS being written), designers, PMs, sysadmins, etc.". The data is crowdsourced, but the contributors are traceable and Tracy maintains some oversight on the project, as you can read more here.

The pages in the collection contain the following variables, which you can combine and filter to create your custom visualizations:

  • Team: When the data applies only to a specific section/team of the company, it is specified here (like the case for the Parse team for Facebook)
  • Year Founded, Type, Industry, Company Size (employees), Address, City, State (US),  Country: Data retrieved from the LinkedIn pages of each company. The Geographical data refers to the headquarter's of the company.
  • Female Engineers, Total Engineers,  Percentage Female Engineers, Updated on: data from the project Women in Software by Tracy Chou.
  •  Founders,  Funding, Categories (CrunchBase), Description: Additional data about the company retrieved mostly from CrunchBase. In a few cases where the company wasn't on CruchBase, the information is taken from AngelList.
  • CrunchBase / AngelList Profile, LinkedIn Profile, Homepage, Twitter Handle: Specific sources used for the company's profile on Silk

Data Collection "Women Graduates, Bachelors"

The data in this collection is organized by year (1970 - 2011), and each page contains information about the percentage of woman obtaining a Bachelor in each of the following areas:

  • Agriculture
  • Architecture

  • Art and Performance

  • Biology

  • Business

  • Communications and Journalism

  • Computer Science

  • Education

  • Engineering

  • English

  • Foreign Languages

  • Health Professions

  • Math and Statistics

  • Physical Sciences

  • Psychology

  • Public Administration

  • Social Sciences and History

The data comes from a spreadsheet created by Randy Olson, by cleaning the data from the US NCES 2013 Digest of Education. You can read more in his LinkedIn article.

Made with Silk

Silk is a place to explore the world through data.Silk displays data as beautiful interactive charts, maps and web pages. Create your own free Silk now.

Articles: