In summary, we found structured data within 620 million HTML pages out of the 2.01 billion pages contained in the crawl (30%).
These pages originate from 2.72 million different pay-level-domains out of the 15.68 million pay-level-domains covered by the crawl (17%).
Altogether, the extracted data sets consist of 20.48 billion RDF quads.
Instructions on how to download the RDFa, Microdata, and Microformats data sets are given on the page how to get the data.
Triples Extracted | 2,566,827,347 |
URLs with Triples | 257,251,367 |
Average Triples per URL | 9.9779 |
Domains with Triples | 571,581 |
Average Triples per Domain | 4490.7499 |
Top Domains by Extracted Triples | Show top domains
- tripadvisor.com (147,170,416 triples)
- imore.com (80,480,808 triples)
- angieslist.com (66,088,657 triples)
- tripadvisor.de (39,444,199 triples)
- tripadvisor.fr (38,784,422 triples)
- tripadvisor.es (36,956,501 triples)
- akihabaranews.com (23,125,201 triples)
- ifood.tv (21,245,027 triples)
- migrationpolicy.org (20,945,825 triples)
- worldcat.org (19,976,355 triples)
- epicsports.com (19,592,111 triples)
- tripadvisor.it (17,894,486 triples)
- daodao.com (17,360,784 triples)
- hotels.com (15,486,832 triples)
- tripadvisor.com.tw (14,290,938 triples)
- tripadvisor.co.kr (14,197,546 triples)
- tripadvisor.ru (13,961,779 triples)
- tripadvisor.com.gr (13,793,696 triples)
- tripadvisor.jp (13,777,394 triples)
- tripadvisor.com.ar (12,606,049 triples)
- More
|
---|
Typed Entities | 405,541,283 |
Top Classes | Show top values by domain count
- og:"website" (164,324 Domains)
- og:"article" (141,679 Domains)
- foaf:Image (53,467 Domains)
- foaf:Document (51,694 Domains)
- gd:Breadcrumb (49,771 Domains)
- sioc:Item (33,019 Domains)
- og:"blog" (27,913 Domains)
- og:"product" (14,592 Domains)
- skos:Concept (12,600 Domains)
- sioc:UserAccount (12,217 Domains)
- og:"summit" (7,000 Domains)
- gd:Review-aggregate (5,945 Domains)
- sioc:Post (4,642 Domains)
- gd:Rating (4,331 Domains)
- og:"object" (3,133 Domains)
- og:"activity" (2,757 Domains)
- og:"company" (2,644 Domains)
- sioctypes:Comment (2,438 Domains)
- sioctypes:BlogPost (2,431 Domains)
- http://www.w3.org/1999/xhtml (2,351 Domains)
|
| Show top values by entity count
- foaf:Image (143,818,149 Entities)
- og:"article" (65,233,945 Entities)
- gd:Breadcrumb (56,755,178 Entities)
- foaf:Document (35,991,377 Entities)
- sioc:Item (34,880,432 Entities)
- skos:Concept (26,315,007 Entities)
- og:"website" (23,429,568 Entities)
- sioc:Post (19,457,818 Entities)
- sioc:Comment (18,946,600 Entities)
- gd:Review-aggregate (14,970,496 Entities)
- sioc:UserAccount (14,846,680 Entities)
- gd:Organization (14,046,230 Entities)
- og:"product" (9,955,913 Entities)
- schema:PostalAddress (9,181,289 Entities)
- schema:LocalBusiness (7,769,007 Entities)
- og:"blog" (5,156,708 Entities)
- http://www.w3.org/1999/xhtml (4,282,491 Entities)
- schema:Recipe (4,119,881 Entities)
- schema.Intangible (2,138,398 Entities)
- schema:Place (2,032,075 Entities)
|
Top Properties | Show top values by domain count
- ogp-org:title (204,604 Domains)
- ogp-org:url (202,630 Domains)
- ogp-org:site_name (196,656 Domains)
- ogp-org:type (194,241 Domains)
- ogp-org:image (165,003 Domains)
- ogp-org:description (129,809 Domains)
- ogp:title (128,223 Domains)
- ogp:url (112,837 Domains)
- ogp:description (107,326 Domains)
- ogp:type (106,713 Domains)
- ogp:image (106,368 Domains)
- ogp:site_name (101,896 Domains)
- fb2008:admins (65,037 Domains)
- dc:title (62,082 Domains)
- fb2008:app_id (59,225 Domains)
- content:encoded (50,825 Domains)
- gd:title (48,990 Domains)
- gd:url (48,448 Domains)
- ogp:app_id (37,971 Domains)
- ogp-org:locale (34,561 Domains)
|
| Show top values by entity count
- fb2008:app_id (83,642,515 Entities)
- ogp:title (73,524,261 Entities)
- ogp:image (61,715,495 Entities)
- ogp-org:image (59,328,753 Entities)
- ogp:url (59,101,010 Entities)
- ogp:type (58,644,840 Entities)
- ogp:site_name (54,553,506 Entities)
- gd:url (51,879,155 Entities)
- gd:title (51,872,293 Entities)
- ogp-org:title (50,094,779 Entities)
- ogp:description (49,488,773 Entities)
- ogp-org:site_name (46,532,089 Entities)
- ogp-org:url (46,347,784 Entities)
- ogp-org:type (45,519,271 Entities)
- ogp-org:description (37,687,398 Entities)
- fb2008:admins (37,049,049 Entities)
- ogp:app_id (33,716,157 Entities)
- gd:street-address (27,009,375 Entities)
- gd:locality (23,530,888 Entities)
- gd:postal-code (22,261,093 Entities)
|
Detailed Statistics as Excel-File |
html-rdfa.xlsx (127kb) |
Triples Extracted | 9,438,536,906 |
URLs with Triples | 292,601,824 |
Average Triples per URL | 101.9260 |
Domains with Triples | 819,990 |
Average Triples per Domain | 11,510.55123 |
Top Domains by Extracted Triples | Show top domains
- fotolia.com (207,446,001 triples)
- crateandbarrel.com (134,394,300 triples)
- aliexpress.com (110,547,794 triples)
- flightaware.com (108,128,834 triples)
- competitivecyclist.com (95,003,603 triples)
- snagajob.com (90,229,714 triples)
- coupons.com (88,005,164 triples)
- repairpal.com (86,750,952 triples)
- ebay.com.au (86,716,239 triples)
- bentgate.com (86,387,523 triples)
- meetup.com (78,164,005 triples)
- ebay.co.uk (75,003,377 triples)
- ebay.com (74,373,316 triples)
- tripadvisor.com (73,704,575 triples)
- backcountry.com (70,918,922 triples)
- dreamstime.com (60,841,770 triples)
- ebay.ca (58,248,051 triples)
- indeed.com (55,107,060 triples)
- maxstudio.com (54,472,590 triples)
- shutterstock.com (52,417,965 triples)
- More
|
Typed Entities | 2,209,497,281 |
Top Classes | Show top values by domain count
- schema:WebPage (148,893 Domains)
- schema:Blog (110,663 Domains)
- schema:PostalAddress (101,086 Domains)
- schema:Product (89,608 Domains)
- schema:Article (88,700 Domains)
- schema:Thing (80,139 Domains)
- datavoc:Breadcrumb (76,894 Domains)
- schema:BlogPosting (65,397 Domains)
- schema:Offer (62,849 Domains)
- schema:LocalBusiness (62,191 Domains)
- schema:Organization (52,733 Domains)
- schema:AggregateRating (50,510 Domains)
- schema:Person (47,936 Domains)
- schema:ImageObject (25,573 Domains)
- schema:Review (20,124 Domains)
- datavoc:Product (16,003 Domains)
- datavoc:Review-aggregate (14,094 Domains)
- schema:Rating (12,187 Domains)
- datavoc:Offer (11,640 Domains)
- datavoc:Organization (10,649 Domains)
|
| Show top values by entity count
- schema:Product (288,082,823 Entities)
- datavoc:Breadcrumb (269,088,458 Entities)
- schema:Offer (236,952,507 Entities)
- schema:Person (115,375,710 Entities)
- schema:Organization (101,768,743 Entities)
- schema:AggregateRating (59,069,748 Entities)
- schema:Article (54,972,301 Entities)
- schema:WebPage (51,757,077 Entities)
- datavoc:Person (49,580,005 Entities)
- schema:PostalAddress (48,804,397 Entities)
- schema:Review (42,561,245 Entities)
- schema:Rating (39,170,723 Entities)
- schema:ImageObject (35,356,426 Entities)
- schema:Place (29,710,151 Entities)
- schema:Airport (26764420 Entities)
- schema:NewsArticle (23,864,388 Entities)
- schema:JobPosting (22,804,280 Entities)
- schema:Thing (22,238,453 Entities)
- schema:LocalBusiness (20,194,229 Entities)
- schema:MediaObject (18,027,168 Entities)
|
Top Properties | Show top values by domain count
- http://www.w3.org/1999/xhtml/microdata#item (819,964 Domains)
- dc:title (788,956 Domains)
- schema:PostalAddress/streetAddress (93,562 Domains)
- schema:PostalAddress/addressLocality (93,558 Domains)
- schema:PostalAddress/postalCode (81,902 Domains)
- schema:PostalAddress/addressRegion (81,010 Domains)
- schema:Thing/name (79,651 Domains)
- schema:Product/name (78,213 Domains)
- schema:Thing/url (77,776 Domains)
- data-voc:Breadcrumb/title (73,974 Domains)
- data-voc:Breadcrumb/url (72,183 Domains)
- schema:Thing/image (66,092 Domains)
- schema:Thing/thumbnailUrl (65,232 Domains)
- schema:Article/name (63,760 Domains)
- schema:Blog/name (62,163 Domains)
- schema:Product/image (59,441 Domains)
- schema:Offer/price (59,373 Domains)
- schema:Product/description (58,167 Domains)
- schema:Product/offers (57,638 Domains)
- schema:LocalBusiness/name (51,006 Domains)
|
| Show top values by entity count
- http://www.w3.org/1999/xhtml/microdata#item (292,594,057 Entities)
- dc:title (276,857,043 Entities)
- datavoc:Breadcrumb/title (259,007,896 Entities)
- datavoc:Breadcrumb/url (247,987,539 Entities)
- schema:Product/name (216,079,570 Entities)
- schema:Offer/price (185,219,528 Entities)
- schema:Product/offers (165,541,708 Entities)
- schema:Product/image (151,994,492 Entities)
- schema:Product/url (110,801,142 Entities)
- schema:Person/name (106,620,700 Entities)
- schema:Offer/priceCurrency (74,472,724 Entities)
- schema:Organization/name (62,176,608 Entities)
- schema:Product/description (57,811,011 Entities)
- schema:Organization/url (57,720,350 Entities)
- schema:Person/url (57,399,320 Entities)
- schema:Person/image (55,589,204 Entities)
- schema:Offer/name (55,353,245 Entities)
- schema:Offer/itemOffered (53,750,353 Entities)
- schema:Offer/availability (51,172,614 Entities)
- schema:AggregateRating/ratingValue (49,111,762 Entities)
|
Detailed Statistics as Excel-File |
html-microdata.xlsx (221kb) |
Triples Extracted | 169,557,078 |
URLs with Triples | 3,496,061 |
Average Triples per URL | 48.4995 |
Domains with Triples | 24,208 |
Average Triples per Domain | 7,004.1754 |
Top Domains by Extracted Triples | Show top domains
- razorgator.com (27,661,991 triples)
- ticketstogo.com (19,788,750 triples)
- ticketsnow.com (10,271,149 triples)
- ents24.com (9,612,044 triples)
- wikipedia.org (5,409,886 triples)
- alc.edu (4,469,242 triples)
- conventionscene.com (3,733,979 triples)
- ticketsinventory.com (3,650,266 triples)
- jerseyvillelibrary.org (3,487,507 triples)
- abctickets.com (3,355,361 triples)
- mountainx.com (2,776,627 triples)
- ticketnetwork.com (2,737,928 triples)
- gamestub.com (2,665,084 triples)
- eventbrite.com (2,650,712 triples)
- sabrespace.com (2,087,943 triples)
- colibraries.org (1,688,667 triples)
- redmountainresort.com (1,655,231 triples)
- ticketliquidator.com (1,618,111 triples)
- mainlinehealth.org (1,585,592 triples)
- haverford.edu (1,475,984 triples)
- More
|
Typed Entities | 34,595,069 |
Top Classes | Show top values by domain count
- icaltzd:vcalendar (24,208 Domains)
- icaltzd:Vevent (22,114 Domains)
- icaltzd:DomainOf_rrule (21 Domains)
- icaltzd:Vtodo (7 Domains)
- icaltzd:Vjournal (1 Domains)
|
| Show top values by entity count
- icaltzd:Vevent (31,086,401 Entities)
- icaltzd:vcalendar (4,194,971 Entities)
- icaltzd:DomainOf_rrule (1,838 Entities)
- icaltzd:Vtodo (207 Entities)
- icaltzd:Vjournal (1 Entities)
|
Top Properties | Show top values by domain count
- icaltzd:component (22,108 Domains)
- icaltzd:summary (18,954 Domains)
- icaltzd:dtstart (17,983 Domains)
- icaltzd:dtend (13,062 Domains)
- icaltzd:location (9,341 Domains)
- icaltzd:url (8,720 Domains)
- icaltzd:description (8,553 Domains)
- icaltzd:uid (1,057 Domains)
- icaltzd:categories (701 Domains)
- icaltzd:dtstamp (107 Domains)
- icaltzd:organizer (66 Domains)
- icaltzd:calAddress (66 Domains)
- icaltzd:status (37 Domains)
- icaltzd:rrule (21 Domains)
- icaltzd:class (12 Domains)
- icaltzd:freq (3 Domains)
- icaltzd:transp (1 Domains)
|
| Show top values by entity count
- icaltzd:summary (28,588,970 Entities)
- icaltzd:dtstart (26,470,220 Entities)
- icaltzd:location (19,444,652 Entities)
- icaltzd:url (16,351,764 Entities)
- icaltzd:dtend (6,228,917 Entities)
- icaltzd:description (4,798,803 Entities)
- icaltzd:component (3,190,967 Entities)
- icaltzd:categories (633,223 Entities)
- icaltzd:dtstamp (152,810 Entities)
- icaltzd:uid (146,747 Entities)
- icaltzd:status (26,252 Entities)
- icaltzd:calAddress (10,561 Entities)
- icaltzd:organizer (6,456 Entities)
- icaltzd:rrule (1,590 Entities)
- icaltzd:class (1,150 Entities)
- icaltzd:freq (11 Entities)
- icaltzd:transp (6 Entities)
|
Triples Extracted | 3,850,290,103 |
URLs with Triples | 101,606,009 |
Average Triples per URL | 37.8943 |
Domains with Triples | 1,095,517 |
Average Triples per Domain | 3,514.5873 |
Top Domains by Extracted Triples | Show top domains
- blogspot.com (599,115,969 triples)
- wordpress.com (105,364,921 triples)
- sonsofstevegarvey.com (59,832,726 triples)
- theclothdiaperwhisperer.com (54,929,447 triples)
- razorgator.com (46,042,134 triples)
- wikipedia.org (42,234,223 triples)
- ticketstogo.com (39,165,033 triples)
- mlblogs.com (35,780,505 triples)
- couriernews.com (30,371,037 triples)
- wikitravel.org (23,678,304 triples)
- bonhams.com (22,554,547 triples)
- jsonline.com (22,418,021 triples)
- ticketsnow.com (18,762,728 triples)
- themanchesterenterprise.com (18,629,660 triples)
- autosport.com (18,378,840 triples)
- justanswer.com (17,675,492 triples)
- sincerelyjules.com (17,457,079 triples)
- cricutholiday.com (16,993,943 triples)
- consumerist.com (16,847,664 triples)
- techpb.com (16,452,121 triples)
- More
|
Typed Entities | 1,349,620,300 |
Top Classes | Show top values by domain count
- vcard2006:Name (1,095,512 Domains)
- vcard2006:VCard (1,094,934 Domains)
- vcard2006:Organization (140,755 Domains)
|
| Show top values by entity count
- vcard2006:Name (669,563,772 Entities)
- vcard2006:VCard (669,498,942 Entities)
- vcard2006:Organization (74,096,090 Entities)
|
Top Properties | Show top values by domain count
- vcard2006:n (1,095,515 Domains)
- vcard2006:fn (906,654 Domains)
- vcard2006:given-name (901,683 Domains)
- vcard2006:family-name (901,559 Domains)
- vcard2006:url (642,221 Domains)
- vcard2006:photo (248,537 Domains)
- vcard2006:adr (162,557 Domains)
- vcard2006:organization-name (140,755 Domains)
- vcard2006:org (140,755 Domains)
- vcard2006:tel (129,441 Domains)
- vcard2006:email (64,702 Domains)
- vcard2006:geo (24,768 Domains)
- vcard2006:nickname (10,539 Domains)
- vcard2006:title (6,591 Domains)
- vcard2006:note (4,547 Domains)
- vcard2006:logo (3,129 Domains)
- vcard2006:additional-name (2,638 Domains)
- vcard2006:additional-name (2,317 Domains)
- vcard2006:category (2,196 Domains)
- vcard2006:role (2,113 Domains)
|
| Show top values by entity count
- vcard2006:n (605,959,590 Entities)
- vcard2006:given-name (360,284,620 Entities)
- vcard2006:family-name (360,281,075 Entities)
- vcard2006:fn (330,553,000 Entities)
- vcard2006:photo (255,471,743 Entities)
- vcard2006:url (130,623,783 Entities)
- vcard2006:organization-name (74,095,934 Entities)
- vcard2006:org (67,648,953 Entities)
- vcard2006:adr (36,640,688 Entities)
- vcard2006:tel (19,124,997 Entities)
- vcard2006:nickname (8,130,988 Entities)
- vcard2006:title (5,527,043 Entities)
- vcard2006:email (4,969,550 Entities)
- vcard2006:geo (3,878,733 Entities)
- vcard2006:note (2,117,011 Entities)
- vcard2006:logo (1,603,941 Entities)
- vcard2006:category (807,677 Entities)
- vcard2006:role (756,671 Entities)
- vcard2006:bday (445,140 Entities)
- vcard2006:workTel (426,887 Entities)
|
Triples Extracted | 18,838,183 |
URLs with Triples | 202,889 |
Average Triples per URL | 92.8497 |
Domains with Triples | 3,167 |
Average Triples per Domain | 5,948.2738 |
Top Domains by Extracted Triples | Show top domains- gumtree.com (31,420,916 triples)
- remax.com (9,563,875 triples)
- forrent.com (3,741,207 triples)
- sothebysrealty.com (1,905,693 triples)
- iberia.com (1,207,149 triples)
- quoka.de (552,227 triples)
- fyndtorget.se (94,873 triples)
- forrentuniversity.com (66,198 triples)
- sapo.pt (53,064 triples)
- immobilmente.com (49,763 triples)
- boston.com (49,531 triples)
- viprutv.com (37,278 triples)
- remax-ni.net (32,308 triples)
- mitsubishiwaukesha.com (27,800 triples)
- agenteimovel.com.br (26,388 triples)
- chryslerofmadison.com (26,200 triples)
- tonkingreen.com (25,425 triples)
- ocregister.com (24,573 triples)
- domain.com.au (22,962 triples)
- camionsupermarket.it (22,164 triples)
- More
|
Typed Entities | 4,473,631 |
Top Classes | Show top values by domain count
- hlisting:Lister (3,167 Domains)
- hlisting:Listing (3,167 Domains)
- hlisting:Item (2,419 Domains)
|
| Show top values by entity count
- hlisting:Lister (1,681,869 Entities)
- hlisting:Listing (1,681,869 Entities)
- hlisting:Item (1,153,487 Entities)
|
Top Properties | Show top values by domain count
- hlisting:lister (3,167 Domains)
- hlisting:price (2,824 Domains)
- hlisting:item (2,419 Domains)
- hlisting:itemUrl (2,419 Domains)
- hlisting:itemPhoto (2,419 Domains)
- hlisting:description (2,293 Domains)
- hlisting:summary (686 Domains)
- hlisting:itemName (386 Domains)
- hlisting:action (384 Domains)
- hlisting:listerLogo (308 Domains)
- hlisting:listerUrl (301 Domains)
- hlisting:listerName (288 Domains)
- hlisting:listerOrg (246 Domains)
- vcard2006:tel (193 Domains)
- hlisting:dtlisted (191 Domains)
- hlisting:permalink (21 Domains)
- foaf:mbox (11 Domains)
- hlisting:dtexpired (5 Domains)
|
| Show top values by entity count
- hlisting:lister (1,638,275 Entities)
- hlisting:description (1,189,417 Entities)
- hlisting:itemPhoto (1,153,487 Entities)
- hlisting:itemUrl (1,153,487 Entities)
- hlisting:item (1,118,623 Entities)
- hlisting:itemName (1,015,132 Entities)
- hlisting:price (904,221 Entities)
- hlisting:listerLogo (786,712 Entities)
- hlisting:listerUrl (785,957 Entities)
- hlisting:action (533,193 Entities)
- vcard2006:tel (504,711 Entities)
- hlisting:listerName (342,630 Entities)
- hlisting:summary (188,087 Entities)
- hlisting:listerOrg (82,848 Entities)
- hlisting:dtlisted (19,130 Entities)
- hlisting:dtexpired (1,327 Entities)
- foaf:mbox (241 Entities)
- hlisting:permalink (241 Entities)
|
Triples Extracted | 24,756,234 |
URLs with Triples | 630,402 |
Average Triples per URL | 39.2706 |
Domains with Triples | 3,476 |
Average Triples per Domain | 7,122.0466 |
Top Domains by Extracted Triples | Show top domains
- grouprecipes.com (2,951,767 triples)
- seriouseats.com (1,632,945 triples)
- delish.com (1,578,357 triples)
- bakespace.com (1,462,040 triples)
- landolakes.com (1,319,922 triples)
- chefkoch.de (1,257,352 triples)
- cookingchanneltv.com (1,041,434 triples)
- closetcooking.com (1,002,235 triples)
- goodhousekeeping.com (698,451 triples)
- deepsouthdish.com (571,984 triples)
- sheknows.com (554,701 triples)
- womansday.com (446,855 triples)
- yankeemagazine.com (442,500 triples)
- cooks.com (393,866 triples)
- deliaonline.com (380,910 triples)
- wholeliving.com (305,654 triples)
- rachaelraymag.com (278,264 triples)
- drinksmixer.com (259,677 triples)
- bbc.co.uk (258,769 triples)
- recipe.com (237,620 triples)
- More
|
---|
Typed Entities | 5,781,217 |
Top Classes | Show top values by domain count
- sindice:hrecipe/Recipe (3,476 Domains)
- sindice:hrecipe/Ingredient (2,480 Domains)
- sindice:hrecipe/Duration (939 Domains)
- sindice:hrecipe/Nutrition (373 Domains)
|
| Show top values by entity count
- hrecipe:Ingredient (5,353,209 Entities)
- hrecipe:Recipe (770,996 Entities)
- hrecipe:Duration (296,053 Entities)
- hrecipe:Nutrition (134,981 Entities)
|
Top Properties | Show top values
- sindice:hrecipe/fn (2,999 Domains)
- sindice:hrecipe/ingredient (2,480 Domains)
- sindice:hrecipe/ingredientName (2,446 Domains)
- sindice:hrecipe/photo (2,209 Domains)
- sindice:hrecipe/instructions (2,205 Domains)
|
| Show top values by entity count
- hrecipe:ingredientName (4,707,778 Entities)
- hrecipe:fn (642,880 Entities)
- hrecipe:ingredient (550,530 Entities)
- hrecipe:instructions (535,669 Entities)
- hrecipe:photo (462,484 Entities)
|
Triples Extracted | 69,802,632 |
URLs with Triples | 2,496,303 |
Average Triples per URL | 27.9624 |
Domains with Triples | 13,772 |
Average Triples per Domain | 5,068.4455 |
Top Domains by Extracted Triples | Show top domains
- orbitz.com (8,324,144 triples)
- homeaway.com (6,214,128 triples)
- qvc.com (3,935,076 triples)
- indeed.com (3,321,370 triples)
- glassdoor.com (3,250,601 triples)
- kiddicare.com (2,129,328 triples)
- vacationrentals.com (2,071,928 triples)
- hotelanacapri.co.uk (1,748,250 triples)
- everydayhealth.com (1,542,060 triples)
- cherrybankguesthouse.com (1,503,530 triples)
- menupages.com (1,456,864 triples)
- foulsykefarmhouse.co.uk (1,380,190 triples)
- carolinaguesthouse.co.uk (1,375,080 triples)
- homeaway.co.uk (1,270,260 triples)
- theboatsideinn.com (1,093,680 triples)
- easytobook.com (961,242 triples)
- gatewayhotel.co.uk (959,980 triples)
- blenheimedge.co.uk (935,200 triples)
- trailspace.com (916,580 triples)
- goodreads.com (911,711 triples)
- More
|
---|
Typed Entities | 16,186,868 |
Top Classes | Show top values by domain count- rev:Review (13,772 Domains)
|
| Show top values by entity count- rev:Review (9,994,204 Entities)
|
Top Properties | Show top values by domain count
- rev:reviewer (10,831 Domains)
- rev:hasReview (10,179 Domains)
- vcard2006:url (10,178 Domains)
- rev:rating (8,867 Domains)
- dc:date (8,863 Domains)
- vcard2006:fn (8,137 Domains)
- rev:text (7,723 Domains)
- rev:title (6,631 Domains)
- rev:type (2,962 Domains)
- vcard2006:photo (893 Domains)
|
| Show top values by entity count
- rev:hasReview (7,841,574 Entities)
- vcard2006:url (7,841,338 Entities)
- rev:reviewer (6,685,018 Entities)
- dc:date (6,092,294 Entities)
- rev:rating (5,330,727 Entities)
- vcard2006:fn (5,266,865 Entities)
- rev:text (4,934,961 Entities)
- rev:title (4,921,890 Entities)
- rev:type (878,443 Entities)
- vcard2006:photo (696,994 Entities)
|
Triples Extracted | 653,111 |
URLs with Triples | 31,444 |
Average Triples per URL | 20.7706 |
Domains with Triples | 96 |
Average Triples per Domain | 6,803.2396 |
Top Domains by Extracted Triples | Show top domains
- wikipedia.org (538,555 triples)
- blekko.com (76,624 triples)
- preen.com (8,184 triples)
- thefullwiki.org (4,967 triples)
- oiseaux.net (4,035 triples)
- blogspot.com (3,866 triples)
- wiktionary.org (3,610 triples)
- mashpedia.com (1,717 triples)
- schools-wikipedia.org (1,449 triples)
- wikidoc.org (1,255 triples)
- wikia.com (1,121 triples)
- eol.org (1,053 triples)
- sensagent.com (892 triples)
- digplanet.com (874 triples)
- bbc.co.uk (810 triples)
- birdsguides.com (295 triples)
- marefa.org (247 triples)
- wordpress.com (230 triples)
- doleaf.com (220 triples)
- tfode.com (201 triples)
- More
|
---|
Typed Entities | 218,463 |
Top Classes | Show top values by domain count
- wo:Species (96 Domains)
- wo:Genus (66 Domains)
- wo:Family (65 Domains)
- wo:Order (61 Domains)
- wo:Kingdom (60 Domains)
- wo:Phylum (52 Domains)
- wo:species (50 Domains)
- wo:Class (42 Domains)
|
| Show top values by entity count
- wo:species (40,365 Entities)
- wo:Genus (31,796 Entities)
- wo:Species (30,807 Entities)
- wo:Order (30,188 Entities)
- wo:Family (30,157 Entities)
- wo:Kingdom (29,775 Entities)
- wo:Class (20,743 Entities)
- wo:Phylum (19,780 Entities)
|
Top Properties | Show top values by domain count
- wo:genusName (66 Domains)
- wo:genus (66 Domains)
- wo:family (65 Domains)
- wo:familyName (63 Domains)
- wo:order (61 Domains)
- wo:orderName (61 Domains)
- wo:kingdomName (60 Domains)
- wo:kingdom (60 Domains)
- wo:speciesName (58 Domains)
- wo:phylumName (52 Domains)
- wo:phylum (52 Domains)
- wo:scientificName (50 Domains)
- wo:species (50 Domains)
- wo:class (42 Domains)
- wo:className (42 Domains)
|
| Show top values by entity count
- wo:speciesName (31,272 Entities)
- wo:genus (29,734 Entities)
- wo:genusName (29,729 Entities)
- wo:species (28,762 Entities)
- wo:scientificName (28,634 Entities)
- wo:order (28,247 Entities)
- wo:orderName (28,245 Entities)
- wo:family (28,201 Entities)
- wo:familyName (27,958 Entities)
- wo:kingdom (27,853 Entities)
- wo:kingdomName (27,850 Entities)
- wo:className (19,332 Entities)
- wo:class (19,332 Entities)
- wo:phylumName (18,466 Entities)
- wo:phylum (18,466 Entities)
|
Triples Extracted | 219,772,447 |
URLs with Triples | 17,032,646 |
Average Triples per URL | 12.9030 |
Domains with Triples | 170,202 |
Average Triples per Domain | 1,291.2448 |
Top Domains by Extracted Triples | Show top domains
- wordpress.com (9,917,394 triples)
- nadaguides.com (8,381,998 triples)
- stackexchange.com (8,086,058 triples)
- grouprecipes.com (7,180,024 triples)
- cato.org (6,014,852 triples)
- hendrickmotorsports.com (5,249,138 triples)
- dreamincode.net (4,900,464 triples)
- forrent.com (4,463,072 triples)
- packersproshop.com (4,377,404 triples)
- 2modern.com (4,125,012 triples)
- vimeo.com (3,858,614 triples)
- bergdorfgoodman.com (3,781,256 triples)
- knfilters.com (2,921,854 triples)
- temptalia.com (2,617,350 triples)
- contactmusic.com (2,607,192 triples)
- idakoos.com (2,603,344 triples)
- unvlog.com (2,526,378 triples)
- rockfordsquire.com (2,315,450 triples)
- fonts.com (2,233,710 triples)
- thinkadvisor.com (1,918,048 triples)
- More
|
---|
Typed Entities | 52,187,702 |
Top Classes | Show top values by domain count- foaf:Person (170,200 Domains)
|
| Show top values by entity count- foaf:Person (35,237,361 Entities)
|
Top Properties | Show top values by domain count
- xfn:mePage (170,201 Domains)
- xfn:me-hyperlink (127,465 Domains)
- xfn:friend (34,324 Domains)
- xfn:friend-hyperlink (34,318 Domains)
- xfn:colleague (22,561 Domains)
- xfn:colleague-hyperlink (22,555 Domains)
- xfn:met (20,212 Domains)
- xfn:met-hyperlink (20,210 Domains)
- xfn:contact (18,144 Domains)
- xfn:contact-hyperlink (18,133 Domains)
- xfn:acquaintance (12,024 Domains)
- xfn:acquaintance-hyperlink (12,021 Domains)
- xfn:co-worker (11,094 Domains)
- xfn:co-worker-hyperlink (11,089 Domains)
- xfn:neighbor (5,381 Domains)
- xfn:neighbor-hyperlink (5,378 Domains)
- xfn:co-resident (3,765 Domains)
- xfn:co-resident-hyperlink (3,764 Domains)
- xfn:spouse (2,543 Domains)
- xfn:spouse-hyperlink (2,542 Domains)
|
| Show top values by entity count
- xfn:mePage (35,222,343 Entities)
- xfn:me-hyperlink (13,613,623 Entities)
- xfn:contact (2,131,831 Entities)
- xfn:contact-hyperlink (2,098,429 Entities)
- xfn:friend (1,504,651 Entities)
- xfn:friend-hyperlink (1,452,823 Entities)
- xfn:colleague (999,641 Entities)
- xfn:colleague-hyperlink (958,474 Entities)
- xfn:met (696,221 Entities)
- xfn:met-hyperlink (674,660 Entities)
- xfn:acquaintance (580,857 Entities)
- xfn:acquaintance-hyperlink (562,704 Entities)
- xfn:co-worker (433,217 Entities)
- xfn:co-worker-hyperlink (417,516 Entities)
- xfn:neighbor (222,931 Entities)
- xfn:neighbor-hyperlink (217,696 Entities)
- xfn:co-resident (130,138 Entities)
- xfn:co-resident-hyperlink (126,901 Entities)
- xfn:spouse (66,466 Entities)
- xfn:muse (65,019 Entities)
|