In summary, we found structured data within 1.2 billion HTML pages out of the 3.2 billion pages contained in the crawl (38.9%).
These pages originate from 7.4 million different pay-level-domains out of the 26 million pay-level-domains covered by the crawl (28.4%).
Altogether, the extracted data sets consist of 38.7 billion RDF quads.
Instructions on how to download the RDFa, Microdata, Embedded JSON-LD and Microformats data sets are given on the page how to get the data.
Triples Extracted | 24,359,443,316 |
URLs with Triples | 646,409,625 |
Average Triples per URL | 37.68 |
Domains with Triples | 3,743,822 |
Average Triples per Domain | 6,506.57 |
Typed Entities | 4,837,635,224 |
Top Domains by Extracted Triples | Show top domains
- skyrock.com (203,429,342 triples)
- moosejaw.com (147,690,525 triples)
- blogspot.com (144,157,696 triples)
- peternyssen.com (106,756,630 triples)
- untitled-magazine.com (106,468,683 triples)
- justia.com (104,739,832 triples)
- canalblog.com (103,135,044 triples)
- leadferret.com (92,088,387 triples)
- drom.ru (80,352,891 triples)
- tutete.com (74,013,436 triples)
- prom.ua (73,992,919 triples)
- repairpal.com (68,768,136 triples)
- callersmart.com (68,117,826 triples)
- lyst.com (67,716,741 triples)
- kidsroom.de (66,743,895 triples)
- farpost.ru (66,505,726 triples)
- alumnius.net (65,488,619 triples)
- epicsports.com (63,970,298 triples)
- hotels.com (63,050,150 triples)
- More
|
Top Classes | Show top values by domain count
- schema:WebPage (897,418 Domains)
- schema:Product (581,482 Domains)
- schema:Article (495,831 Domains)
- schema:Offer (485,959 Domains)
- data-voc:Breadcrumb (447,687 Domains)
- schema:PostalAddress (418,807 Domains)
- schema:SiteNavigationElement (379,655 Domains)
- schema:WPHeader (357,234 Domains)
- schema:Organization (352,671 Domains)
- schema:WPFooter (325,015 Domains)
- schema:Blog (311,632 Domains)
- https://schema.org/WPHeader (292,857 Domains)
- schema:BlogPosting (292,115 Domains)
- schema:Person (289,233 Domains)
- https://schema.org/SiteNavigationElement (289,032 Domains)
- https://schema.org/WPFooter (274,786 Domains)
- schema:WPSideBar (244,862 Domains)
- https://schema.org/ImageObject (231,322 Domains)
- schema:LocalBusiness (230,844 Domains)
- https://schema.org/Person (225,823 Domains)
|
| Show top values by entity count
- data-voc:Breadcrumb (552,328,195 Entities)
- schema:Product (439,832,806 Entities)
- schema:Offer (390,161,561 Entities)
- schema:ListItem (272,316,081 Entities)
- schema:Person (204,359,065 Entities)
- schema:PostalAddress (172,053,567 Entities)
- schema:Article (153,952,642 Entities)
- schema:ImageObject (148,290,043 Entities)
- schema:Organization (145,420,648 Entities)
- https://schema.org/ImageObject (99,049,597 Entities)
- schema:AggregateRating (90,083,031 Entities)
- schema:SiteNavigationElement (84,639,083 Entities)
- http://data-vocabulary.org/Person (83,700,647 Entities)
- schema:BreadcrumbList (79,251,543 Entities)
- schema:WebPage (78,353,742 Entities)
- schema:LocalBusiness (66,794,473 Entities)
- https://schema.org/Organization (66,715,388 Entities)
- https://schema.org/Person (65,452,952 Entities)
- schema:BlogPosting (64,300,878 Entities)
- schema:Place (52,072,047 Entities)
|
Top Properties | Show top values by domain count
- schema:Product/name (535,625 Domains)
- schema:WebPage/name (483,959 Domains)
- schema:Offer/price (462,444 Domains)
- schema:Product/offers (462,233 Domains)
- http://data-vocabulary.org/Breadcrumb/title (436,166 Domains)
- http://data-vocabulary.org/Breadcrumb/url (435,810 Domains)
- schema:Offer/priceCurrency (430,556 Domains)
- schema:Product/image (419,391 Domains)
- schema:Product/description (377,639 Domains)
- schema:WebPage/url (346,114 Domains)
- schema:PostalAddress/streetAddress (339,502 Domains)
- schema:Offer/availability (337,876 Domains)
- schema:PostalAddress/addressLocality (330,791 Domains)
- schema:PostalAddress/postalCode (296,078 Domains)
- schema:Product/url (263,720 Domains)
- schema:Person/name (260,970 Domains)
- schema:Article/name (239,337 Domains)
- schema:WebPage/description (237,410 Domains)
- schema:Article/articleBody (229,110 Domains)
- schema:PostalAddress/addressRegion (224,952 Domains)
|
| Show top values by entity count
- data-voc:Breadcrumb/title (527,274,573 Entities)
- data-voc:Breadcrumb/url (512,476,423 Entities)
- schema:Product/name (413,844,251 Entities)
- schema:Offer/price (362,253,629 Entities)
- schema:Product/image (331,262,399 Entities)
- schema:Offer/priceCurrency (312,396,647 Entities)
- schema:Product/offers (301,407,538 Entities)
- schema:Product/url (239,902,641 Entities)
- schema:ListItem/name (219,141,234 Entities)
- schema:ListItem/item (212,838,161 Entities)
- schema:ListItem/position (200,923,255 Entities)
- schema:Person/name (171,513,764 Entities)
- schema:Product/description (149,819,815 Entities)
- schema:PostalAddress/addressLocality (143,247,228 Entities)
- schema:Offer/availability (135,352,464 Entities)
- schema:PostalAddress/streetAddress (118,658,970 Entities)
- schema:Organization/name (106,111,502 Entities)
- https://schema.org/ImageObject/url (90,237,246 Entities)
- schema:Person/url (84,267,924 Entities)
- schema:Article/name (84,233,926 Entities)
|
Detailed Statistics as Excel-File |
html-microdata.xlsx (450kb) |
Triples Extracted | 3,623,025,088 |
URLs with Triples | 190,890,906 |
Average Triples per URL | 18.97 |
Domains with Triples | 2,685,738 |
Average Triples per Domain | 1,348.98 |
Typed Entities | 818,557,558 |
Top Domains by Extracted Triples | Show top domains
- maxpreps.com (44,942,447 triples)
- hotels.com (42,918,977 triples)
- tunein.com (37,831,829 triples)
- forumotion.com (32,146,616 triples)
- forumactif.com (26,949,223 triples)
- singleplatform.com (26,540,908 triples)
- apple.com (25,968,142 triples)
- foroactivo.com (24,670,898 triples)
- bibliocommons.com (24,367,820 triples)
- yoo7.com (24,254,737 triples)
- spreadshirt.com (23,758,450 triples)
- bark.com (20,146,758 triples)
- forumactif.org (19,607,460 triples)
- edimdoma.ru (17,939,949 triples)
- allmenus.com (16,525,224 triples)
- agrofoto.pl (14,769,273 triples)
- forumeiros.com (11,760,827 triples)
- huffingtonpost.ca (11,711,039 triples)
- slacker.com (10,995,416 triples)
- More
|
Top Classes | Show top values by domain count
- schema:WebSite (2,573,118 Domains)
- schema:SearchAction (2,223,126 Domains)
- schema:Organization (872,751 Domains)
- schema:Person (221,663 Domains)
- schema:LocalBusiness (154,877 Domains)
- schema:PostalAddress (69,401 Domains)
- schema:Place (45,316 Domains)
- schema:Event (45,249 Domains)
- schema:WebPage (41,206 Domains)
- schema:ListItem (38,670 Domains)
- schema:BreadcrumbList (37,735 Domains)
- schema:ImageObject (35,194 Domains)
- schema:GeoCoordinates (29,133 Domains)
- schema:Offer (27,761 Domains)
- schema:ContactPoint (23,288 Domains)
- schema:BlogPosting (16,640 Domains)
- schema:Article (15,427 Domains)
- schema:Product (12,514 Domains)
- schema:AggregateRating (11,491 Domains)
- schema:NewsArticle (6,899 Domains)
|
| Show top values by entity count
- schema:ListItem (121,949,431 Entities)
- schema:WebSite (116,658,098 Entities)
- schema:SearchAction (99,526,381 Entities)
- schema:Organization (89,001,090 Entities)
- schema:ImageObject (61,136,919 Entities)
- schema:Person (60,880,297 Entities)
- schema:BreadcrumbList (32,173,505 Entities)
- schema:WebPage (23,479,655 Entities)
- schema:NewsArticle (23,313,874 Entities)
- schema:Offer (20,372,479 Entities)
- schema:InteractionCounter (15,683,355 Entities)
- schema:DiscussionForumPosting (14,489,931 Entities)
- schema:Product (14,422,594 Entities)
- schema:PostalAddress (14,290,321 Entities)
- schema:Country (14,256,546 Entities)
- schema:ContactPoint (7,637,556 Entities)
- schema:Place (7,627,670 Entities)
- schema:LocalBusiness (4,501,581 Entities)
- schema:Article (4,307,925 Entities)
- schema:GeoCoordinates (3,689,686 Entities)
|
Top Properties | Show top values by domain count
- schema:url (2,668,659 Domains)
- schema:name (2,604,103 Domains)
- schema:potentialAction (2,224,004 Domains)
- schema:target (2,223,915 Domains)
- schema:query-input (2,223,156 Domains)
- schema:logo (712,637 Domains)
- schema:sameAs (574,771 Domains)
- schema:description (429,511 Domains)
- schema:alternateName (300,869 Domains)
- schema:image (280,817 Domains)
- schema:telephone (230,235 Domains)
- schema:address (220,377 Domains)
- schema:legalName (148,859 Domains)
- schema:email (142,256 Domains)
- schema:openingHours (89,168 Domains)
- schema:addressLocality (76,743 Domains)
- schema:streetAddress (74,170 Domains)
- schema:postalCode (71,167 Domains)
- schema:addressRegion (63,189 Domains)
- schema:addressCountry (54,079 Domains)
|
| Show top values by entity count
- schema:name (456,983,496 Entities)
- schema:url (323,684,932 Entities)
- schema:position (121,986,148 Entities)
- schema:item (117,778,381 Entities)
- schema:target (101,072,466 Entities)
- schema:potentialAction (100,205,117 Entities)
- schema:query-input (99,504,404 Entities)
- schema:logo (78,867,781 Entities)
- schema:image (75,745,389 Entities)
- schema:sameAs (56,784,288 Entities)
- schema:description (56,635,394 Entities)
- schema:author (49,214,290 Entities)
- schema:headline (47,657,634 Entities)
- schema:height (47,638,493 Entities)
- schema:width (47,300,147 Entities)
- schema:datePublished (45,640,113 Entities)
- schema:itemListElement (33,918,761 Entities)
- schema:publisher (33,129,360 Entities)
- schema:dateModified (26,585,016 Entities)
- schema:mainEntityOfPage (26,328,557 Entities)
|
Detailed Statistics as Excel-File |
html-embedded-jsonld.xlsx (75kb) |
Triples Extracted | 8,371,745,745 |
URLs with Triples | 418,095,860 |
Average Triples per URL | 20.02 |
Domains with Triples | 3,645,662 |
Average Triples per Domain | 2,296.35 |
Typed Entities | 3,186,672,022 |
Top Domains by Extracted Triples | Show top domains
- blogspot.com (2,146,058,454 triples)
- wordpress.com (716,755,448 triples)
- theclothdiaperwhisperer.com (258,310,473 triples)
- ezlocal.com (65,359,822 triples)
- blogspot.co.uk (47,463,431 triples)
- blogspot.com.es (42,589,044 triples)
- blogspot.ca (35,569,546 triples)
- blogspot.com.br (26,463,174 triples)
- sonsofstevegarvey.com (24,421,587 triples)
- paginegialle.it (24,215,994 triples)
- politico.eu (21,927,082 triples)
- blogspot.de (21,907,346 triples)
- blogspot.fr (21,778,831 triples)
- hatenablog.com (21,475,791 triples)
- webotopia.org (20,592,125 triples)
- ugent.be (19,943,529 triples)
- blogspot.com.au (19,240,500 triples)
- wikipedia.org (18,495,850 triples)
- wikitravel.org (18,454,061 triples)
- More
|
Top Classes | Show top values by domain count
- vcard2006:Name (2,770,158 Domains)
- vcard2006:VCard (2,768,112 Domains)
- vcard2006:Organization (141,197 Domains)
|
| Show top values by entity count
- vcard2006:Name (1,574,836,479 Entities)
- vcard2006:VCard (1,574,203,281 Entities)
- vcard2006:Organization (37,632,262 Entities)
|
Top Properties | Show top values by domain count
- vcard2006:n (2,770,158 Domains)
- vcard2006:fn (2,338,105 Domains)
- vcard2006:given-name (2,331,514 Domains)
- vcard2006:family-name (2,331,371 Domains)
- vcard2006:url (1,606,418 Domains)
- vcard2006:photo (582,079 Domains)
- vcard2006:adr (168,703 Domains)
- vcard2006:tel (145,163 Domains)
- vcard2006:org (141,197 Domains)
- vcard2006:organization-name (141,197 Domains)
- vcard2006:email (94,949 Domains)
- vcard2006:nickname (18,815 Domains)
- vcard2006:title (15,254 Domains)
- vcard2006:geo (12,806 Domains)
- vcard2006:note (12,762 Domains)
- vcard2006:category (5,689 Domains)
- vcard2006:role (4,771 Domains)
- vcard2006:logo (4,236 Domains)
- vcard2006:workTel (3,381 Domains)
- vcard2006:additional-name (2,462 Domains)
|
| Show top values by entity count
- vcard2006:n (1,574,836,479 Entities)
- vcard2006:fn (851,132,585 Entities)
- vcard2006:given-name (847,024,490 Entities)
- vcard2006:family-name (846,941,996 Entities)
- vcard2006:url (451,466,958 Entities)
- vcard2006:photo (475,438,609 Entities)
- vcard2006:adr (29,692,767 Entities)
- vcard2006:tel (17,373,992 Entities)
- vcard2006:org (37,632,262 Entities)
- vcard2006:organization-name (37,632,262 Entities)
- vcard2006:email (5,389,112 Entities)
- vcard2006:nickname (7,579,150 Entities)
- vcard2006:title (2,316,388 Entities)
- vcard2006:geo (3,345,852 Entities)
- vcard2006:note (2,839,243 Entities)
- vcard2006:category (1,546,671 Entities)
- vcard2006:role (2,201,713 Entities)
- vcard2006:logo (2,012,963 Entities)
- vcard2006:workTel (326,955 Entities)
- vcard2006:additional-name (225,241 Entities)
|
Triples Extracted | 1,629,581,643 |
URLs with Triples | 220,889,867 |
Average Triples per URL | 7.37 |
Domains with Triples | 1,209,430 |
Average Triples per Domain | 1,347.39 |
Typed Entities | 430,349,620 |
Top Domains by Extracted Triples | Show top domains
- skyrock.com (28,917,186 triples)
- blogspot.com (26,395,396 triples)
- adsoftheworld.com (9,792,643 triples)
- epicsports.com (8,774,504 triples)
- nbcnews.com (8,275,957 triples)
- canalblog.com (7,365,254 triples)
- securitysystemsnews.com (7,349,415 triples)
- instinctmagazine.com (5,921,335 triples)
- goo.ne.jp (5,077,131 triples)
- coveleaderpress.com (4,954,094 triples)
- uscfsales.com (4,555,833 triples)
- clashmusic.com (4,275,032 triples)
- blog.cz (4,180,646 triples)
- staples.ca (4,159,633 triples)
- dccomics.com (4,117,598 triples)
- ubuntu-es.org (4,087,243 triples)
- aljazeera.net (3,969,871 triples)
- treysongz.com (3,817,106 triples)
- justhungry.com (3,801,434 triples)
- More
|
---|
Top Classes | Show top values by domain count
- website (365,470 Domains)
- gd:Breadcrumb (287,356 Domains)
- article (105,811 Domains)
- foaf:Document (85,180 Domains)
- foaf:Image (83,193 Domains)
- sioc:Item (65,350 Domains)
- skos:Concept (25,766 Domains)
- sioc:UserAccount (23,739 Domains)
- object (10,374 Domains)
- product (10,241 Domains)
- sioc:Post (8,994 Domains)
- blog (7,397 Domains)
- gd:Review-aggregate (6,535 Domains)
- music.album (5,858 Domains)
- sioc:BlogPost (4,609 Domains)
- sioc:Comment (4,576 Domains)
- gd:Rating (4,081 Domains)
- band (3,508 Domains)
- foaf:Person (2,315 Domains)
- vcard2006:Address (1,595 Domains)
|
| Show top values by entity count
- gd:Breadcrumb (127,159,418 Entities)
- foaf:Image (64,683,855 Entities)
- foaf:Document (18,171,969 Entities)
- sioc:Item (16,642,780 Entities)
- article (14,031,296 Entities)
- skos:Concept (10,779,628 Entities)
- website (9,000,298 Entities)
- sioc:UserAccount (6,885,003 Entities)
- sioc:Post (5,869,152 Entities)
- sioc:Comment (5,231,704 Entities)
- product (2,352,200 Entities)
- gd:Review-aggregate (1,869,050 Entities)
- gd:Rating (1,234,210 Entities)
- city (972,269 Entities)
- https://rdf.data-vocabulary.org/#Breadcrumb (771,497 Entities)
- http://rdf.data-vocabulary.org/Breadcrumb (701,322 Entities)
- http://rdf.data-vocabulary.org/#BreadcrumbBreadcrumb (687,637 Entities)
- sioc:BlogPost (477,094 Entities)
- gr:Offering (457,568 Entities)
- scoop_it:topic (432,660 Entities)
|
Top Properties | Show top values by domain count
- ogp-og:title (432,101 Domains)
- ogp-og:site_name (424,276 Domains)
- ogp-og:url (422,685 Domains)
- gd:title (299,328 Domains)
- gd:url (298,518 Domains)
- ogp-og:description (274,174 Domains)
- ogp-me:title (184,483 Domains)
- ogp-me:description (166,533 Domains)
- ogp-me:image (164,198 Domains)
- ogp-me:url (164,100 Domains)
- ogp-me:type (141,040 Domains)
- ogp-me:site_name (137,540 Domains)
- ogp-og:image (115,468 Domains)
- dc:title (96,768 Domains)
- content:encoded (84,475 Domains)
- gd:child (68,664 Domains)
- ogp-fb:app_id (66,563 Domains)
- fb2008:fbmladmins (54,334 Domains)
- sioc:num_replies (51,762 Domains)
- ogp-me:locale (50,740 Domains)
|
| Show top values by entity count
- gd:title (126,710,211 Entities)
- gd:url (123,588,852 Entities)
- ogp-me:title (52,913,093 Entities)
- fb2008:fbmlapp_id (48,719,817 Entities)
- ogp-me:image (47,387,452 Entities)
- ogp-me:url (43,838,461 Entities)
- ogp-me:type (40,073,980 Entities)
- ogp-me:description (38,063,509 Entities)
- ogp-me:site_name (37,901,917 Entities)
- ogp-og:title (35,923,850 Entities)
- ogp-og:site_name (34,844,480 Entities)
- ogp-og:url (34,626,803 Entities)
- ogp-fb:app_id (28,108,917 Entities)
- ogp-og:image (27,299,248 Entities)
- ogp-og:description (25,585,286 Entities)
- fb2008:fbmladmins (20,465,586 Entities)
- dc:title (20,269,850 Entities)
- content:encoded (15,866,951 Entities)
- rdfs:label (11,074,733 Entities)
- skos-core:prefLabel (10,853,149 Entities)
|
Detailed Statistics as Excel-File |
html-rdfa.xlsx (120kb) |
Triples Extracted | 401,275,671
|
URLs with Triples | 27,320,114
|
Average Triples per URL | 14.68
|
Domains with Triples | 392,035
|
Average Triples per Domain | 1,023.57
|
Typed Entities | 69,259,620
|
Top Domains by Extracted Triples | Show top domains
- wordpress.com (101,301,127 triples)
- nadaguides.com (10,642,716 triples)
- blogspot.com (4,178,278 triples)
- bummyla.com (3,769,194 triples)
- marie-claire.es (3,674,872 triples)
- blogcu.com (3,519,972 triples)
- typepad.com (3,518,473 triples)
- epicurious.com (2,832,260 triples)
- spletnik.ru (2,561,556 triples)
- yahoo.com (2,445,952 triples)
- contactmusic.net (2,398,140 triples)
- after55.com (2,386,756 triples)
- greenbookblog.org (1,722,688 triples)
- soup.io (1,712,258 triples)
- heraldo.es (1,656,278 triples)
- bibliacatolica.com.br (1,635,648 triples)
- katom.com (1,521,408 triples)
- nazioneindiana.com (1,444,880 triples)
- serpadres.es (1,411,698 triples)
- More
|
Top Classes | Show top values by domain count
- foaf:Person (394,442 Domains)
|
| Show top values by entity count
- foaf:Person (69,259,620 Entities)
|
Top Properties | Show top values by domain count
- xfn:mePage (394,445 Domains)
- xfn:me-hyperlink (354,701 Domains)
- xfn:friend (29,696 Domains)
- xfn:friend-hyperlink (29,692 Domains)
- xfn:colleague (19,198 Domains)
- xfn:colleague-hyperlink (19,193 Domains)
- xfn:met (18,537 Domains)
- xfn:met-hyperlink (18,536 Domains)
- xfn:contact (17,492 Domains)
- xfn:contact-hyperlink (17,483 Domains)
- xfn:acquaintance (10,545 Domains)
- xfn:acquaintance-hyperlink (10,542 Domains)
- xfn:co-worker (10,139 Domains)
- xfn:co-worker-hyperlink (10,136 Domains)
- xfn:neighbor (5,111 Domains)
- xfn:neighbor-hyperlink (5,110 Domains)
- xfn:co-resident (3,253 Domains)
- xfn:co-resident-hyperlink (3,252 Domains)
- xfn:spouse (2,281 Domains)
- xfn:spouse-hyperlink (2,280 Domains)
|
| Show top values by entity count
- xfn:mePage (69,250,556 Entities)
- xfn:me-hyperlink (20,628,900 Entities)
- xfn:friend (5,269,479 Entities)
- xfn:friend-hyperlink (5,268,497 Entities)
- xfn:colleague (2,947,290 Entities)
- xfn:colleague-hyperlink (2,946,788 Entities)
- xfn:met (2,849,647 Entities)
- xfn:met-hyperlink (2,849,120 Entities)
- xfn:contact (3,080,063 Entities)
- xfn:contact-hyperlink (3,079,471 Entities)
- xfn:acquaintance (1,999,016 Entities)
- xfn:acquaintance-hyperlink (1,998,889 Entities)
- xfn:co-worker (1,725,704 Entities)
- xfn:co-worker-hyperlink (1,725,005 Entities)
- xfn:neighbor (748,917 Entities)
- xfn:neighbor-hyperlink (748,513 Entities)
- xfn:co-resident (560,434 Entities)
- xfn:co-resident-hyperlink (559,911 Entities)
- xfn:spouse (315,336 Entities)
- xfn:spouse-hyperlink (315,262 Entities)
|
Triples Extracted | 143,728,079
|
URLs with Triples | 17,895,411
|
Average Triples per URL | 8.03
|
Domains with Triples | 192,390
|
Average Triples per Domain | 747
|
Typed Entities | 41,729,827
|
Top Domains by Extracted Triples | Show top domains
- paginegialle.it (15,074,805 triples)
- ticketstogo.com (3,533,858 triples)
- nj.com (3,092,973 triples)
- oregonlive.com (2,750,249 triples)
- wikitravel.org (2,470,463 triples)
- cleveland.com (2,382,548 triples)
- nih.gov (2,257,506 triples)
- telefoonboek.nl (2,123,805 triples)
- nola.com (2,013,342 triples)
- al.com (1,904,320 triples)
- silive.com (1,835,825 triples)
- mlive.com (1,832,116 triples)
- remax.com (1,669,027 triples)
- musiqua.it (1,529,146 triples)
- bonhams.com (1,379,165 triples)
- law.com (1,365,412 triples)
- rostender.info (1,348,983 triples)
- opensecrets.org (1,185,475 triples)
- yellowbot.com (1,177,227 triples)
- pennlive.com (1,145,105 triples)
- More
|
Top Classes | Show top values by domain count
- vcard2006:Address (194,681 Domains)
|
| Show top values by entity count
- vcard2006:Address (41,729,827 Entities)
|
Top Properties | Show top values by domain count
- vcard2006:street-address (159,236 Domains)
- vcard2006:postal-code (140,876 Domains)
- vcard2006:region (121,251 Domains)
- vcard2006:country-name (59,247 Domains)
- vcard2006:extended-address (7,942 Domains)
- vcard2006:addressType (5,677 Domains)
- vcard2006:post-office-box (681 Domains)
|
| Show top values by entity count
- vcard2006:locality (28,084,914 Entities)
- vcard2006:street-address (25,110,636 Entities)
- vcard2006:region (22,810,735 Entities)
- vcard2006:postal-code (21,191,688 Entities)
- vcard2006:country-name (6,836,109 Entities)
- vcard2006:addressType (982,279 Entities)
- vcard2006:extended-address (514,588 Entities)
- vcard2006:post-office-box (120,778 Entities)
|
Triples Extracted | 62,666,600
|
URLs with Triples | 2,343,185
|
Average Triples per URL | 26.74
|
Domains with Triples | 40,257
|
Average Triples per Domain | 1,556.66
|
Typed Entities | 14,251,603
|
Top Domains by Extracted Triples | Show top domains
- rostender.info (3,226,279 triples)
- conventionscene.com (2,036,506 triples)
- wikipedia.org (1,676,069 triples)
- sched.com (1,303,861 triples)
- ticketnetwork.com (1,030,794 triples)
- lasvegastickets.com (956,909 triples)
- uu.se (936,353 triples)
- brucecounty.on.ca (906,662 triples)
- excite.com (858,043 triples)
- gamestub.com (855,659 triples)
- kokucheese.com (808,282 triples)
- ticketsinventory.com (761,460 triples)
- betshoot.com (681,220 triples)
- yesmagazine.org (590,204 triples)
- museum.by (531,391 triples)
- ticketamerica.com (504,119 triples)
- uticacurlingclub.org (494,355 triples)
- very.vn (472,115 triples)
- connpass.com (442,594 triples)
- onlinetickets.com (399,064 triples)
- More
|
Top Classes | Show top values by domain count
- icaltzd:vcalendar (40,523 Domains)
- icaltzd:Vevent (35,007 Domains)
- icaltzd:DomainOf_rrule (22 Domains)
- icaltzd:Vtodo (8 Domains)
- icaltzd:Vjournal (1 Domain)
- icaltzd:Vfreebusy (1 Domain)
|
| Show top values by entity count
- icaltzd:vcalendar (2,378,764 Entities)
- icaltzd:Vevent (11,871,638 Entities)
- icaltzd:DomainOf_rrule (1,044 Entities)
- icaltzd:Vtodo (150 Entities)
- icaltzd:Vjournal (6 Entities)
- icaltzd:Vfreebusy (1 Entities)
|
Top Properties | Show top values by domain count
- icaltzd:component (35,005 Domains)
- icaltzd:summary (28,321 Domains)
- icaltzd:dtstart (25,945 Domains)
- icaltzd:description (20,783 Domains)
- icaltzd:url (17,515 Domains)
- icaltzd:dtend (16,130 Domains)
- icaltzd:location (14,560 Domains)
- icaltzd:categories (1,209 Domains)
- icaltzd:uid (560 Domains)
- icaltzd:organizer (153 Domains)
- icaltzd:calAddress (146 Domains)
- icaltzd:dtstamp (145 Domains)
- icaltzd:status (26 Domains)
- icaltzd:rrule (22 Domains)
- icaltzd:class (12 Domains)
- icaltzd:freq (6 Domains)
|
| Show top values by entity count
- icaltzd:summary (10,136,804 Entities)
- icaltzd:dtstart (8,179,510 Entities)
- icaltzd:url (5,430,866 Entities)
- icaltzd:location (5,244,745 Entities)
- icaltzd:dtend (3,404,612 Entities)
- icaltzd:description (3,215,747 Entities)
- icaltzd:component (2,145,724 Entities)
- icaltzd:categories (726,419 Entities)
- icaltzd:uid (213,086 Entities)
- icaltzd:dtstamp (97,370 Entities)
- icaltzd:organizer (19,006 Entities)
- icaltzd:calAddress (18,652 Entities)
- icaltzd:status (10,299 Entities)
- icaltzd:class (1,401 Entities)
- icaltzd:rrule (1,018 Entities)
- icaltzd:freq (407 Entities)
|
Triples Extracted | 59,026,065
|
URLs with Triples | 3,702,554
|
Average Triples per URL | 15.94
|
Domains with Triples | 27,181
|
Average Triples per Domain | 2,171.59
|
Typed Entities | 10,153,540
|
Top Domains by Extracted Triples | Show top domains
- blogspot.com (3,498,789 triples)
- vrbo.com (2,608,885 triples)
- homeaway.com (1,968,155 triples)
- fewo-direkt.de (1,333,569 triples)
- aluguetemporada.com.br (1,212,765 triples)
- freewebsitereport.org (1,031,784 triples)
- tucsonweekly.com (1,000,764 triples)
- homeaway.co.uk (951,500 triples)
- ayda.ru (936,086 triples)
- 892,000.it (931,011 triples)
- glassdoor.com (903,434 triples)
- listen360.com (778,656 triples)
- homeaway.nl (705,805 triples)
- hotelanacapri.co.uk (696,780 triples)
- otzyvua.net (644,483 triples)
- menupages.com (615,416 triples)
- abritel.fr (565,100 triples)
- caring.com (537,498 triples)
- rakuten.co.jp (498,284 triples)
- More
|
---|
Top Classes | Show top values by domain count
- rev:Review (27,337 Domains)
|
| Show top values by entity count
- rev:Review (10,153,540 Entities)
|
Top Properties | Show top values by domain count
- rev:reviewer (22,348 Domains)
- dcterms:date (21,040 Domains)
- vcard2,006:url (20,657 Domains)
- rev:hasReview (20,656 Domains)
- vcard2,006:fn (15,628 Domains)
- rev:rating (14,753 Domains)
- rev:text (9,363 Domains)
- rev:title (6,847 Domains)
- vcard2,006:photo (6,217 Domains)
- rev:type (2,342 Domains)
|
| Show top values by entity count
- rev:reviewer (8,246,503 Entities)
- dcterms:date (7,835,343 Entities)
- vcard2,006:url (6,391,751 Entities)
- rev:hasReview (6,391,872 Entities)
- vcard2,006:fn (4,910,773 Entities)
- rev:rating (6,332,053 Entities)
- rev:text (5,514,551 Entities)
- rev:title (1,337,858 Entities)
- vcard2,006:photo (1,694,119 Entities)
- rev:type (302,511 Entities)
|
Triples Extracted | 31,778,446
|
URLs with Triples | 314,164
|
Average Triples per URL | 101.15 |
Domains with Triples | 7,162
|
Average Triples per Domain | 4,437.09 |
Typed Entities | 9,148,197
|
Top Domains by Extracted Triples | Show top domains
- icase.it (8,464,917 triples)
- remax.com (2,404,454 triples)
- quoka.de (1,540,710 triples)
- after55.com (1,020,065 triples)
- iberia.com (603,091 triples)
- thomasstaudtverlagflensburg.de (594,079 triples)
- forrentuniversity.com (572,423 triples)
- mambo-717.com (497,440 triples)
- grecavricambi.it (410,060 triples)
- buecherhalle.ch (331,999 triples)
- japanmania-shop.com (273,402 triples)
- laendleanzeiger.at (267,273 triples)
- istantidigioia.com (236,446 triples)
- onhabitat.com (231,882 triples)
- lesbambetises.com (214,625 triples)
- corporatehousing.com (204,454 triples)
- verlags-service-imfeld.ch (191,670 triples)
- artefactum-fineart.com (185,940 triples)
- bytheseasir.com (181,746 triples)
- More
|
Top Classes | Show top values by domain count
- hlisting:Lister (7,164 Domains)
- hlisting:Listing (7,164 Domains)
- hlisting:Item (4,133 Domains)
|
| Show top values by entity count
- hlisting:Lister (3,410,885 Entities)
- hlisting:Listing (3,410,885 Entities)
- hlisting:Item (2,326,427 Entities)
|
Top Properties | Show top values by domain count
- hlisting:lister (7,164 Domains)
- hlisting:price (6,790 Domains)
- hlisting:item (4,133 Domains)
- hlisting:itemPhoto (4,133 Domains)
- hlisting:itemUrl (4,133 Domains)
- hlisting:description (989 Domains)
- hlisting:itemName (178 Domains)
- hlisting:listerUrl (158 Domains)
- hlisting:listerLogo (158 Domains)
- hlisting:listerName (152 Domains)
- hlisting:action (66 Domains)
- hlisting:summary (62 Domains)
- hlisting:listerOrg (49 Domains)
- vcard2006:tel (41 Domains)
- hlisting:dtlisted (39 Domains)
- hlisting:permalink (13 Domains)
- foaf:mbox (9 Domains)
- hlisting:dtexpired (1 Domain)
|
| Show top values by entity count
- hlisting:lister (3,410,885 Entities)
- hlisting:price (3,047,900 Entities)
- hlisting:item (2,326,427 Entities)
- hlisting:itemPhoto (2,326,427 Entities)
- hlisting:itemUrl (2,326,424 Entities)
- hlisting:description (1,430,998 Entities)
- hlisting:summary (934,171 Entities)
- hlisting:dtlisted (685,482 Entities)
- hlisting:itemName (338,076 Entities)
- hlisting:listerUrl (327,716 Entities)
- hlisting:listerLogo (327,716 Entities)
- hlisting:listerName (307,622 Entities)
- hlisting:action (103,267 Entities)
- hlisting:listerOrg (60,265 Entities)
- vcard2006:tel (31,364 Entities)
- hlisting:permalink (1,497 Entities)
- hlisting:dtexpired (75 Entities)
- foaf:mbox (46 Entities)
|
Triples Extracted | 15,228,823
|
URLs with Triples | 543,865
|
Average Triples per URL | 28
|
Domains with Triples | 5,179
|
Average Triples per Domain | 2,940.49
|
Typed Entities | 3,681,013
|
Top Domains by Extracted Triples | Show top domains
- grouprecipes.com (2,929,452 triples)
- blogspot.com (492,078 triples)
- mmenu.com (452,685 triples)
- shipuxiu.com (445,066 triples)
- nyam.ru (438,587 triples)
- happy-giraffe.ru (341,136 triples)
- receptok.ru (278,534 triples)
- astray.com (255,853 triples)
- deepsouthdish.com (250,561 triples)
- seriouseats.com (225,046 triples)
- cookpad.com (219,703 triples)
- sheknows.com (171,016 triples)
- mydailymoment.com (150,399 triples)
- midwestliving.com (105,740 triples)
- webopskrifter.dk (101,565 triples)
- fatsecret.com (89,773 triples)
- hlebopechka.ru (89,724 triples)
- gekonntgekocht.de (79,535 triples)
- nostimada.gr (79,360 triples)
- More
|
---|
Top Classes | Show top values by domain count
- hrecipe:Recipe (5,258 Domains)
- hrecipe:Ingredient (3,039 Domains)
- hrecipe:Duration (1,115 Domains)
- hrecipe:Nutrition (369 Domains)
|
| Show top values by entity count
- hrecipe:Recipe (861,979 Entities)
- hrecipe:Ingredient (2,674,432 Entities)
- hrecipe:Duration (107,082 Entities)
- hrecipe:Nutrition (37,520 Entities)
|
Top Properties | Show top values
- hrecipe:fn (3,658 Domains)
- hrecipe:ingredient (3,039 Domains)
- hrecipe:ingredientName (3,007 Domains)
- hrecipe:photo (2,668 Domains)
- hrecipe:instructions (2,593 Domains)
- hrecipe:tag (2,274 Domains)
- hrecipe:author (1,641 Domains)
- hrecipe:yield (1,603 Domains)
- hrecipe:summary (1,429 Domains)
- hrecipe:duration (1,115 Domains)
- hrecipe:durationTime (1,078 Domains)
- hrecipe:published (1,045 Domains)
- hrecipe:nutrition (369 Domains)
- hrecipe:ingredientQuantity (356 Domains)
- hrecipe:ingredientQuantityType (341 Domains)
- hrecipe:nutritionValue (329 Domains)
- hrecipe:durationTitle (157 Domains)
- hrecipe:nutritionValueType (78 Domains)
|
| Show top values by entity count
- hrecipe:ingredientName (2,661,652 Entities)
- hrecipe:fn (752,605 Entities)
- hrecipe:photo (421,395 Entities)
- hrecipe:ingredient (368,932 Entities)
- hrecipe:ingredientQuantity (343,034 Entities)
- hrecipe:instructions (340,147 Entities)
- hrecipe:ingredientQuantityType (338,790 Entities)
- hrecipe:tag (249,480 Entities)
- hrecipe:author (220,724 Entities)
- hrecipe:summary (192,126 Entities)
- hrecipe:yield (155,730 Entities)
- hrecipe:duration (96,520 Entities)
- hrecipe:durationTime (84,852 Entities)
- hrecipe:published (81,878 Entities)
- hrecipe:nutritionValue (35,800 Entities)
- hrecipe:nutrition (23,712 Entities)
- hrecipe:durationTitle (13,245 Entities)
- hrecipe:nutritionValueType (11,746 Entities)
|
Triples Extracted | 754,815
|
URLs with Triples | 99,301
|
Average Triples per URL | 7.6 |
Domains with Triples | 225
|
Average Triples per Domain | 3,354.73 |
Typed Entities | 307,276
|
Top Domains by Extracted Triples | Show top domains
- wikipedia.org (653,315 triples)
- preen.com (28,100 triples)
- blogspot.com (13,704 triples)
- antwiki.org (12,407 triples)
- hitchhikersgui.de (8,557 triples)
- wikimedia.org (6,034 triples)
- thefullwiki.org (5,905 triples)
- wiktionary.org (5,830 triples)
- wikidoc.org (2,511 triples)
- wikivisually.com (1,915 triples)
- wordpress.com (1,604 triples)
- wikien4.appspot.com (1,231 triples)
- mashpedia.com (1,085 triples)
- insect-collection.com (805 triples)
- marefa.org (554 triples)
- zipcodezoo.com (525 triples)
- like2do.com (438 triples)
- everipedia.org (427 triples)
- portadelaidewiki.org.au (391 triples)
- More
|
---|
Top Classes | Show top values by domain count
- wo:species (228 Domains)
- wo:Kingdom (128 Domains)
- wo:Order (127 Domains)
- wo:Family (124 Domains)
- wo:Genus (124 Domains)
- wo:Species (103 Domains)
- wo:Phylum (96 Domains)
- wo:Class (87 Domains)
|
| Show top values by entity count
- wo:species (117,374 Entities)
- wo:Kingdom (38,920 Entities)
- wo:Order (36,500 Entities)
- wo:Family (33,997 Entities)
- wo:Genus (35,479 Entities)
- wo:Species (8,475 Entities)
- wo:Phylum (28,268 Entities)
- wo:Class (8,263 Entities)
|
Top Properties | Show top values by domain count
- wo:kingdom (128 Domains)
- wo:kingdomName (128 Domains)
- wo:orderName (127 Domains)
- wo:order (127 Domains)
- wo:genusName (124 Domains)
- wo:genus (124 Domains)
- wo:family (124 Domains)
- wo:familyName (123 Domains)
- wo:speciesName (113 Domains)
- wo:scientificName (112 Domains)
- wo:species (103 Domains)
- wo:phylumName (96 Domains)
- wo:phylum (96 Domains)
- wo:className (87 Domains)
- wo:class (87 Domains)
|
| Show top values by entity count
- wo:kingdom (38,921 Entities)
- wo:kingdomName (38,916 Entities)
- wo:orderName (36,486 Entities)
- wo:order (36,501 Entities)
- wo:genusName (35,473 Entities)
- wo:genus (35,480 Entities)
- wo:family (33,999 Entities)
- wo:familyName (33,985 Entities)
- wo:speciesName (23,950 Entities)
- wo:scientificName (52,693 Entities)
- wo:species (8,475 Entities)
- wo:phylumName (28,265 Entities)
- wo:phylum (28,268 Entities)
- wo:className (8,259 Entities)
- wo:class (8,263 Entities)
|