In summary, we found structured data within 0.9 billion HTML pages out of the 2.5 billion pages contained in the
crawl (37.1%).
These pages originate from 9.6 million different pay-level-domains out of the 32.8 million pay-level-domains covered
by the crawl (29.3%).
Altogether, the extracted data sets consist of 31.5 billion RDF quads.
Instructions on how to download the RDFa, Microdata, Embedded JSON-LD and Microformats data sets are given on the
page "How to get the data".
Triples Extracted | 20,318,437,064 |
URLs with Triples | 556,088,266 |
Average Triples per URL | 36.54 |
Domains with Triples | 5,183,199 |
Average Triples per Domain | 3,920.06 |
Typed Entities | 3,927,363,100 |
Top Domains by Extracted Triples |
Show top domains
- google.com (269,910,845 triples)
- bezformata.com (109,596,352 triples)
- wordpress.com (90,787,700 triples)
- skyrock.com (84,332,965 triples)
- canalblog.com (78,025,636 triples)
- shopstyle.com (76,303,978 triples)
- elpais.com (63,716,129 triples)
- parenting.com (54,381,871 triples)
- makaan.com (50,473,517 triples)
- more.com (45,921,022 triples)
- homehardware.ca (44,098,027 triples)
- gigmasters.com (39,829,871 triples)
- blogspot.com (39,388,805 triples)
- partycity.com (37,657,448 triples)
- foxsports.com (35,802,769 triples)
- lyst.com (35,375,420 triples)
- untitled-magazine.com (32,593,224 triples)
- colombia.com (28,731,481 triples)
- kidsroom.de (25,774,825 triples)
- kp.ru (25,069,699 triples)
|
Top Classes | Show top values by domain count
- schema:WebPage (1,124,583 Domains)
- schema:Product (812,205 Domains)
- schema:Offer (676,899 Domains)
- data-voc:Breadcrumb (621,344 Domains)
- schema:Article (612,361 Domains)
- schema:Organization (510,069 Domains)
- schema:PostalAddress (502,615 Domains)
- https://schema.org/WPHeader (490,296 Domains)
- https://schema.org/SiteNavigationElement (483,721 Domains)
- schema:SiteNavigationElement (468,096 Domains)
- https://schema.org/WPFooter (461,624 Domains)
- schema:WPHeader (443,043 Domains)
- https://schema.org/ImageObject (414,427 Domains)
- schema:WPFooter (394,652 Domains)
- https://schema.org/Person (365,670 Domains)
- schema:ImageObject (360,875 Domains)
- https://schema.org/WebPage (357,621 Domains)
- https://schema.org/CreativeWork (350,773 Domains)
- schema:BreadcrumbList (344,538 Domains)
- schema:ListItem (338,845 Domains)
|
| Show top values by entity count
- schema:WebPage (69,511,072 Entities)
- schema:Product (306,531,345 Entities)
- schema:Offer (260,577,205 Entities)
- data-voc:Breadcrumb (369,938,577 Entities)
- schema:Article (130,309,999 Entities)
- schema:Organization (97,958,878 Entities)
- schema:PostalAddress (89,524,321 Entities)
- https://schema.org/WPHeader (22,471,336 Entities)
- https://schema.org/SiteNavigationElement (33,931,502 Entities)
- schema:SiteNavigationElement (82,126,829 Entities)
- https://schema.org/WPFooter (20,087,235 Entities)
- schema:WPHeader (34,489,977 Entities)
- https://schema.org/ImageObject (127,258,860 Entities)
- schema:WPFooter (29,348,540 Entities)
- https://schema.org/Person (68,677,793 Entities)
- schema:ImageObject (225,565,381 Entities)
- https://schema.org/WebPage (18,459,109 Entities)
- https://schema.org/CreativeWork (31,028,396 Entities)
- schema:BreadcrumbList (80,202,453 Entities)
- schema:ListItem (273,229,987 Entities)
|
Top Properties | Show top values by domain count
- http://www.w3.org/1999/xhtml/microdata#item (5,207,847 Domains)
- dcterms:title (4,866,092 Domains)
- schema:Product/name (754,812 Domains)
- schema:WebPage/name (646,378 Domains)
- schema:Product/offers (645,994 Domains)
- schema:Offer/price (639,598 Domains)
- data-voc:Breadcrumb/url (608,657 Domains)
- data-voc:Breadcrumb/title (607,491 Domains)
- schema:Offer/priceCurrency (606,990 Domains)
- schema:Product/image (573,614 Domains)
- schema:Product/description (520,307 Domains)
- schema:WebPage/url (498,213 Domains)
- schema:Offer/availability (477,170 Domains)
- schema:WebPage/description (404,022 Domains)
- schema:PostalAddress/streetAddress (389,237 Domains)
- schema:PostalAddress/addressLocality (375,645 Domains)
- schema:Product/url (364,889 Domains)
- https://schema.org/CreativeWork/text (345,901 Domains)
- https://schema.org/Person/name (335,355 Domains)
- schema:PostalAddress/postalCode (332,581 Domains)
|
| Show top values by entity count
- http://www.w3.org/1999/xhtml/microdata#item (569,617,163 Entities)
- dcterms:title (510,717,808 Entities)
- schema:Product/name (286,638,774 Entities)
- schema:WebPage/name (21,025,919 Entities)
- schema:Product/offers (202,812,439 Entities)
- schema:Offer/price (234,541,798 Entities)
- data-voc:Breadcrumb/url (343,079,808 Entities)
- data-voc:Breadcrumb/title (353,919,393 Entities)
- schema:Offer/priceCurrency (206,642,640 Entities)
- schema:Product/image (223,653,117 Entities)
- schema:Product/description (141,629,873 Entities)
- schema:WebPage/url (13,700,791 Entities)
- schema:Offer/availability (114,434,875 Entities)
- schema:WebPage/description (9,801,867 Entities)
- schema:PostalAddress/streetAddress (61,753,617 Entities)
- schema:PostalAddress/addressLocality (72,884,751 Entities)
- schema:Product/url (143,569,073 Entities)
- https://schema.org/CreativeWork/text (26,704,121 Entities)
- https://schema.org/Person/name (66,443,716 Entities)
- schema:PostalAddress/postalCode (47,104,294 Entities)
|
Detailed Statistics as Excel-File | html-microdata.xlsx (335kb) |
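The two averages reported in the statistics block above (the one that links to html-microdata.xlsx) follow from the three counts listed next to them. Here is a minimal sketch of that arithmetic. The function name `average_per` is only illustrative, and rounding to two decimals is an assumption about how the published values were produced.

```python
def average_per(total_triples: int, units: int) -> float:
    """Return triples per unit (URL or domain), rounded to two decimals."""
    return round(total_triples / units, 2)


# Counts copied from the statistics block above.
triples = 20_318_437_064
urls_with_triples = 556_088_266
domains_with_triples = 5_183_199

triples_per_url = average_per(triples, urls_with_triples)
triples_per_domain = average_per(triples, domains_with_triples)
print(triples_per_url, triples_per_domain)
```

The same division can be applied to the counts of any of the other statistics blocks on this page.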
Triples Extracted | 4,159,835,616 |
URLs with Triples | 194,648,550 |
Average Triples per URL | 21.37 |
Domains with Triples | 3,835,046 |
Average Triples per Domain | 1,084.69 |
Typed Entities | 925,744,293 |
Top Domains by Extracted Triples |
Show top domains
- tunein.com (79,598,015 triples)
- maxpreps.com (60,705,334 triples)
- apple.com (42,180,120 triples)
- foroactivo.com (37,791,164 triples)
- yoo7.com (30,871,915 triples)
- forumotion.com (30,412,735 triples)
- forumactif.com (26,731,635 triples)
- bimmershops.com (19,714,215 triples)
- 5.ua (19,548,189 triples)
- shutterstock.com (17,832,975 triples)
- hotels.com (17,272,497 triples)
- singleplatform.com (17,118,077 triples)
- nbc.com (15,910,802 triples)
- forumactif.org (15,900,384 triples)
- allmenus.com (14,576,788 triples)
- aplaceformom.com (13,337,925 triples)
- forumeiros.com (13,192,246 triples)
- imdb.com (12,807,126 triples)
- si.com (12,409,736 triples)
- kp.ru (11,926,289 triples)
|
Top Classes | Show top values by domain count
- schema:WebSite (3,519,466 Domains)
- schema:SearchAction (2,988,042 Domains)
- schema:Organization (1,349,775 Domains)
- schema:Person (335,784 Domains)
- schema:LocalBusiness (249,017 Domains)
- schema:ListItem (209,207 Domains)
- schema:BreadcrumbList (205,971 Domains)
- schema:PostalAddress (178,500 Domains)
- schema:WebPage (121,393 Domains)
- schema:ImageObject (111,946 Domains)
- schema:GeoCoordinates (71,894 Domains)
- schema:Place (66,396 Domains)
- schema:Event (63,605 Domains)
- schema:Offer (57,756 Domains)
- schema:Article (57,082 Domains)
- schema:ContactPoint (51,296 Domains)
- schema:BlogPosting (43,243 Domains)
- schema:Product (40,169 Domains)
- schema:AggregateRating (23,105 Domains)
- schema:OpeningHoursSpecification (20,734 Domains)
|
| Show top values by entity count
- schema:WebSite (64,463,601 Entities)
- schema:SearchAction (41,739,092 Entities)
- schema:Organization (116,482,765 Entities)
- schema:Person (68,112,352 Entities)
- schema:LocalBusiness (6,741,534 Entities)
- schema:ListItem (178,891,863 Entities)
- schema:BreadcrumbList (51,617,517 Entities)
- schema:PostalAddress (20,005,146 Entities)
- schema:WebPage (39,444,927 Entities)
- schema:ImageObject (70,795,830 Entities)
- schema:GeoCoordinates (6,493,652 Entities)
- schema:Place (9,671,088 Entities)
- schema:Event (2,661,806 Entities)
- schema:Offer (22,449,301 Entities)
- schema:Article (6,634,064 Entities)
- schema:ContactPoint (10,471,249 Entities)
- schema:BlogPosting (4,975,144 Entities)
- schema:Product (20,614,006 Entities)
- schema:AggregateRating (4,518,522 Entities)
- schema:OpeningHoursSpecification (4,831,267 Entities)
|
Top Properties | Show top values by domain count
- schema:url (3,780,120 Domains)
- schema:name (3,752,802 Domains)
- schema:potentialAction (2,990,173 Domains)
- schema:target (2,989,799 Domains)
- schema:query-input (2,988,051 Domains)
- schema:logo (1,123,010 Domains)
- schema:sameAs (871,908 Domains)
- schema:description (709,531 Domains)
- schema:image (512,757 Domains)
- schema:telephone (400,696 Domains)
- schema:address (387,811 Domains)
- schema:email (250,740 Domains)
- schema:legalName (218,468 Domains)
- schema:itemListElement (210,330 Domains)
- schema:position (209,503 Domains)
- schema:item (208,038 Domains)
- schema:addressLocality (173,532 Domains)
- schema:openingHours (168,995 Domains)
- schema:streetAddress (164,059 Domains)
- schema:postalCode (162,570 Domains)
|
| Show top values by entity count
- schema:url (327,544,696 Entities)
- schema:name (523,762,931 Entities)
- schema:potentialAction (42,848,337 Entities)
- schema:target (42,724,538 Entities)
- schema:query-input (41,714,110 Entities)
- schema:logo (103,866,876 Entities)
- schema:sameAs (78,568,203 Entities)
- schema:description (78,575,751 Entities)
- schema:image (92,933,656 Entities)
- schema:telephone (26,054,372 Entities)
- schema:address (28,110,917 Entities)
- schema:email (10,518,404 Entities)
- schema:legalName (6,246,478 Entities)
- schema:itemListElement (54,056,202 Entities)
- schema:position (185,452,756 Entities)
- schema:item (169,530,476 Entities)
- schema:addressLocality (18,526,682 Entities)
- schema:openingHours (4,619,789 Entities)
- schema:streetAddress (17,423,774 Entities)
- schema:postalCode (15,945,901 Entities)
|
Detailed Statistics as Excel-File | html-embedded-jsonld.xlsx (103kb) |
Triples Extracted | 5,374,115,412 |
URLs with Triples | 248,130,845 |
Average Triples per URL | 21.66 |
Domains with Triples | 3,399,902 |
Average Triples per Domain | 1,580.67 |
Typed Entities | 1,793,926,360 |
Top Domains by Extracted Triples |
Show top domains
- blogspot.com (393,484,107 triples)
- wordpress.com (202,608,687 triples)
- ezlocal.com (85,394,673 triples)
- countrytvofficial.co.uk (25,735,406 triples)
- kamidougaadult.com (24,954,236 triples)
- nctbe.com (18,760,758 triples)
- ericah.se (17,876,985 triples)
- 1vh.de (16,227,812 triples)
- equivalencytheorem.info (15,763,157 triples)
- wikipedia.org (14,649,156 triples)
- nanofilosofia.ru (14,605,888 triples)
- rudn.ru (13,672,084 triples)
- urbanverse.net (12,990,744 triples)
- adrianribao.es (12,381,453 triples)
- ugent.be (12,282,948 triples)
- hatenablog.com (11,853,508 triples)
- webberslaw.com (11,293,404 triples)
- jukens.com (11,069,918 triples)
- appitite.org.uk (10,487,693 triples)
- azarask.in (8,925,797 triples)
|
Top Classes | Show top values by domain count
- vcard2006:Name (3,413,117 Domains)
- vcard2006:VCard (3,410,799 Domains)
- vcard2006:Organization (140,707 Domains)
|
| Show top values by entity count
- vcard2006:Name (883,978,288 Entities)
- vcard2006:VCard (883,798,324 Entities)
- vcard2006:Organization (26,149,748 Entities)
|
Top Properties | Show top values by domain count
- vcard2006:n (3,413,117 Domains)
- vcard2006:fn (2,871,802 Domains)
- vcard2006:given-name (2,864,402 Domains)
- vcard2006:family-name (2,864,239 Domains)
- vcard2006:url (1,915,225 Domains)
- vcard2006:photo (645,406 Domains)
- vcard2006:adr (174,611 Domains)
- vcard2006:tel (148,779 Domains)
- vcard2006:org (140,707 Domains)
- vcard2006:organization-name (140,707 Domains)
- vcard2006:email (93,382 Domains)
- vcard2006:nickname (15,474 Domains)
- vcard2006:title (15,045 Domains)
- vcard2006:note (14,067 Domains)
- vcard2006:geo (12,738 Domains)
- vcard2006:category (6,369 Domains)
- vcard2006:role (5,196 Domains)
- vcard2006:logo (5,048 Domains)
- vcard2006:workTel (3,634 Domains)
- vcard2006:fax (2,143 Domains)
|
| Show top values by entity count
- vcard2006:n (883,978,288 Entities)
- vcard2006:fn (668,845,054 Entities)
- vcard2006:given-name (666,235,816 Entities)
- vcard2006:family-name (666,199,414 Entities)
- vcard2006:url (403,074,193 Entities)
- vcard2006:photo (227,566,475 Entities)
- vcard2006:adr (17,425,793 Entities)
- vcard2006:tel (11,290,199 Entities)
- vcard2006:org (26,149,748 Entities)
- vcard2006:organization-name (26,149,748 Entities)
- vcard2006:email (4,525,303 Entities)
- vcard2006:nickname (6,182,556 Entities)
- vcard2006:title (1,446,951 Entities)
- vcard2006:note (1,906,846 Entities)
- vcard2006:geo (2,808,198 Entities)
- vcard2006:category (1,757,936 Entities)
- vcard2006:role (1,317,899 Entities)
- vcard2006:logo (1,001,487 Entities)
- vcard2006:workTel (194,723 Entities)
- vcard2006:fax (294,396 Entities)
|
Triples Extracted | 1,207,733,576 |
URLs with Triples | 154,260,198 |
Average Triples per URL | 7.83 |
Domains with Triples | 1,382,497 |
Average Triples per Domain | 873.59 |
Typed Entities | 338,787,245 |
Top Domains by Extracted Triples |
Show top domains
- skyrock.com (12,652,689 triples)
- extra.com.co (9,638,446 triples)
- canalblog.com (6,924,971 triples)
- blogspot.com (5,983,477 triples)
- openjurist.org (5,341,304 triples)
- nbcnews.com (4,520,489 triples)
- blog.cz (4,085,621 triples)
- maayboli.com (3,919,085 triples)
- kleushka.ru (3,793,737 triples)
- loc.gov (3,269,949 triples)
- radio.com (3,257,036 triples)
- goo.ne.jp (3,254,415 triples)
- menaiset.fi (3,202,269 triples)
- aljazeera.net (3,166,987 triples)
- telegraph.co.uk (2,787,526 triples)
- opensooq.com (2,710,747 triples)
- justhungry.com (2,688,620 triples)
- worldcat.org (2,587,220 triples)
- nhaccuatui.com (2,542,111 triples)
- threadless.com (2,489,374 triples)
|
Top Classes | Show top values by domain count
- http://rdf.data-vocabulary.org/#Breadcrumb (304,278 Domains)
- website (89,940 Domains)
- article (87,425 Domains)
- foaf:Document (74,298 Domains)
- foaf:Image (72,100 Domains)
- http://rdfs.org/sioc/ns#Item (60,464 Domains)
- http://www.w3.org/2004/02/skos/core#Concept (24,302 Domains)
- http://www.w3.org/1999/xhtml (20,516 Domains)
- product (20,013 Domains)
- http://rdfs.org/sioc/ns#UserAccount (19,952 Domains)
- blog (9,824 Domains)
- http://rdfs.org/sioc/ns#Post (7,933 Domains)
- http://rdf.data-vocabulary.org/#Review-aggregate (6,474 Domains)
- object (6,137 Domains)
- https://rdf.data-vocabulary.org/#Breadcrumb (4,371 Domains)
- http://rdfs.org/sioc/types#Comment (4,064 Domains)
- http://rdfs.org/sioc/types#BlogPost (3,896 Domains)
- http://rdf.data-vocabulary.org/#Rating (3,754 Domains)
- foaf:Person (2,068 Domains)
- profile (1,781 Domains)
|
| Show top values by entity count
- http://rdf.data-vocabulary.org/#Breadcrumb (101,264,815 Entities)
- website (6,450,222 Entities)
- article (15,218,991 Entities)
- foaf:Document (13,752,361 Entities)
- foaf:Image (39,476,019 Entities)
- http://rdfs.org/sioc/ns#Item (12,668,263 Entities)
- http://www.w3.org/2004/02/skos/core#Concept (10,250,861 Entities)
- http://www.w3.org/1999/xhtml (13,381,925 Entities)
- product (2,371,099 Entities)
- http://rdfs.org/sioc/ns#UserAccount (4,901,502 Entities)
- blog (568,531 Entities)
- http://rdfs.org/sioc/ns#Post (4,428,118 Entities)
- http://rdf.data-vocabulary.org/#Review-aggregate (1,096,258 Entities)
- object (331,299 Entities)
- https://rdf.data-vocabulary.org/#Breadcrumb (2,295,192 Entities)
- http://rdfs.org/sioc/types#Comment (3,887,808 Entities)
- http://rdfs.org/sioc/types#BlogPost (379,174 Entities)
- http://rdf.data-vocabulary.org/#Rating (735,605 Entities)
- foaf:Person (98,430 Entities)
- profile (136,642 Entities)
|
Top Properties | Show top values by domain count
- http://opengraphprotocol.org/schema/title (554,830 Domains)
- http://opengraphprotocol.org/schema/url (552,065 Domains)
- http://opengraphprotocol.org/schema/type (551,993 Domains)
- http://opengraphprotocol.org/schema/site_name (548,495 Domains)
- http://opengraphprotocol.org/schema/description (413,087 Domains)
- http://rdf.data-vocabulary.org/#title (320,682 Domains)
- http://rdf.data-vocabulary.org/#url (319,955 Domains)
- http://ogp.me/ns#title (194,214 Domains)
- http://ogp.me/ns#description (176,018 Domains)
- http://ogp.me/ns#url (172,895 Domains)
- http://ogp.me/ns#image (167,180 Domains)
- http://ogp.me/ns#site_name (158,908 Domains)
- http://opengraphprotocol.org/schema/image (93,702 Domains)
- dcterms:title (84,191 Domains)
- http://purl.org/rss/1.0/modules/content/encoded (73,484 Domains)
- http://ogp.me/ns/fb#app_id (62,852 Domains)
- http://opengraphprotocol.org/schema/latitude (60,541 Domains)
- http://opengraphprotocol.org/schema/longitude (60,541 Domains)
- http://rdf.data-vocabulary.org/#child (50,463 Domains)
- http://www.facebook.com/2008/fbmladmins (50,453 Domains)
|
| Show top values by entity count
- http://opengraphprotocol.org/schema/title (29,320,625 Entities)
- http://opengraphprotocol.org/schema/url (28,571,407 Entities)
- http://opengraphprotocol.org/schema/type (28,045,310 Entities)
- http://opengraphprotocol.org/schema/site_name (27,897,575 Entities)
- http://opengraphprotocol.org/schema/description (23,513,899 Entities)
- http://rdf.data-vocabulary.org/#title (104,977,818 Entities)
- http://rdf.data-vocabulary.org/#url (98,137,251 Entities)
- http://ogp.me/ns#title (32,404,894 Entities)
- http://ogp.me/ns#description (25,721,520 Entities)
- http://ogp.me/ns#url (29,453,627 Entities)
- http://ogp.me/ns#image (29,219,503 Entities)
- http://ogp.me/ns#site_name (26,916,467 Entities)
- http://opengraphprotocol.org/schema/image (20,048,459 Entities)
- dcterms:title (15,064,504 Entities)
- http://purl.org/rss/1.0/modules/content/encoded (12,592,077 Entities)
- http://ogp.me/ns/fb#app_id (18,672,140 Entities)
- http://opengraphprotocol.org/schema/latitude (1,153,812 Entities)
- http://opengraphprotocol.org/schema/longitude (1,153,808 Entities)
- http://rdf.data-vocabulary.org/#child (5,991,311 Entities)
- http://www.facebook.com/2008/fbmladmins (9,738,258 Entities)
|
Detailed Statistics as Excel-File | html-rdfa.xlsx (594kb) |
Triples Extracted | 263,545,886 |
URLs with Triples | 19,292,875 |
Average Triples per URL | 13.66 |
Domains with Triples | 390,343 |
Average Triples per Domain | 675.16 |
Typed Entities | 44,797,779 |
Top Domains by Extracted Triples |
Show top domains
- wordpress.com (24,955,762 triples)
- bummyla.com (5,693,618 triples)
- dribbble.com (2,837,326 triples)
- newsrimini.it (2,574,698 triples)
- iwedplanner.com (1,761,140 triples)
- heraldo.es (1,643,188 triples)
- apollosolaris.com (1,507,438 triples)
- blogcu.com (1,269,858 triples)
- vesselfinder.com (1,145,376 triples)
- typepad.com (1,127,062 triples)
- knfilters.com (1,046,130 triples)
- semensperms.com (1,029,050 triples)
- yahoo.com (982,798 triples)
- soup.io (907,654 triples)
- blogspot.com (892,590 triples)
- marie-claire.es (875,298 triples)
- casetify.com (836,234 triples)
- vbox7.com (786,828 triples)
- cinemaschool.by (758,022 triples)
- vogue.ua (753,984 triples)
|
Top Classes | Show top values by domain count
- foaf:Person (392,721 Domains)
|
| Show top values by entity count
- foaf:Person (44,797,779 Entities)
|
Top Properties | Show top values by domain count
- xfn:mePage (392,725 Domains)
- xfn:me-hyperlink (355,040 Domains)
- xfn:friend (27,053 Domains)
- xfn:friend-hyperlink (27,050 Domains)
- xfn:met-hyperlink (16,636 Domains)
- xfn:met (16,636 Domains)
- xfn:colleague (15,967 Domains)
- xfn:colleague-hyperlink (15,962 Domains)
- xfn:contact (15,434 Domains)
- xfn:contact-hyperlink (15,416 Domains)
- xfn:co-worker (9,541 Domains)
- xfn:co-worker-hyperlink (9,538 Domains)
- xfn:acquaintance (8,943 Domains)
- xfn:acquaintance-hyperlink (8,940 Domains)
- xfn:neighbor (5,140 Domains)
- xfn:neighbor-hyperlink (5,137 Domains)
- xfn:co-resident (2,779 Domains)
- xfn:co-resident-hyperlink (2,778 Domains)
- xfn:muse (2,022 Domains)
|
| Show top values by entity count
- xfn:mePage (44,787,074 Entities)
- xfn:me-hyperlink (15,485,043 Entities)
- xfn:friend (2,907,797 Entities)
- xfn:friend-hyperlink (2,907,460 Entities)
- xfn:met-hyperlink (1,605,634 Entities)
- xfn:met (1,605,717 Entities)
- xfn:colleague (1,663,153 Entities)
- xfn:colleague-hyperlink (1,662,764 Entities)
- xfn:contact (1,748,106 Entities)
- xfn:contact-hyperlink (1,744,248 Entities)
- xfn:co-worker (978,358 Entities)
- xfn:co-worker-hyperlink (978,068 Entities)
- xfn:acquaintance (1,095,560 Entities)
- xfn:acquaintance-hyperlink (1,094,923 Entities)
- xfn:neighbor (445,802 Entities)
- xfn:neighbor-hyperlink (445,733 Entities)
- xfn:co-resident (319,691 Entities)
- xfn:co-resident-hyperlink (319,606 Entities)
- xfn:muse (284,189 Entities)
|
Triples Extracted | 90,430,790 |
URLs with Triples | 12,627,048 |
Average Triples per URL | 7.16 |
Domains with Triples | 209,086 |
Average Triples per Domain | 432.5 |
Typed Entities | 28,993,043 |
Top Domains by Extracted Triples |
Show top domains
- yellowpages.com (4,815,230 triples)
- edmunds.com (2,710,191 triples)
- nih.gov (2,275,224 triples)
- us-business.info (1,860,758 triples)
- musiqua.it (1,627,662 triples)
- kudzu.com (1,164,053 triples)
- telefoonboek.nl (1,145,796 triples)
- paginegialle.it (969,536 triples)
- branchen-info.net (833,468 triples)
- stadtbranche.de (724,338 triples)
- yellowbot.com (698,224 triples)
- opensecrets.org (662,580 triples)
- storage.com (652,000 triples)
- rostender.info (592,536 triples)
- deltanewsweb.com (568,196 triples)
- ladresse.com (551,457 triples)
- weblancer.net (539,908 triples)
- cityvoter.com (507,644 triples)
- yelloyello.com (487,740 triples)
- wikipedia.org (465,666 triples)
|
Top Classes | Show top values by domain count
- vcard2006:Address (210,172 Domains)
|
| Show top values by entity count
- vcard2006:Address (28,993,043 Entities)
|
Top Properties | Show top values by domain count
- vcard2006:locality (171,969 Domains)
- vcard2006:street-address (168,330 Domains)
- vcard2006:postal-code (145,632 Domains)
- vcard2006:region (128,245 Domains)
- vcard2006:country-name (48,322 Domains)
- vcard2006:extended-address (9,055 Domains)
- vcard2006:addressType (5,950 Domains)
- vcard2006:post-office-box (662 Domains)
|
| Show top values by entity count
- vcard2006:locality (17,551,071 Entities)
- vcard2006:street-address (16,182,698 Entities)
- vcard2006:postal-code (11,996,350 Entities)
- vcard2006:region (11,584,841 Entities)
- vcard2006:country-name (5,087,677 Entities)
- vcard2006:extended-address (358,173 Entities)
- vcard2006:addressType (433,111 Entities)
- vcard2006:post-office-box (66,010 Entities)
|
Triples Extracted | 56,583,642 |
URLs with Triples | 1,940,859 |
Average Triples per URL | 29.15 |
Domains with Triples | 44,238 |
Average Triples per Domain | 1,279.07 |
Typed Entities | 12,727,005 |
Top Domains by Extracted Triples |
Show top domains
- deltanewsweb.com (9,548,866 triples)
- wikipedia.org (2,158,035 triples)
- rostender.info (1,511,055 triples)
- museum.by (1,267,202 triples)
- eventfinda.co.nz (1,074,595 triples)
- connpass.com (802,090 triples)
- popula.de (789,895 triples)
- uu.se (763,762 triples)
- brucecounty.on.ca (627,835 triples)
- wikimedia.org (525,600 triples)
- gamestub.com (452,686 triples)
- sortir32.fr (448,748 triples)
- lasvegastickets.com (397,022 triples)
- ticketcity.com (396,766 triples)
- amazingregistry.com (355,924 triples)
- sfisd.net (354,468 triples)
- findticketsnow.com (330,445 triples)
- scometix.com (326,058 triples)
- actickets.com (323,750 triples)
- justatickets.com (314,474 triples)
|
Top Classes | Show top values by domain count
- icaltzd:vcalendar (44,556 Domains)
- icaltzd:Vevent (38,333 Domains)
- icaltzd:DomainOf_rrule (13 Domains)
- icaltzd:Vtodo (4 Domains)
- icaltzd:Vjournal (3 Domains)
|
| Show top values by entity count
- icaltzd:vcalendar (1,957,811 Entities)
- icaltzd:Vevent (10,768,418 Entities)
- icaltzd:DomainOf_rrule (709 Entities)
- icaltzd:Vtodo (44 Entities)
- icaltzd:Vjournal (23 Entities)
|
Top Properties | Show top values by domain count
- icaltzd:component (38,326 Domains)
- icaltzd:summary (31,645 Domains)
- icaltzd:dtstart (30,058 Domains)
- icaltzd:description (24,043 Domains)
- icaltzd:url (19,221 Domains)
- icaltzd:location (16,569 Domains)
- icaltzd:dtend (16,400 Domains)
- icaltzd:categories (1,102 Domains)
- icaltzd:uid (592 Domains)
- icaltzd:organizer (187 Domains)
- icaltzd:calAddress (181 Domains)
- icaltzd:dtstamp (158 Domains)
- icaltzd:status (24 Domains)
- icaltzd:rrule (13 Domains)
- icaltzd:class (13 Domains)
- icaltzd:freq (2 Domains)
|
| Show top values by entity count
- icaltzd:component (1,752,970 Entities)
- icaltzd:summary (9,389,950 Entities)
- icaltzd:dtstart (7,817,927 Entities)
- icaltzd:description (3,443,184 Entities)
- icaltzd:url (4,281,676 Entities)
- icaltzd:location (3,773,879 Entities)
- icaltzd:dtend (3,781,503 Entities)
- icaltzd:categories (454,237 Entities)
- icaltzd:uid (205,404 Entities)
- icaltzd:organizer (11,452 Entities)
- icaltzd:calAddress (11,122 Entities)
- icaltzd:dtstamp (68,217 Entities)
- icaltzd:status (11,264 Entities)
- icaltzd:rrule (685 Entities)
- icaltzd:class (2,797 Entities)
- icaltzd:freq (7 Entities)
|
Triples Extracted | 38,699,354 |
URLs with Triples | 2,699,433 |
Average Triples per URL | 14.34 |
Domains with Triples | 32,408 |
Average Triples per Domain | 1,194.13 |
Typed Entities | 6,643,239 |
Top Domains by Extracted Triples |
Show top domains
- blogspot.com (1,255,388 triples)
- freelancehunt.com (718,095 triples)
- atlaspiv.cz (678,987 triples)
- rakuten.co.jp (602,423 triples)
- moselka.ru (440,207 triples)
- edmunds.com (394,755 triples)
- glassdoor.com (388,786 triples)
- cantorion.org (363,887 triples)
- which.co.uk (340,948 triples)
- bookdirectrooms.com (335,268 triples)
- izum.ua (250,275 triples)
- oyster.com (232,986 triples)
- thegreenhousegroupinc.com (231,240 triples)
- keiziban-jp.com (217,912 triples)
- pcmag.com (207,044 triples)
- moneymunch.com (205,174 triples)
- kabulpress.org (199,656 triples)
- texnologosgeoponos.gr (198,392 triples)
- checkatrade.com (180,333 triples)
- lefaso.net (174,491 triples)
|
Top Classes | Show top values by domain count
- rev:Review (32,585 Domains)
|
| Show top values by entity count
- rev:Review (6,643,239 Entities)
|
Top Properties | Show top values by domain count
- rev:reviewer (26,987 Domains)
- vcard2006:url (25,342 Domains)
- rev:hasReview (25,339 Domains)
- dcterms:date (25,251 Domains)
- vcard2006:fn (19,028 Domains)
- rev:rating (17,749 Domains)
- rev:text (9,913 Domains)
- vcard2006:photo (8,557 Domains)
- rev:title (6,145 Domains)
- rev:type (2,206 Domains)
|
| Show top values by entity count
- rev:reviewer (4,807,066 Entities)
- vcard2006:url (4,994,035 Entities)
- rev:hasReview (4,994,117 Entities)
- dcterms:date (4,855,433 Entities)
- vcard2006:fn (3,980,391 Entities)
- rev:rating (3,449,367 Entities)
- rev:text (2,385,558 Entities)
- vcard2006:photo (1,730,642 Entities)
- rev:title (997,779 Entities)
- rev:type (233,599 Entities)
|
Triples Extracted | 27,250,871 |
URLs with Triples | 252,161 |
Average Triples per URL | 108.07 |
Domains with Triples | 9,397 |
Average Triples per Domain | 2,899.95 |
Typed Entities | 8,352,956 |
Top Domains by Extracted Triples |
Show top domains
- grecavricambi.it (1,015,704 triples)
- lesbambetises.com (561,613 triples)
- icase.it (445,925 triples)
- remax.com (370,147 triples)
- tafelfarben.de (324,002 triples)
- quoka.de (315,915 triples)
- lindathiele.de (297,110 triples)
- japanmania-shop.com (278,042 triples)
- underdog-fanzine.de (264,068 triples)
- verlag-heilbronn.de (263,056 triples)
- sunnyfengshui.com (259,809 triples)
- paradizo.com (252,671 triples)
- corporatehousing.com (241,846 triples)
- frauenwoerth.de (238,110 triples)
- mambo-717.com (232,570 triples)
- hommonokenkoclub.com (227,000 triples)
- dielendesign.de (222,310 triples)
- after55.com (218,117 triples)
- jimdo.com (210,349 triples)
- onhabitat.com (192,859 triples)
|
Top Classes | Show top values by domain count
- hlisting:Lister (9,402 Domains)
- hlisting:Listing (9,402 Domains)
- hlisting:Item (6,639 Domains)
|
| Show top values by entity count
- hlisting:Lister (2,951,794 Entities)
- hlisting:Listing (2,951,794 Entities)
- hlisting:Item (2,449,368 Entities)
|
Top Properties | Show top values by domain count
- hlisting:lister (9,402 Domains)
- hlisting:price (8,679 Domains)
- hlisting:item (6,639 Domains)
- hlisting:itemPhoto (6,639 Domains)
- hlisting:itemUrl (6,639 Domains)
- hlisting:description (1,467 Domains)
- hlisting:itemName (168 Domains)
- hlisting:listerUrl (124 Domains)
- hlisting:listerLogo (124 Domains)
- hlisting:listerName (117 Domains)
- hlisting:action (69 Domains)
- hlisting:summary (36 Domains)
- hlisting:dtlisted (27 Domains)
- hlisting:listerOrg (23 Domains)
- vcard2006:tel (21 Domains)
- hlisting:permalink (9 Domains)
- foaf:mbox (5 Domains)
- hlisting:dtexpired (3 Domains)
|
| Show top values by entity count
- hlisting:lister (2,951,794 Entities)
- hlisting:price (2,765,363 Entities)
- hlisting:item (2,449,368 Entities)
- hlisting:itemPhoto (2,449,368 Entities)
- hlisting:itemUrl (2,449,368 Entities)
- hlisting:description (350,590 Entities)
- hlisting:itemName (134,323 Entities)
- hlisting:listerUrl (77,484 Entities)
- hlisting:listerLogo (77,484 Entities)
- hlisting:listerName (73,276 Entities)
- hlisting:action (51,841 Entities)
- hlisting:summary (104,832 Entities)
- hlisting:dtlisted (48,219 Entities)
- hlisting:listerOrg (20,660 Entities)
- vcard2006:tel (9,605 Entities)
- hlisting:permalink (2,454 Entities)
- foaf:mbox (189 Entities)
- hlisting:dtexpired (208 Entities)
|
Triples Extracted | 9,149,734 |
URLs with Triples | 417,518 |
Average Triples per URL | 21.91 |
Domains with Triples | 5,123 |
Average Triples per Domain | 1,786.01 |
Typed Entities | 2,343,367 |
Top Domains by Extracted Triples |
Show top domains
- cookpad.com (2,054,221 triples)
- grouprecipes.com (495,819 triples)
- blogspot.com (124,187 triples)
- nyam.ru (103,556 triples)
- 24kitchen.pt (94,784 triples)
- drinksmixer.com (88,160 triples)
- menunedeli.ru (74,841 triples)
- receptvaros.hu (66,822 triples)
- numnums.com (64,838 triples)
- cook-s.ru (58,286 triples)
- restoranam.net (55,131 triples)
- desktopcookbook.com (53,920 triples)
- pojrem.ru (53,323 triples)
- 1001-rezept.de (48,536 triples)
- takprosto.cc (44,591 triples)
- mirpovara.ru (44,483 triples)
- webopskrifter.dk (43,410 triples)
- trattoriadamartina.com (43,324 triples)
- lakii.com (42,231 triples)
- kosherbygloria.com (42,057 triples)
|
Top Classes | Show top values by domain count
- hrecipe:Recipe (5,193 Domains)
- hrecipe:Ingredient (2,746 Domains)
- hrecipe:Duration (962 Domains)
- hrecipe:Nutrition (331 Domains)
|
| Show top values by entity count
- hrecipe:Recipe (509,178 Entities)
- hrecipe:Ingredient (1,759,187 Entities)
- hrecipe:Duration (54,226 Entities)
- hrecipe:Nutrition (20,776 Entities)
|
Top Properties | Show top values by domain count
- hrecipe:fn (3,435 Domains)
- hrecipe:ingredient (2,746 Domains)
- hrecipe:ingredientName (2,719 Domains)
- hrecipe:photo (2,540 Domains)
- hrecipe:tag (2,328 Domains)
- hrecipe:instructions (2,319 Domains)
- hrecipe:author (1,495 Domains)
- hrecipe:yield (1,378 Domains)
- hrecipe:summary (1,251 Domains)
- hrecipe:published (988 Domains)
- hrecipe:duration (962 Domains)
- hrecipe:durationTime (923 Domains)
- hrecipe:nutrition (331 Domains)
- hrecipe:ingredientQuantity (323 Domains)
- hrecipe:nutritionValue (315 Domains)
- hrecipe:ingredientQuantityType (312 Domains)
- hrecipe:durationTitle (152 Domains)
- hrecipe:nutritionValueType (78 Domains)
|
| Show top values by entity count
- hrecipe:fn (418,479 Entities)
- hrecipe:ingredient (233,142 Entities)
- hrecipe:ingredientName (1,750,706 Entities)
- hrecipe:photo (300,100 Entities)
- hrecipe:tag (152,293 Entities)
- hrecipe:instructions (153,651 Entities)
- hrecipe:author (243,589 Entities)
- hrecipe:yield (116,885 Entities)
- hrecipe:summary (145,415 Entities)
- hrecipe:published (170,018 Entities)
- hrecipe:duration (45,839 Entities)
- hrecipe:durationTime (48,280 Entities)
- hrecipe:nutrition (13,802 Entities)
- hrecipe:ingredientQuantity (172,049 Entities)
- hrecipe:nutritionValue (20,390 Entities)
- hrecipe:ingredientQuantityType (168,475 Entities)
- hrecipe:durationTitle (10,713 Entities)
- hrecipe:nutritionValueType (9,110 Entities)
|
Triples Extracted | 357,472 |
URLs with Triples | 50,249 |
Average Triples per URL | 7.11 |
Domains with Triples | 245 |
Average Triples per Domain | 1,459.07 |
Typed Entities | 145,715 |
Top Domains by Extracted Triples |
Show top domains
- wikipedia.org (281,634 triples)
- thefullwiki.org (16,184 triples)
- antwiki.org (13,668 triples)
- hitchhikersgui.de (7,818 triples)
- wikivisually.com (4,886 triples)
- wikimedia.org (4,777 triples)
- webot.org (2,667 triples)
- wiktionary.org (2,656 triples)
- infogalactic.com (2,636 triples)
- plusg.co.jp (2,254 triples)
- blogspot.com (2,151 triples)
- wikien4.appspot.com (1,324 triples)
- wiki2.org (781 triples)
- wikia.com (751 triples)
- like2do.com (739 triples)
- istanbuldevelopment.com (615 triples)
- kiddle.co (549 triples)
- wikidoc.org (530 triples)
- know.cf (478 triples)
- wordpress.com (475 triples)
|
Top Classes | Show top values by domain count
- wo:species (247 Domains)
- wo:Family (137 Domains)
- wo:Order (135 Domains)
- wo:Genus (134 Domains)
- wo:Kingdom (129 Domains)
- wo:Species (126 Domains)
- wo:Phylum (95 Domains)
- wo:Class (84 Domains)
|
| Show top values by entity count
- wo:species (54,211 Entities)
- wo:Family (14,824 Entities)
- wo:Order (16,050 Entities)
- wo:Genus (14,888 Entities)
- wo:Kingdom (17,033 Entities)
- wo:Species (12,172 Entities)
- wo:Phylum (11,598 Entities)
- wo:Class (4,939 Entities)
|
Top Properties | Show top values by domain count
- wo:speciesName (137 Domains)
- wo:family (137 Domains)
- wo:familyName (136 Domains)
- wo:orderName (135 Domains)
- wo:order (135 Domains)
- wo:genusName (134 Domains)
- wo:genus (134 Domains)
- wo:kingdom (129 Domains)
- wo:kingdomName (129 Domains)
- wo:species (126 Domains)
- wo:scientificName (120 Domains)
- wo:phylumName (95 Domains)
- wo:phylum (95 Domains)
- wo:className (84 Domains)
- wo:class (84 Domains)
|
| Show top values by entity count
- wo:speciesName (15,541 Entities)
- wo:family (14,825 Entities)
- wo:familyName (14,823 Entities)
- wo:orderName (16,050 Entities)
- wo:order (16,051 Entities)
- wo:genusName (14,887 Entities)
- wo:genus (14,889 Entities)
- wo:kingdom (17,034 Entities)
- wo:kingdomName (17,033 Entities)
- wo:species (12,173 Entities)
- wo:scientificName (25,692 Entities)
- wo:phylumName (11,598 Entities)
- wo:phylum (11,598 Entities)
- wo:className (4,939 Entities)
- wo:class (4,939 Entities)
|
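As a consistency check, the "Triples Extracted" figures of the eleven statistics blocks above can be summed and compared with the 31.5 billion RDF quads quoted in the summary. Here is a minimal sketch. The numbers are copied from the blocks in order of appearance, the variable names are illustrative, and the comparison assumes that each extracted triple is stored as exactly one quad.

```python
# "Triples Extracted" values, one per statistics block, in page order.
triples_per_table = [
    20_318_437_064,
    4_159_835_616,
    5_374_115_412,
    1_207_733_576,
    263_545_886,
    90_430_790,
    56_583_642,
    38_699_354,
    27_250_871,
    9_149_734,
    357_472,
]

total_triples = sum(triples_per_table)
# Express the total in billions with one decimal, as in the summary.
total_in_billions = round(total_triples / 1e9, 1)
print(f"{total_triples:,} triples, about {total_in_billions} billion")
```

Under that assumption the sum rounds to 31.5 billion, in line with the total stated in the summary.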