In summary, we found structured data within 1.24 billion HTML pages out of the 3.2 billion pages contained in the crawl (38%).
These pages originate from 5.63 million different pay-level-domains out of the 34 million pay-level-domains covered by the crawl (16.5%).
Altogether, the extracted data sets consist of 44.2 billion RDF quads.
Instructions on how to download the RDFa, Microdata, Embedded JSON-LD and Microformats data sets are given on the page how to get the data.
Triples Extracted | 34,637,805,559 |
URLs with Triples | 901,118,191 |
Average Triples per URL | 38.44 |
Domains with Triples | 2,537,539 |
Average Triples per Domain | 13,650.16 |
Typed Entities | 6,872,341,887 |
Top Domains by Extracted Triples | Show top domains
- moosejaw.com (410,021,025 triples)
- hallmark.com (351,800,741 triples)
- cnbc.com (298,241,469 triples)
- hotels.com (283,236,562 triples)
- repairpal.com (266,250,223 triples)
- uncommongoods.com (261,844,622 triples)
- justia.com (207,513,581 triples)
- leadferret.com (207,038,617 triples)
- propartner.ru (180,976,524 triples)
- callersmart.com (162,779,369 triples)
- ticketprocess.com (162,300,892 triples)
- gigmasters.com (151,575,204 triples)
- epicsports.com (132,459,147 triples)
- unitiki.com (125,549,640 triples)
- drom.ru (122,572,723 triples)
- zap2it.com (122,113,522 triples)
- caasa.it (108,318,384 triples)
- apple.com (105,659,632 triples)
- flightaware.com (104,808,131 triples)
- getauto.com (95,292,182 triples)
- More
|
Top Classes | Show top values by domain count
- schema:WebPage (761,035 Domains)
- schema:PostalAddress (338,058 Domains)
- schema:SiteNavigationElement (321,892 Domains)
- schema:WPHeader (300,888 Domains)
- schema:WPFooter (288,877 Domains)
- schema:Blog (272,552 Domains)
- schema:Article (254,213 Domains)
- schema:Product (249,947 Domains)
- schema:Organization (234,353 Domains)
- data-voc:Breadcrumb (200,169 Domains)
- schema:LocalBusiness (192,558 Domains)
- schema:BlogPosting (181,298 Domains)
- schema:WPSideBar (177,712 Domains)
- schema:Offer (169,477 Domains)
- schema:Person (155,740 Domains)
- schema:CreativeWork (133,144 Domains)
- https://schema.org/SiteNavigationElement (114,739 Domains)
- https://schema.org/WPHeader (114,281 Domains)
- schema:WebSite (103,980 Domains)
- https://schema.org/WPFooter (102,157 Domains)
|
| Show top values by entity count
- data-voc:Breadcrumb (1,003,747,999 Entities)
- schema:Product (682,850,010 Entities)
- schema:Offer (572,152,283 Entities)
- schema:ListItem (383,817,380 Entities)
- schema:Person (370,175,971 Entities)
- schema:PostalAddress (291,970,567 Entities)
- schema:ImageObject (285,935,057 Entities)
- data-voc:Person (235,906,566 Entities)
- schema:Organization (210,662,634 Entities)
- schema:AggregateRating (163,563,407 Entities)
- schema:Article (155,024,568 Entities)
- schema:BreadcrumbList (108,130,828 Entities)
- schema:WebPage (106,558,965 Entities)
- schema:LocalBusiness (99,543,802 Entities)
- schema:SiteNavigationElement (92,697,489 Entities)
- schema:Place (92,598,440 Entities)
- schema:Comment (81,467,380 Entities)
- schema:GeoCoordinates (70,239,402 Entities)
- schema:Review (69,789,534 Entities)
- schema:NewsArticle (64,169,519 Entities)
|
Top Properties | Show top values by domain count
- schema:WebPage/name (388,575 Domains)
- schema:PostalAddress/streetAddress (300,548 Domains)
- schema:PostalAddress/addressLocality (290,860 Domains)
- schema:WebPage/url (287,602 Domains)
- schema:PostalAddress/postalCode (273,903 Domains)
- schema:WebPage/image (266,204 Domains)
- schema:Product/name (231,882 Domains)
- schema:PostalAddress/addressRegion (195,989 Domains)
- data-voc:Breadcrumb/title (192,755 Domains)
- data-voc:Breadcrumb/url (186,583 Domains)
- schema:WebPage/thumbnailUrl (176,664 Domains)
- schema:WPHeader/headline (173,996 Domains)
- schema:Offer/price (162,414 Domains)
- schema:Product/offers (150,911 Domains)
- schema:Person/name (144,861 Domains)
- schema:LocalBusiness/address (142,957 Domains)
- schema:LocalBusiness/name (140,981 Domains)
- schema:WebPage/mainContentOfPage (139,292 Domains)
- schema:WebPage/description (138,307 Domains)
- schema:Organization/name (134,383 Domains)
|
| Show top values by entity count
- data-voc:Breadcrumb/title (957,254,610 Entities)
- data-voc:Breadcrumb/url (934,743,907 Entities)
- schema:Product/name (612,475,856 Entities)
- schema:Offer/price (541,603,598 Entities)
- schema:Product/image (465,616,541 Entities)
- schema:Offer/priceCurrency (426,856,319 Entities)
- schema:Product/offers (426,736,729 Entities)
- schema:ListItem/name (349,248,899 Entities)
- schema:ListItem/item (314,318,557 Entities)
- schema:ListItem/position (291,066,893 Entities)
- schema:Person/name (281,709,485 Entities)
- schema:Product/url (274,857,075 Entities)
- schema:PostalAddress/addressLocality (245,323,167 Entities)
- schema:Product/description (241,743,632 Entities)
- data-voc:Person/name (226,304,453 Entities)
- schema:Offer/availability (217,088,077 Entities)
- data-voc:Person/title (213,739,741 Entities)
- schema:PostalAddress/streetAddress (184,847,191 Entities)
- schema:PostalAddress/addressRegion (169,378,628 Entities)
- schema:ImageObject/name (149,619,540 Entities)
|
Detailed Statistics as Excel-File |
html-microdata.xlsx (1,180kb) |
Triples Extracted | 1,880,721,886 |
URLs with Triples | 111,411,049 |
Average Triples per URL | 16.88 |
Domains with Triples | 2,116,755 |
Average Triples per Domain | 888.49 |
Typed Entities | 385,731,201 |
Top Domains by Extracted Triples | Show top domains
- tapwage.com (50,143,925 triples)
- hrs.com (31,984,803 triples)
- wsj.com (23,242,951 triples)
- expertissim.com (18,203,038 triples)
- maxpreps.com (17,771,833 triples)
- wayfair.co.uk (15,259,274 triples)
- audible.com (13,240,641 triples)
- huffingtonpost.com (12,713,117 triples)
- klix.ba (12,564,738 triples)
- allmodern.com (11,549,395 triples)
- aviewfrommyseat.com (10,762,941 triples)
- ehc.com (10,434,378 triples)
- upi.com (10,316,948 triples)
- cakecentral.com (10,203,836 triples)
- realself.com (10,048,373 triples)
- flickr.com (9,987,026 triples)
- rome2rio.com (9,599,290 triples)
- si.com (8,869,349 triples)
- justanswer.com (8,712,406 triples)
- worldmarket.com (8,543,907 triples)
- More
|
Top Classes | Show top values by domain count
- schema:WebSite (2,072,793 Domains)
- schema:SearchAction (1,794,670 Domains)
- schema:Organization (537,173 Domains)
- schema:LocalBusiness (122,929 Domains)
- schema:Person (97,606 Domains)
- schema:PostalAddress (18,849 Domains)
- schema:ListItem (10,364 Domains)
- schema:ContactPoint (10,236 Domains)
- schema:BreadcrumbList (9,560 Domains)
- schema:GeoCoordinates (8,258 Domains)
- schema:ImageObject (7,809 Domains)
- schema:Place (6,979 Domains)
- schema:WebPage (6,088 Domains)
- schema:Event (4,477 Domains)
- schema:Offer (3,848 Domains)
- schema:AggregateRating (3,717 Domains)
- schema:Article (2,907 Domains)
- schema:Product (2,526 Domains)
- schema:BlogPosting (2,394 Domains)
- schema:ItemList (2,156 Domains)
|
| Show top values by entity count
- schema:Organization (59,940,002 Entities)
- schema:ImageObject (51,102,547 Entities)
- schema:WebSite (41,766,626 Entities)
- schema:ListItem (35,422,376 Entities)
- schema:SearchAction (32,740,938 Entities)
- schema:NewsArticle (29,174,193 Entities)
- schema:Person (27,704,784 Entities)
- schema:WebPage (13,146,602 Entities)
- schema:BreadcrumbList (9,471,252 Entities)
- schema:PostalAddress (9,291,712 Entities)
- schema:Offer (9,126,171 Entities)
- schema:ContactPoint (8,579,663 Entities)
- schema:Product (7,377,107 Entities)
- schema:Article (4,725,337 Entities)
- schema:Place (4,640,903 Entities)
- schema:GeoCoordinates (3,984,928 Entities)
- schema:AggregateRating (3,085,736 Entities)
- schema:Thing (2,054,300 Entities)
- schema:Event (1,809,562 Entities)
- schema:LocalBusiness (1,727,378 Entities)
|
Top Properties | Show top values by domain count
- schema:url (2,107,978 Domains)
- schema:name (1,971,881 Domains)
- schema:potentialAction (1,794,820 Domains)
- schema:target (1,794,807 Domains)
- schema:query-input (1,794,690 Domains)
- schema:logo (411,948 Domains)
- schema:sameAs (293,009 Domains)
- schema:description (274,038 Domains)
- schema:alternateName (199,351 Domains)
- schema:image (182,510 Domains)
- schema:telephone (138,780 Domains)
- schema:address (130,923 Domains)
- schema:legalName (118,018 Domains)
- schema:email (79,558 Domains)
- schema:openingHours (54,345 Domains)
- schema:streetAddress (20,666 Domains)
- schema:addressLocality (20,353 Domains)
- schema:postalCode (19,477 Domains)
- schema:addressRegion (15,889 Domains)
- schema:addressCountry (11,177 Domains)
|
| Show top values by entity count
- schema:name (159,439,780 Entities)
- schema:url (159,131,722 Entities)
- schema:logo (51,261,404 Entities)
- schema:headline (36,782,381 Entities)
- schema:position (34,864,050 Entities)
- schema:target (33,602,861 Entities)
- schema:item (33,556,392 Entities)
- schema:potentialAction (33,405,822 Entities)
- schema:image (33,111,961 Entities)
- schema:query-input (32,743,576 Entities)
- schema:width (32,284,123 Entities)
- schema:height (32,275,579 Entities)
- schema:publisher (31,261,851 Entities)
- schema:description (28,650,693 Entities)
- schema:sameAs (28,476,465 Entities)
- schema:author (26,093,064 Entities)
- schema:keywords (24,891,866 Entities)
- schema:articleSection (24,884,882 Entities)
- schema:thumbnailUrl (23,754,561 Entities)
- schema:datePublished (22,686,293 Entities)
|
Detailed Statistics as Excel-File |
html-embedded-jsonld.xlsx (48kb) |
Triples Extracted | 4,600,477,456 |
URLs with Triples | 159,748,255 |
Average Triples per URL | 28.8 |
Domains with Triples | 1,668,039 |
Average Triples per Domain | 2,758.02 |
Typed Entities | 1,614,688,960 |
Top Domains by Extracted Triples | Show top domains
- blogspot.com (581,449,550 triples)
- theclothdiaperwhisperer.com (314,888,334 triples)
- wordpress.com (200,179,406 triples)
- ticketprocess.com (161,730,830 triples)
- sonsofstevegarvey.com (59,315,914 triples)
- webotopia.org (55,374,749 triples)
- politico.com (52,156,777 triples)
- wikitravel.org (40,431,101 triples)
- mlblogs.com (36,020,017 triples)
- wikipedia.org (31,863,378 triples)
- neb.com (31,218,470 triples)
- nasa.gov (30,467,312 triples)
- ugent.be (30,320,075 triples)
- couriernews.com (24,937,852 triples)
- ticketstogo.com (24,264,425 triples)
- politico.eu (24,039,565 triples)
- bonhams.com (21,125,617 triples)
- justanswer.com (19,474,019 triples)
- nj.com (17,716,591 triples)
- ibm.com (17,410,583 triples)
- More
|
Top Classes | Show top values by domain count
- vcard2006:Name (1,673,537 Domains)
- vcard2006:VCard (1,672,470 Domains)
- vcard2006:Organization (137,949 Domains)
|
| Show top values by entity count
- vcard2006:Name (768,898,394 Entities)
- vcard2006:VCard (767,957,257 Entities)
- vcard2006:Organization (77,833,309 Entities)
|
Top Properties | Show top values by domain count
- vcard2006:n (1,673,537 Domains)
- vcard2006:fn (1,403,633 Domains)
- vcard2006:given-name (1,396,226 Domains)
- vcard2006:family-name (1,396,097 Domains)
- vcard2006:url (965,928 Domains)
- vcard2006:photo (262,841 Domains)
- vcard2006:adr (165,869 Domains)
- vcard2006:tel (141,641 Domains)
- vcard2006:org (137,949 Domains)
- vcard2006:organization-name (137,949 Domains)
- vcard2006:email (75,279 Domains)
- vcard2006:geo (17,096 Domains)
- vcard2006:nickname (14,124 Domains)
- vcard2006:note (7,650 Domains)
- vcard2006:title (7,219 Domains)
- vcard2006:logo (4,349 Domains)
- vcard2006:category (3,841 Domains)
- vcard2006:workTel (3,544 Domains)
- vcard2006:role (2,773 Domains)
- vcard2006:additional-name (2,361 Domains)
|
| Show top values by entity count
- vcard2006:n (768,898,394 Entities)
- vcard2006:fn (486,116,937 Entities)
- vcard2006:given-name (476,433,636 Entities)
- vcard2006:family-name (476,283,756 Entities)
- vcard2006:photo (279,886,896 Entities)
- vcard2006:url (235,285,882 Entities)
- vcard2006:org (77,833,309 Entities)
- vcard2006:organization-name (77,833,306 Entities)
- vcard2006:adr (56,706,705 Entities)
- vcard2006:tel (25,153,700 Entities)
- vcard2006:nickname (7,282,565 Entities)
- vcard2006:geo (6,659,163 Entities)
- vcard2006:email (6,131,474 Entities)
- vcard2006:note (4,008,437 Entities)
- vcard2006:organization-unit (3,482,973 Entities)
- vcard2006:logo (3,271,027 Entities)
- vcard2006:category (3,005,730 Entities)
- vcard2006:title (2,564,997 Entities)
- vcard2006:role (1,607,275 Entities)
- vcard2006:fax (1,075,971 Entities)
|
Triples Extracted | 2,216,933,416 |
URLs with Triples | 311,533,110 |
Average Triples per URL | 7.12 |
Domains with Triples | 938,830 |
Average Triples per Domain | 2,361.38 |
Typed Entities | 511,555,208 |
Top Domains by Extracted Triples | Show top domains
- imore.com (63,888,651 triples)
- epicsports.com (17,823,627 triples)
- ifood.tv (14,394,337 triples)
- fourfourtwo.com (9,688,076 triples)
- manhattanreview.com (9,468,390 triples)
- securitysystemsnews.com (9,287,355 triples)
- openjurist.org (8,933,680 triples)
- uscfsales.com (8,864,003 triples)
- akihabaranews.com (7,954,025 triples)
- digitaltrends.com (7,849,827 triples)
- expedia.com (7,499,836 triples)
- coveleaderpress.com (7,174,332 triples)
- adsoftheworld.com (7,018,328 triples)
- houseofstaunton.com (6,975,562 triples)
- clashmusic.com (6,893,334 triples)
- guccimaneonline.com (6,698,570 triples)
- cisco.com (6,659,757 triples)
- elvessupply.com (6,404,697 triples)
- nevesta.info (6,300,555 triples)
- capegazette.com (6,248,582 triples)
- More
|
---|
Top Classes | Show top values by domain count
- website (406,441 Domains)
- article (130,211 Domains)
- gd:Breadcrumb (92,574 Domains)
- foaf:Image (84,897 Domains)
- foaf:Document (81,345 Domains)
- sioc:Item (46,457 Domains)
- blog (24,498 Domains)
- sioc:UserAccount (15,261 Domains)
- skos:Concept (13,970 Domains)
- product (9,009 Domains)
- gd:Review-aggregate (6,328 Domains)
- gd:Rating (4,469 Domains)
- sioc:Post (4,214 Domains)
- company (2,860 Domains)
- object (2,791 Domains)
- vcard2006:Address (2,729 Domains)
- gr:BusinessEntity (2,621 Domains)
- sioctypes:BlogPost (2,388 Domains)
- band (2,041 Domains)
- sioctypes:Comment (1,856 Domains)
|
| Show top values by entity count
- gd:Breadcrumb (175,035,307 Entities)
- foaf:Image (73,721,371 Entities)
- article (49,108,112 Entities)
- website (24,081,040 Entities)
- foaf:Document (21,848,027 Entities)
- sioc:Item (20,815,871 Entities)
- sioc:Post (14,431,754 Entities)
- sioctypes:Comment (13,984,027 Entities)
- product (9,247,359 Entities)
- skos:Concept (7,118,804 Entities)
- sioc:UserAccount (6,258,558 Entities)
- gd:Review-aggregate (3,837,709 Entities)
- city (2,655,030 Entities)
- blog (2,584,574 Entities)
- schema:Recipe (2,563,840 Entities)
- gd:Rating (2,389,586 Entities)
- company (2,197,435 Entities)
- book (2,159,866 Entities)
- gd:Event (1,694,750 Entities)
- gd:Breadcrumbs (1,649,275 Entities)
|
Top Properties | Show top values by domain count
- ogp-og:title (386,446 Domains)
- ogp-og:url (384,493 Domains)
- ogp-og:site_name (368,332 Domains)
- ogp-og:image (272,165 Domains)
- ogp-og:description (187,660 Domains)
- ogp-me:title (174,467 Domains)
- ogp-me:url (153,569 Domains)
- ogp-me:description (147,079 Domains)
- ogp-me:site_name (134,283 Domains)
- ogp-me:image (130,209 Domains)
- gd:title (91,679 Domains)
- dc:title (90,771 Domains)
- gd:url (89,659 Domains)
- content:encoded (75,381 Domains)
- fb2008:fbmladmins (63,591 Domains)
- ogp-fb:app_id (60,460 Domains)
- fb2008:fbmlapp_id (56,823 Domains)
- ogp-me:locale (49,914 Domains)
- ogp-og:locale (45,989 Domains)
- sioc:num_replies (45,661 Domains)
|
| Show top values by entity count
- gd:title (170,754,107 Entities)
- gd:url (167,988,158 Entities)
- fb2008:fbmlapp_id (96,650,401 Entities)
- ogp-me:title (81,169,738 Entities)
- ogp-me:image (75,169,004 Entities)
- ogp-me:url (67,839,852 Entities)
- ogp-me:site_name (64,654,891 Entities)
- ogp-me:description (58,553,460 Entities)
- ogp-og:title (49,617,124 Entities)
- ogp-og:url (47,672,539 Entities)
- ogp-og:site_name (46,512,204 Entities)
- ogp-og:image (45,831,883 Entities)
- ogp-fb:app_id (41,810,873 Entities)
- fb2008:fbmladmins (37,868,604 Entities)
- ogp-og:description (34,983,308 Entities)
- content:encoded (25,130,816 Entities)
- ogp-fb:admins (20,526,122 Entities)
- dc:date (18,736,575 Entities)
- dc:title (18,424,554 Entities)
- dc:created (16,739,702 Entities)
|
Detailed Statistics as Excel-File |
html-rdfa.xlsx (159kb) |
Triples Extracted | 300,764,344
|
URLs with Triples | 24,242,546
|
Average Triples per URL | 12.41
|
Domains with Triples | 195,595
|
Average Triples per Domain | 1,537.69
|
Typed Entities | 48,011,285
|
Top Domains by Extracted Triples | Show top domains
- wordpress.com (18,323,664 triples)
- contactmusic.net (7,475,760 triples)
- dribbble.com (7,158,930 triples)
- knfilters.com (7,037,644 triples)
- yahoo.com (6,668,636 triples)
- spletnik.ru (5,794,746 triples)
- koopplein.nl (5,490,630 triples)
- 2modern.com (3,049,270 triples)
- deliichi.jp (2,959,516 triples)
- grouprecipes.com (2,752,610 triples)
- flagmansale.ru (2,689,144 triples)
- bibliacatolica.com.br (2,634,768 triples)
- greenbookblog.org (2,630,320 triples)
- idakoos.com (2,573,048 triples)
- decoboom.ir (2,556,430 triples)
- sheknows.com (2,467,750 triples)
- unvlog.com (2,443,634 triples)
- n-gamz.com (2,314,848 triples)
- rockfordsquire.com (2,253,636 triples)
- heraldo.es (2,036,934 triples)
- allaboutcircuits.com (1,992,650 triples)
- More
|
---|
Top Classes | Show top values by domain count- foaf:Person (196,530 Domains)
|
| Show top values by entity count- foaf:Person (48,011,285 Entities)
|
Top Properties | Show top values by domain count
- xfn:mePage (196,530 Domains)
- xfn:me-hyperlink (149,567 Domains)
- xfn:friend (33,991 Domains)
- xfn:friend-hyperlink (33,987 Domains)
- xfn:colleague (22,089 Domains)
- xfn:colleague-hyperlink (22,085 Domains)
- xfn:met (20,949 Domains)
- xfn:met-hyperlink (20,944 Domains)
- xfn:contact (19,604 Domains)
- xfn:contact-hyperlink (19,592 Domains)
- xfn:acquaintance (12,159 Domains)
- xfn:acquaintance-hyperlink (12,157 Domains)
- xfn:co-worker (11,548 Domains)
- xfn:co-worker-hyperlink (11,545 Domains)
- xfn:neighbor (5,592 Domains)
- xfn:neighbor-hyperlink (5,591 Domains)
- xfn:co-resident (3,767 Domains)
- xfn:co-resident-hyperlink (3,767 Domains)
- xfn:muse (2,866 Domains)
- xfn:muse-hyperlink (2,865 Domains)
|
| Show top values by entity count
- xfn:mePage (48,010,068 Entities)
- xfn:me-hyperlink (20,117,612 Entities)
- xfn:contact (2,173,753 Entities)
- xfn:contact-hyperlink (2,173,720 Entities)
- xfn:friend (1,889,568 Entities)
- xfn:friend-hyperlink (1,889,543 Entities)
- xfn:colleague (1,118,892 Entities)
- xfn:colleague-hyperlink (1,118,866 Entities)
- xfn:met (902,554 Entities)
- xfn:met-hyperlink (902,538 Entities)
- xfn:acquaintance (691,126 Entities)
- xfn:acquaintance-hyperlink (690,775 Entities)
- xfn:co-worker (580,470 Entities)
- xfn:co-worker-hyperlink (580,416 Entities)
- xfn:child-hyperlink (445,437 Entities)
- xfn:child (445,437 Entities)
- xfn:muse (417,901 Entities)
- xfn:muse-hyperlink (417,896 Entities)
- xfn:neighbor (221,671 Entities)
- xfn:neighbor-hyperlink (221,663 Entities)
|
Triples Extracted | 177,931,362 |
URLs with Triples | 3,450,075 |
Average Triples per URL | 51.57 |
Domains with Triples | 22,313 |
Average Triples per Domain | 7,974.34 |
Typed Entities | 33,962,568 |
Top Domains by Extracted Triples | Show top domains
- ticketprocess.com (97,383,033 triples)
- gamestub.com (6,157,306 triples)
- rostender.info (5,416,340 triples)
- naturalsciences.org (4,246,955 triples)
- conventionscene.com (3,776,938 triples)
- eventsinamerica.com (3,501,355 triples)
- abctickets.com (3,425,198 triples)
- wikipedia.org (3,367,275 triples)
- gustavus.edu (3,269,241 triples)
- yahoo.com (2,563,284 triples)
- ticketsinventory.com (2,501,837 triples)
- ticketnetwork.com (2,106,612 triples)
- texas.gov (1,147,278 triples)
- yesmagazine.org (1,096,783 triples)
- excite.com (988,201 triples)
- lanyrd.com (926,488 triples)
- oreilly.com (906,207 triples)
- amazingregistry.com (860,048 triples)
- superboleteria.com (850,502 triples)
- free-photos.biz (833,725 triples)
- More
|
Top Classes | Show top values by domain count
- icaltzd:vcalendar (22,432 Domains)
- icaltzd:Vevent (19,028 Domains)
- icaltzd:DomainOf_rrule (20 Domains)
- icaltzd:Vtodo (9 Domains)
- icaltzd:Vjournal (1 Domain)
|
| Show top values by entity count
- icaltzd:Vevent (30,484,000 Entities)
- icaltzd:vcalendar (3,476,804 Entities)
- icaltzd:DomainOf_rrule (1,562 Entities)
- icaltzd:Vtodo (201 Entities)
- icaltzd:Vjournal (1 Entity)
|
Top Properties | Show top values by domain count
- icaltzd:component (19,013 Domains)
- icaltzd:summary (15,618 Domains)
- icaltzd:dtstart (13,945 Domains)
- icaltzd:dtend (8,584 Domains)
- icaltzd:url (7,833 Domains)
- icaltzd:location (6,948 Domains)
- icaltzd:description (6,609 Domains)
- icaltzd:categories (651 Domains)
- icaltzd:uid (362 Domains)
- icaltzd:dtstamp (98 Domains)
- icaltzd:organizer (85 Domains)
- icaltzd:calAddress (83 Domains)
- icaltzd:status (23 Domains)
- icaltzd:rrule (20 Domains)
- icaltzd:class (8 Domains)
- icaltzd:freq (2 Domains)
|
| Show top values by entity count
- icaltzd:summary (28,492,879 Entities)
- icaltzd:dtstart (26,392,992 Entities)
- icaltzd:location (25,703,368 Entities)
- icaltzd:url (23,987,263 Entities)
- icaltzd:dtend (3,518,919 Entities)
- icaltzd:component (3,321,258 Entities)
- icaltzd:description (3,190,368 Entities)
- icaltzd:categories (1,483,946 Entities)
- icaltzd:dtstamp (163,829 Entities)
- icaltzd:uid (147,900 Entities)
- icaltzd:status (26,560 Entities)
- icaltzd:calAddress (8,551 Entities)
- icaltzd:organizer (6,932 Entities)
- icaltzd:class (1,826 Entities)
- icaltzd:rrule (1,318 Entities)
- icaltzd:freq (5 Entities)
|
Triples Extracted | 79,631,745
|
URLs with Triples | 4,551,011
|
Average Triples per URL | 17.5
|
Domains with Triples | 16,984
|
Average Triples per Domain | 4,688.63
|
Typed Entities | 13,680,480
|
Top Domains by Extracted Triples | Show top domains
- rakuten.co.jp (4,597,253 triples)
- aluguetemporada.com.br (3,502,205 triples)
- qvc.com (3,355,189 triples)
- homeaway.co.uk (2,454,948 triples)
- homeaway.com (2,336,909 triples)
- listen360.com (2,300,952 triples)
- fewo-direkt.de (2,136,485 triples)
- fotoalben-discount.de (1,967,730 triples)
- 892,000.it (1,905,633 triples)
- homeaway.nl (1,802,500 triples)
- hotelanacapri.co.uk (1,730,190 triples)
- tucsonweekly.com (1,676,489 triples)
- ownersdirect.co.uk (1,450,438 triples)
- carolinaguesthouse.co.uk (1,360,380 triples)
- ayda.ru (1,238,938 triples)
- s2cars.com (1,229,580 triples)
- theboatsideinn.com (1,080,380 triples)
- mail.ru (1,046,753 triples)
- tvtrip.de (1,030,534 triples)
- menupages.com (1,030,208 triples)
- More
|
---|
Top Classes | Show top values by domain count- rev:Review (17,084 Domains)
|
| Show top values by entity count- rev:Review (13,680,480 Entities)
|
Top Properties | Show top values by domain count
- rev:reviewer (13,578 Domains)
- rev:hasReview (13,101 Domains)
- vcard2006:url (13,099 Domains)
- dc:date (11,855 Domains)
- vcard2006:fn (9,574 Domains)
- rev:rating (9,134 Domains)
- rev:title (6,378 Domains)
- rev:text (5,965 Domains)
- vcard2006:photo (3,204 Domains)
- rev:type (1,552 Domains)
|
| Show top values by entity count
- dc:date (10,001,513 Entities)
- rev:reviewer (9,875,516 Entities)
- rev:rating (9,381,140 Entities)
- rev:text (9,262,866 Entities)
- rev:hasReview (8,341,681 Entities)
- vcard2006:url (8,341,549 Entities)
- vcard2006:fn (4,929,615 Entities)
- rev:title (2,942,251 Entities)
- vcard2006:photo (954,075 Entities)
- rev:type (855,524 Entities)
|
Triples Extracted | 36,521,676 |
URLs with Triples | 374,180 |
Average Triples per URL | 97.6 |
Domains with Triples | 4,710 |
Average Triples per Domain | 7,754.07 |
Typed Entities | 9,578,853 |
Top Domains by Extracted Triples | Show top domains
- icase.it (20,109,083 triples)
- remax.com (4,695,081 triples)
- metrocuadrado.com (3,377,790 triples)
- sothebysrealty.com (1,281,224 triples)
- quoka.de (1,174,645 triples)
- iberia.com (845,150 triples)
- after55.com (641,894 triples)
- similarsites.com (534,832 triples)
- paradizo.com (460,870 triples)
- myadsclassified.com (341,764 triples)
- forrentuniversity.com (219,026 triples)
- corporatehousing.com (201,881 triples)
- selectsothebysrealty.com (182,062 triples)
- natyucera.jp (159,520 triples)
- callawayhenderson.com (132,362 triples)
- pacificsothebysrealty.com (96,649 triples)
- fyndtorget.se (87,287 triples)
- yapalim.net (79,160 triples)
- remax-ni.net (51,994 triples)
- immobilmente.com (46,044 triples)
- More
|
Top Classes | Show top values by domain count
- hlisting:Lister (4,714 Domains)
- hlisting:Listing (4,714 Domains)
- hlisting:Item (2,835 Domains)
|
| Show top values by entity count
- hlisting:Lister (3,539,530 Entities)
- hlisting:Listing (3,539,530 Entities)
- hlisting:Item (2,499,793 Entities)
|
Top Properties | Show top values by domain count
- hlisting:lister (4,714 Domains)
- hlisting:price (3,415 Domains)
- hlisting:item (2,835 Domains)
- hlisting:itemPhoto (2,835 Domains)
- hlisting:itemUrl (2,835 Domains)
- hlisting:description (784 Domains)
- hlisting:itemName (253 Domains)
- hlisting:listerUrl (176 Domains)
- hlisting:listerLogo (176 Domains)
- hlisting:listerName (166 Domains)
- hlisting:summary (134 Domains)
- hlisting:action (121 Domains)
- hlisting:listerOrg (85 Domains)
- vcard2006:tel (68 Domains)
- hlisting:dtlisted (46 Domains)
- hlisting:permalink (17 Domains)
- foaf:mbox (9 Domains)
- hlisting:dtexpired (3 Domains)
|
| Show top values by entity count
- hlisting:lister (3,539,530 Entities)
- hlisting:price (2,826,394 Entities)
- hlisting:description (2,690,298 Entities)
- hlisting:item (2,499,793 Entities)
- hlisting:itemPhoto (2,499,793 Entities)
- hlisting:itemUrl (2,499,793 Entities)
- hlisting:summary (1,834,800 Entities)
- hlisting:dtlisted (1,550,263 Entities)
- hlisting:listerUrl (483,799 Entities)
- hlisting:listerLogo (483,799 Entities)
- hlisting:listerName (473,301 Entities)
- hlisting:itemName (305,131 Entities)
- hlisting:action (217,053 Entities)
- hlisting:listerOrg (41,736 Entities)
- vcard2006:tel (17,161 Entities)
- hlisting:permalink (216 Entities)
- foaf:mbox (181 Entities)
- hlisting:dtexpired (79 Entities)
|
Triples Extracted | 24,347,685
|
URLs with Triples | 755,544
|
Average Triples per URL | 32.23
|
Domains with Triples | 2,923
|
Average Triples per Domain | 8,329.69
|
Typed Entities | 5,695,917
|
Top Domains by Extracted Triples | Show top domains
- grouprecipes.com (5,439,541 triples)
- seriouseats.com (1,540,587 triples)
- vpuzo.com (1,310,367 triples)
- bakespace.com (1,295,747 triples)
- mmenu.com (1,124,713 triples)
- nyam.ru (1,062,865 triples)
- shipuxiu.com (634,270 triples)
- yummybook.ru (593,565 triples)
- deepsouthdish.com (569,128 triples)
- gastronom.ru (566,215 triples)
- ovkuse.ru (553,072 triples)
- sheknows.com (548,947 triples)
- happy-giraffe.ru (492,715 triples)
- receptok.ru (434,147 triples)
- cookaround.com (319,620 triples)
- webopskrifter.dk (293,312 triples)
- rachaelraymag.com (262,506 triples)
- drinksmixer.com (257,926 triples)
- pbprog.ru (250,660 triples)
- vkuso.ru (220,513 triples)
- More
|
---|
Top Classes | Show top values by domain count
- sindice:hrecipe/Recipe (2,968 Domains)
- sindice:hrecipe/Ingredient (1,919 Domains)
- sindice:hrecipe/Duration (690 Domains)
- sindice:hrecipe/Nutrition (223 Domains)
|
| Show top values by entity count
- sindice:hrecipe/Ingredient (4,626,891 Entities)
- sindice:hrecipe/Recipe (812,651 Entities)
- sindice:hrecipe/Duration (206,334 Entities)
- sindice:hrecipe/Nutrition (50,041 Entities)
|
Top Properties | Show top values
- sindice:hrecipe/fn (2,325 Domains)
- sindice:hrecipe/ingredient (1,919 Domains)
- sindice:hrecipe/ingredientName (1,906 Domains)
- sindice:hrecipe/photo (1,647 Domains)
- sindice:hrecipe/instructions (1,628 Domains)
- sindice:hrecipe/yield (1,138 Domains)
- sindice:hrecipe/tag (1,077 Domains)
- sindice:hrecipe/summary (1,060 Domains)
- sindice:hrecipe/author (968 Domains)
- sindice:hrecipe/duration (690 Domains)
- sindice:hrecipe/durationTime (653 Domains)
- sindice:hrecipe/published (580 Domains)
- sindice:hrecipe/nutrition (223 Domains)
- sindice:hrecipe/ingredientQuantity (205 Domains)
- sindice:hrecipe/ingredientQuantityType (195 Domains)
- sindice:hrecipe/nutritionValue (190 Domains)
- sindice:hrecipe/durationTitle (92 Domains)
- sindice:hrecipe/nutritionValueType (38 Domains)
|
| Show top values by entity count
- sindice:hrecipe/ingredientName (4,607,336 Entities)
- sindice:hrecipe/fn (764,648 Entities)
- sindice:hrecipe/ingredientQuantity (628,297 Entities)
- sindice:hrecipe/ingredientQuantityType (612,450 Entities)
- sindice:hrecipe/ingredient (596,829 Entities)
- sindice:hrecipe/photo (578,624 Entities)
- sindice:hrecipe/instructions (569,181 Entities)
- sindice:hrecipe/tag (357,077 Entities)
- sindice:hrecipe/yield (302,896 Entities)
- sindice:hrecipe/summary (278,243 Entities)
- sindice:hrecipe/author (211,192 Entities)
- sindice:hrecipe/duration (168,570 Entities)
- sindice:hrecipe/durationTime (132,201 Entities)
- sindice:hrecipe/published (76,455 Entities)
- sindice:hrecipe/nutritionValue (48,558 Entities)
- sindice:hrecipe/nutritionValueType (29,859 Entities)
- sindice:hrecipe/nutrition (27,215 Entities)
- sindice:hrecipe/durationTitle (3,986 Entities)
|
Triples Extracted | 1,319,837
|
URLs with Triples | 170,516
|
Average Triples per URL | 7.74 |
Domains with Triples | 95
|
Average Triples per Domain | 13,893.02 |
Typed Entities | 527,185
|
Top Domains by Extracted Triples | Show top domains
- wikipedia.org (1,258,107 triples)
- wikimedia.org (12,857 triples)
- oiseaux.net (11,501 triples)
- preen.com (6,244 triples)
- schools-wikipedia.org (6,006 triples)
- thefullwiki.org (4,943 triples)
- blogspot.com (4,355 triples)
- wiktionary.org (3,912 triples)
- meddic.jp (3,320 triples)
- everipedia.com (1,551 triples)
- mashpedia.com (1,400 triples)
- wikidoc.org (1,285 triples)
- wikia.com (654 triples)
- wordpress.com (489 triples)
- eol.org (386 triples)
- leparisien.fr (339 triples)
- birdsguides.com (295 triples)
- marefa.org (197 triples)
- readtiger.com (194 triples)
- snaturou2,000.sk (100 triples)
|
---|
Top Classes | Show top values by domain count
- wo:species (97 Domains)
- wo:Family (59 Domains)
- wo:Genus (58 Domains)
- wo:Order (56 Domains)
- wo:Kingdom (51 Domains)
- wo:Species (45 Domains)
- wo:Phylum (44 Domains)
- wo:Class (38 Domains)
|
| Show top values by entity count
- wo:species (179,376 Entities)
- wo:Order (69,703 Entities)
- wo:Kingdom (68,721 Entities)
- wo:Family (65,631 Entities)
- wo:Genus (64,920 Entities)
- wo:Phylum (52,083 Entities)
- wo:Species (14,183 Entities)
- wo:Class (12,568 Entities)
|
Top Properties | Show top values by domain count
- wo:family (59 Domains)
- wo:familyName (58 Domains)
- wo:genusName (58 Domains)
- wo:genus (58 Domains)
- wo:orderName (56 Domains)
- wo:order (56 Domains)
- wo:speciesName (51 Domains)
- wo:kingdom (51 Domains)
- wo:kingdomName (51 Domains)
- wo:species (45 Domains)
- wo:phylumName (44 Domains)
- wo:phylum (44 Domains)
- wo:scientificName (43 Domains)
- wo:className (38 Domains)
- wo:class (38 Domains)
|
| Show top values by entity count
- wo:scientificName (91,120 Entities)
- wo:order (69,705 Entities)
- wo:orderName (69,464 Entities)
- wo:kingdom (68,723 Entities)
- wo:kingdomName (68,720 Entities)
- wo:family (65,633 Entities)
- wo:genus (64,922 Entities)
- wo:genusName (64,918 Entities)
- wo:familyName (64,793 Entities)
- wo:phylum (52,084 Entities)
- wo:phylumName (52,083 Entities)
- wo:speciesName (21,937 Entities)
- wo:species (14,183 Entities)
- wo:className (12,568 Entities)
- wo:class (12,568 Entities)
|