<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Freebase Blog &#187; Data</title>
	<atom:link href="http://blog.freebase.com/category/data/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.freebase.com</link>
	<description>A blog for data geeks, application developers and interested civilians</description>
	<lastBuildDate>Wed, 25 Nov 2009 01:29:13 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>10 Million Topics!</title>
		<link>http://blog.freebase.com/2009/11/24/10-million-topics/</link>
		<comments>http://blog.freebase.com/2009/11/24/10-million-topics/#comments</comments>
		<pubDate>Wed, 25 Nov 2009 00:43:51 +0000</pubDate>
		<dc:creator>zenkat</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Data loads]]></category>
		<category><![CDATA[freebase]]></category>
		<category><![CDATA[milestone]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1279</guid>
		<description><![CDATA[Pop open the Champagne!
Freebase has passed a notable milestone.  On Sunday, at about 11:00am PST, we zoomed by our 10 millionth topic &#8212; and by the time you read this post, we should surpass the 11 million topic mark.  A year ago, Freebase stood at just over 4 million topics.  That&#8217;s an annual growth rate [...]]]></description>
			<content:encoded><![CDATA[<p>Pop open the <a href="http://www.freebase.com/view/en/champagne">Champagne</a>!</p>
<p>Freebase has passed a notable milestone.  On Sunday, at about 11:00am PST, we zoomed by our 10 millionth topic &#8212; and by the time you read this post, we should surpass the 11 million topic mark.  A year ago, Freebase stood at just over 4 million topics.  That&#8217;s an annual growth rate of over 100%.</p>
<p><img class="alignright" src="http://rystarr.files.wordpress.com/2009/06/fireworks1.jpg?w=323&amp;h=283" alt="Celebrate!" width="323" height="258" /></p>
<p>A great deal went into achieving this milestone &#8212; contributions from prolific community members like <a href="http://www.freebase.com/view/user/tfmorris">tfmorris</a>, <a href="http://www.freebase.com/view/user/pak21">pak21</a> and <a href="http://www.freebase.com/view/user/sprocketonline">sprocketonline</a> (see our <a href="http://experthub.freebaseapps.com/lb-contributions">contribution leaderboard</a> for more); new Data Team tools like the <a href="http://data.labs.freebase.com/recon/">recon service</a>, RABJ, and the <a href="http://data.labs.freebase.com/recon/recon.html">spreadsheet loader</a>; and continued growth in traditional data sources like <a href="http://download.freebase.com/wex/">Wikipedia</a>.  But the largest segment of growth came from our continuing efforts to build a comprehensive repository of high-quality information about media in all its forms &#8212; especially <a href="http://www.freebase.com/view/music">music</a>, <a href="http://www.freebase.com/view/film">movies</a>, <a href="http://www.freebase.com/view/tv">TV</a> and <a href="http://www.freebase.com/view/book">books</a>.</p>
<p>In October, we rounded out our TV domain by synchronizing with the excellent user-curated TV fan site <a href="http://www.tvrage.com">TVRage.com</a>.  Combined with earlier data loads from <a href="http://www.thetvdb.com">thetvdb.com</a>, we now have comprehensive coverage of nearly every TV <a href="http://www.freebase.com/view/tv/tv_program">show</a> and <a href="http://www.freebase.com/view/tv/tv_series_episode">episode</a> created in the United States.  It includes cast and credits, as well as links to key TV websites like <a href="http://www.tvguide.com/">tvguide.com</a> and <a href="http://www.hulu.com/">Hulu</a> &#8212; nearly a million topics in all!</p>
<p>But the load that took us over the 10 million mark was the final load of editions from <a href="http://openlibrary.org/">Open Library</a>.  Compromising 650,000 authors, almost 2 million books and 2.1 million book editions,   this load pushed new boundaries in our data acquisition, curation, reconciliation and QA processes.</p>
<p>In the months ahead, we&#8217;ll be continuing to both curate and extend our media data loads with more high-quality data sets.  We plan on continuing to reconcile authors and books already in Freebase, as well as loading more books from curated bibliographic catalogs.  We&#8217;ll also be fleshing out our data about movies with data from <a href="http://developer.netflix.com/">Netflix</a>, as well as restarting our regular synchronizations with <a href="http://www.musicbrainz.org">MusicBrainz</a> and their <a href="http://wiki.musicbrainz.org/Next_Generation_Schema">Next Generation Schema</a>.</p>
<p>Congratulations to everyone who helped get us to this point.  It&#8217;s been an exciting year &#8212; with more great data to come!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/11/24/10-million-topics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Got a minute or two? Help us with the Freebase review queue</title>
		<link>http://blog.freebase.com/2009/10/26/help-us-with-the-freebase-review-queue/</link>
		<comments>http://blog.freebase.com/2009/10/26/help-us-with-the-freebase-review-queue/#comments</comments>
		<pubDate>Mon, 26 Oct 2009 19:45:00 +0000</pubDate>
		<dc:creator>skud</dc:creator>
				<category><![CDATA[Community]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[review queue]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1254</guid>
		<description><![CDATA[Just signal-boosting this mailing list post from Iain (sprocketonline), one of the Freebase Experts team:
If anyone has some spare time today, we could do with your help with voting on delete/merge tasks.  Please head over to http://www.freebase.com/tools/pipeline/showtask and contribute some clicks.
There are loads of topics waiting to be voted on.  They were flagged [...]]]></description>
			<content:encoded><![CDATA[<p>Just signal-boosting <a href="http://lists.freebase.com/pipermail/data-modeling/2009-October/002309.html">this mailing list post</a> from Iain (<a href="http://www.freebase.com/view/user/sprocketonline">sprocketonline</a>), one of the <a href="http://wiki.freebase.com/wiki/Freebase_Experts">Freebase Experts</a> team:</p>
<blockquote><p>If anyone has some spare time today, we could do with your help with voting on delete/merge tasks.  Please head over to <a href="http://www.freebase.com/tools/pipeline/showtask">http://www.freebase.com/tools/pipeline/showtask</a> and contribute some clicks.</p>
<p>There are loads of topics waiting to be voted on.  They were flagged for delete in bulk and been a bit too keen, so there is now a bit of backlog to be worked through.</p>
<p>Some of the topics for deletion are lists &#8211; the names of which fit the pattern &#8220;National Register of Historic Places in {location}&#8221; and &#8220;{country} films of {year}&#8221;.</p>
<p>And others are topics about topics e.g. &#8220;History of x&#8221;, &#8220;Economy of x&#8221;, &#8220;Politics of x&#8221;, &#8220;Communications in x&#8221; etc.  &#8211; the data in these really belongs on the topic x itself, but they can be dealt with later.</p>
<p>Hopefully a bit of crowdsourcing will get the backlog down to usual levels, so please help out at <a href="http://www.freebase.com/tools/pipeline/showtask">http://www.freebase.com/tools/pipeline/showtask</a>.</p>
<p>Thanks,</p>
<p>Iain</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/10/26/help-us-with-the-freebase-review-queue/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>New data dumps available</title>
		<link>http://blog.freebase.com/2009/10/07/new-data-dumps-available-2/</link>
		<comments>http://blog.freebase.com/2009/10/07/new-data-dumps-available-2/#comments</comments>
		<pubDate>Wed, 07 Oct 2009 18:01:22 +0000</pubDate>
		<dc:creator>skud</dc:creator>
				<category><![CDATA[Data dumps]]></category>
		<category><![CDATA[freebase]]></category>
		<category><![CDATA[wex]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1208</guid>
		<description><![CDATA[Just a note from our data team: new data dumps are now available at download.freebase.com, in a variety of formats.
The Freebase data dumps are current as of October 1st, and the WEX (Wikipedia extract) dumps as of September 12th.
]]></description>
			<content:encoded><![CDATA[<p>Just a note from our data team: new data dumps are now available at <a href="http://download.freebase.com">download.freebase.com</a>, in a variety of formats.</p>
<p>The <a href="http://download.freebase.com/datadumps/">Freebase data dumps</a> are current as of October 1st, and the <a href="http://download.freebase.com/wex/">WEX</a> (Wikipedia extract) dumps as of September 12th.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/10/07/new-data-dumps-available-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Great new schema visualisation tool</title>
		<link>http://blog.freebase.com/2009/09/14/great-new-schema-visualisation-tool/</link>
		<comments>http://blog.freebase.com/2009/09/14/great-new-schema-visualisation-tool/#comments</comments>
		<pubDate>Mon, 14 Sep 2009 23:47:08 +0000</pubDate>
		<dc:creator>skud</dc:creator>
				<category><![CDATA[Applications]]></category>
		<category><![CDATA[Schema]]></category>
		<category><![CDATA[Acre]]></category>
		<category><![CDATA[freebase]]></category>
		<category><![CDATA[pak21]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1167</guid>
		<description><![CDATA[One of our Freebase Experts, Philip Kendall, recently built a fantastic tool for understanding Freebase schemas.  Schemaviz shows types and properties within any domain you specify.  Here&#8217;s the schema for one of our simpler commons domains:

Philip&#8217;s email explains what you&#8217;re seeing:

* Nodes represent types, edges represent properties.
* Reverse properties are shown in parentheses [...]]]></description>
			<content:encoded><![CDATA[<p>One of our Freebase Experts, <a href="http://www.freebase.com/view/user/pak21">Philip Kendall</a>, recently built a fantastic tool for understanding Freebase schemas.  <a href="http://schemaviz.freebaseapps.com/">Schemaviz</a> shows types and properties within any domain you specify.  Here&#8217;s the schema for one of our simpler commons domains:</p>
<p><a href="http://schemaviz.freebaseapps.com/?domain=%2Fbicycles"><img alt="" src="http://graphviz.mqlx.com/graph/?id=883e98e8e0415fbaf4d970d5fd841b79.png" title="Bicycle schema" class="aligncenter" width="296" height="267" /></a></p>
<p>Philip&#8217;s email explains what you&#8217;re seeing:</p>
<blockquote><p>
* Nodes represent types, edges represent properties.<br />
* Reverse properties are shown in parentheses after the master property name.<br />
* Grey edges represent hidden properties<br />
* Blue nodes are types outside the domain being viewed, but which are linked to from the domain (the edge will be dashed if the master property is from the type outside the domain to that inside the domain)<br />
* Red nodes represent undocumented types<br />
* Node shapes indicate the display style of the type (ellipse: standard, rectangle: CVT, triangle: enumeration)<br />
* All nodes are clickable to view that type
</p></blockquote>
<p>The visualisations are now built into the <a href="http://schemas.freebaseapps.com/domain?id=/food">schema explorer</a>.</p>
<p>As with all other Acre apps, the <a href="http://acre.freebase.com/#app=/user/pak21/schemaviz&#038;file=index">source code</a> is available and cloneable.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/09/14/great-new-schema-visualisation-tool/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Gender and names in Freebase</title>
		<link>http://blog.freebase.com/2009/09/09/gender-and-names-in-freebase/</link>
		<comments>http://blog.freebase.com/2009/09/09/gender-and-names-in-freebase/#comments</comments>
		<pubDate>Thu, 10 Sep 2009 00:02:47 +0000</pubDate>
		<dc:creator>skud</dc:creator>
				<category><![CDATA[Applications]]></category>
		<category><![CDATA[Freebase Commons]]></category>
		<category><![CDATA[freebase]]></category>
		<category><![CDATA[gender]]></category>
		<category><![CDATA[names]]></category>
		<category><![CDATA[people]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1161</guid>
		<description><![CDATA[If you&#8217;ve looked at the Genderizer queue lately, you&#8217;ll have noticed a lot of Person topics that have a name, but no description or picture.  It makes it hard to guess their gender, and you&#8217;re left relying just on their given name.  It seemed to me that a computer could do this just [...]]]></description>
			<content:encoded><![CDATA[<p>If you&#8217;ve looked at the <a href="http://genderizer.freebaseapps.com/">Genderizer</a> queue lately, you&#8217;ll have noticed a lot of Person topics that have a name, but no description or picture.  It makes it hard to guess their gender, and you&#8217;re left relying just on their given name.  It seemed to me that a computer could do this just as well as a human, so I asked our data team what they thought.</p>
<p>Brian Karlak looked into it and told me:</p>
<blockquote><p>We have ~600K people in Freebase with both names and genders.</p>
<p>I took the first name as the first space-separated token in the /type/object/name of these people.  Dreadfully simplistic, I know, but it will do for this analysis.  It does mean that &#8220;first names&#8221; will include such honorifics as  &#8220;King&#8221;, &#8220;Princess&#8221; and &#8220;Dr.&#8221;, but that&#8217;s OK &#8212; some correlate very well with gender.</p>
<p>I looked for first names with at least 100 exemplars.  Of those, there were 693 first names that correlated >99% with a particular gender.  There were 507 that showed 100% correlation with a particular gender.  Over 80% (412) of these were male names.</p>
<p>There were 127 first names that had at least 100 exemplars but did not correlate consistently with a single gender.  The most gender-ambiguous name was Andrea (nearly 50/50 split), followed by Ashley (48/52), Dana (53/47),  Nicola (47/53), and Charlie (47/53).</p></blockquote>
<p>Brian tells me we&#8217;ll be able to use this to assert the genders of people in Freebase, so you don&#8217;t have to. </p>
<p>If you&#8217;re interested in gendered names in Freebase, I recently created an app to do more or less what Brian is doing above.  It&#8217;s called <a href="http://genderednames.skud.user.dev.freebaseapps.com/">Gendered Names</a> and if you tell it a given name, it will tell you the split between male and female based on what&#8217;s in Freebase and present it as a nice chart.  Here&#8217;s what it thinks about the name &#8220;Evelyn&#8221; for example:</p>
<p><img alt="" src="http://chart.apis.google.com/chart?chs=500x200&#038;chd=t:18,82&#038;cht=p3&#038;chl=Male%2018%|Female%2082%&#038;chco=6666ff,ff6699" title="Pie chart of the gender distribution of Evelyn" class="aligncenter" width="500" height="200" /></p>
<p>Of course, as with any Acre app, you can <a href="http://acre.freebase.com/#app=/user/skud/genderednames&#038;file=index">view the source</a> and clone it if you&#8217;d like to build something similar.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/09/09/gender-and-names-in-freebase/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Freebase now has 8.4 million topics</title>
		<link>http://blog.freebase.com/2009/08/12/freebase-now-has-8-4-million-topics/</link>
		<comments>http://blog.freebase.com/2009/08/12/freebase-now-has-8-4-million-topics/#comments</comments>
		<pubDate>Wed, 12 Aug 2009 20:52:13 +0000</pubDate>
		<dc:creator>skud</dc:creator>
				<category><![CDATA[Applications]]></category>
		<category><![CDATA[Data loads]]></category>
		<category><![CDATA[Freebase Commons]]></category>
		<category><![CDATA[books]]></category>
		<category><![CDATA[olp]]></category>
		<category><![CDATA[tippify]]></category>
		<category><![CDATA[tv]]></category>
		<category><![CDATA[tv4me]]></category>
		<category><![CDATA[tvrage]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1138</guid>
		<description><![CDATA[Remember not so long ago when we were very excited about crossing the 5 million mark?  Well, in the last couple of weeks, we suddenly raced past 6, 7, and 8 million topics in Freebase, to reach a current total of 8,450,348 topics.
This is largely thanks to two big loads from the data team: [...]]]></description>
			<content:encoded><![CDATA[<p>Remember not so long ago when we were very excited about crossing the <a href="http://blog.freebase.com/2009/04/08/five-million-topics/">5 million</a> mark?  Well, in the last couple of weeks, we suddenly raced past 6, 7, and 8 million topics in Freebase, to reach a current total of <a href="http://www.freebase.com/explore">8,450,348 topics</a>.</p>
<p>This is largely thanks to two big loads from the data team: first up, a massive import of millions of books and related information from the <a href="http://openlibrary.org/">Open Library Project</a>, and secondly a big load of around 255k TV episodes from <a href="http://tvrage.com/">TVRage</a>.</p>
<p>Here&#8217;s a chart showing topic growth in Freebase since the beginning of April:</p>
<div id="attachment_1143" class="wp-caption aligncenter" style="width: 603px"><img src="http://blog.freebase.com/wp-content/uploads/2009/08/fb-growth2.png" alt="Topic growth in Freebase, April to August 2009" title="Topic growth in Freebase" width="593" height="360" class="size-full wp-image-1143" /><p class="wp-caption-text">Topic growth in Freebase, April to August 2009</p></div>
<p>To take a look at the relevant data, visit our <a href="http://www.freebase.com/view/book">Publishing Commons</a> or <a href="http://www.freebase.com/view/tv">TV Commons</a>.  For some apps taking advantage of all this data, look at <a href="http://tippify.com">Tippify</a> for book recommendations, or Alex&#8217;s <a href="http://tv4me.freebaseapps.com/">TV4Me</a> app (currently in development), a mashup of Freebase&#8217;s TV data with US broadcast schedules.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/08/12/freebase-now-has-8-4-million-topics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Introducing (de)Typewriter</title>
		<link>http://blog.freebase.com/2009/07/06/introducing-detypewriter/</link>
		<comments>http://blog.freebase.com/2009/07/06/introducing-detypewriter/#comments</comments>
		<pubDate>Mon, 06 Jul 2009 22:11:36 +0000</pubDate>
		<dc:creator>zenkat</dc:creator>
				<category><![CDATA[Data]]></category>
		<category><![CDATA[Data mobs]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[detypewriter]]></category>
		<category><![CDATA[freebase]]></category>
		<category><![CDATA[vtalwar]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1035</guid>
		<description><![CDATA[You know that a person is not a building, that a play is not a film, and that Glen Beck is not a reporter.  Right?
And now &#8212; with your help &#8212; Freebase can, too!
http://detypewriter.freebaseapps.com/
Written by vtalwar, (de)Typewriter is a Typewriter-style application based upon our incompatible types definitions.  On a nightly basis, we check our topics [...]]]></description>
			<content:encoded><![CDATA[<p>You know that a person is not a building, that a play is not a film, and that Glen Beck is not a reporter.  Right?</p>
<p>And now &#8212; with your help &#8212; Freebase can, too!</p>
<p><a href="http://detypewriter.freebaseapps.com/">http://detypewriter.freebaseapps.com/</a></p>
<p>Written by <a href="http://www.freebase.com/view/user/vtalwar" target="_blank">vtalwar</a>, (de)Typewriter is a Typewriter-style application based upon our <a href="http://www.freebase.com/view/dataworld/incompatible_types">incompatible types definitions</a>.  On a nightly basis, we check our topics against these rules.  Topics that have incompatible types are marked and flagged for review and cleanup.</p>
<p>That&#8217;s where you come in.  Cleanup of topics with incompatible types isn&#8217;t necessarily straightforward.  In some cases, the incompatibility is just caused by a simple mistyping &#8212; a book was mistakenly typed as a person, or a band was mistakenly typed as an album.  These cases are easy to fix: the offending type just needs to be removed.  There are other cases, however, which are more difficult to fix &#8212; for instance, in cases in which two entities were mistakenly merged. This requires that the offending topic be split into multiple topics, which is a much more involved process.</p>
<p>For instance, the band &#8220;Happy Mondays&#8221; is not and never will be a /people/person, so it needs to be detyped.  However, &#8220;Bob Dylan&#8221; can be both a /people/person, and his eponymous /music/album, so it would need to be split.(*)</p>
<p>Once we know which case we&#8217;re dealing with, we can make the fix.  But we need your to help us figure out which case we&#8217;re looking at.   Much like the Typewriter and Genderizer apps on which it&#8217;s based, the (de)Typewriter application walks you through a list of potential incompatibilities.  But in this case, our question is phrased differently:</p>
<p><img class="aligncenter size-full wp-image-1036" src="http://blog.freebase.com/wp-content/uploads/2009/06/detypewriter.jpg" alt="detypewriter" width="462" height="93" />First off, you&#8217;ll notice that we&#8217;re not asking about a particular Freebase topic.  That&#8217;s because we already know that the Los del Rio topic may have problems.  Instead, we want to know if somewhere, out there in the big wide world, if there is a person named Los del Rio.  If there is, then we know we have some split-like cleanup to do.  If there&#8217;s isn&#8217;t, however, then we know we can automatically remove the /people/person type from the Los del Rio topic.</p>
<p>For each question, you&#8217;ll get some clues.  On the left panel, you&#8217;ll see the results of a wikipedia search for that name.  On the right, you&#8217;ll have Google, IMDB, or Google Maps results.  Usually the answer will be obvious from these clues &#8212; but feel free to follow links if you think more information will help.</p>
<p>Try not to &#8220;overthink&#8221; the questions.  If the answer isn&#8217;t obvious (or if a question and/or its supporting clues seem confusing to you), just click &#8220;Skip&#8221; to go to the next question.  You can also get extra help by clicking the &#8220;Help&#8221; button, which will give you a discussion of the type&#8217;s usage as well as letting you know which incompatible type rule spawned this question.</p>
<p>Finally, note that unlike Typewriter and Genderizer, (de)Typewriter uses a vote tallying mechanism.  Detyping doesn&#8217;t occur immediately after clicking a button.  Instead, votes on the separate typings are compared to determine the proper course of action.</p>
<p>Have fun detyping!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/07/06/introducing-detypewriter/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>New Freebase data dumps and Wikipedia Extract (WEX) available</title>
		<link>http://blog.freebase.com/2009/06/26/new-freebase-data-dumps-and-wikipedia-extract-wex-available/</link>
		<comments>http://blog.freebase.com/2009/06/26/new-freebase-data-dumps-and-wikipedia-extract-wex-available/#comments</comments>
		<pubDate>Fri, 26 Jun 2009 20:42:14 +0000</pubDate>
		<dc:creator>skud</dc:creator>
				<category><![CDATA[Data dumps]]></category>
		<category><![CDATA[freebase]]></category>
		<category><![CDATA[wex]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1027</guid>
		<description><![CDATA[And while I&#8217;m reposting things from the dev list, Viral just announced that new data dumps are available from download.freebase.com.  This includes all the facts in Freebase in two different formats, plus our Wikipedia Extract.
]]></description>
			<content:encoded><![CDATA[<p>And while I&#8217;m reposting things from the dev list, Viral just announced that new data dumps are available from <a href="http://download.freebase.com/">download.freebase.com</a>.  This includes all the facts in Freebase in two different formats, plus our Wikipedia Extract.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/06/26/new-freebase-data-dumps-and-wikipedia-extract-wex-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Check out Stefano&#8217;s Schema Explorer</title>
		<link>http://blog.freebase.com/2009/06/26/check-out-stefanos-schema-explorer/</link>
		<comments>http://blog.freebase.com/2009/06/26/check-out-stefanos-schema-explorer/#comments</comments>
		<pubDate>Fri, 26 Jun 2009 20:32:05 +0000</pubDate>
		<dc:creator>skud</dc:creator>
				<category><![CDATA[Acre]]></category>
		<category><![CDATA[Applications]]></category>
		<category><![CDATA[MQL]]></category>
		<category><![CDATA[Schema]]></category>
		<category><![CDATA[freebase]]></category>
		<category><![CDATA[stefano mazzocchi]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=1022</guid>
		<description><![CDATA[Stefano writes on our developer mailing list:
If you&#8217;re like me, MQL often feels like a combination of power and dark  magic:  some people can whip up a MQL query in no time, like they knew  all about every type, every property and how they link all the various pieces of the graph [...]]]></description>
			<content:encoded><![CDATA[<p><i><a href="http://www.freebase.com/view/en/stefano_mazzocchi">Stefano</a> writes on our <a href="http://lists.freebase.com/mailman/listinfo/developers">developer mailing list</a>:</i></p>
<p>If you&#8217;re like me, MQL often feels like a combination of power and dark  magic:  some people can whip up a MQL query in no time, like they knew  all about every type, every property and how they link all the various pieces of the graph together.</p>
<p>Asking around how people did it, yielded several strategies&#8230; but many basically revolved around using the &#8216;explore pages&#8217; (those magically  revealed by hitting the F8 key on freebase.com).</p>
<p>Then David wrote the new query editor, with the amazing context completion and I thought I was set&#8230; but while my MQL mojo improved a  lot with it, it always felt like something was missing: if I knew the type I wanted, I could get started&#8230; but what if I didn&#8217;t know where to  start? what if I didn&#8217;t know how to construct the query and I needed  some guidance in navigating how the types were all connected together?  (especially with CVTs and whatnot)</p>
<p>I felt like I was in a fog and with no map: finding my way out felt tentative at best, frustrating at worst.</p>
<p>The Jason showed me an acre app he had written a long time ago and that he uses all the time that offered a browsable interface to all the  freebase domains, types and properties, their IDs&#8230; and how they all connect together.</p>
<p>I had found my map.</p>
<p>Unfortunately, while extremely useful already, the schema explorer app needed some love&#8230; so I decided to give it some and the result has been so successful internally that I&#8217;m happy to announce it to the public so that everybody can benefit from it. Find it at: <a href="http://schemas.freebaseapps.com/">http://schemas.freebaseapps.com/</a></p>
<p>I hope you&#8217;ll find it as useful as I do.</p>
<p>NOTES:</p>
<p>1) suggestions/criticism/feature-requests appreciated and welcome</p>
<p>2) this is an acre app and if you&#8217;re curious (or interested in forking it) you can find it at: <a href="http://acre.freebase.com/#app=/user/jdouglas/schemas">http://acre.freebase.com/#app=/user/jdouglas/schemas</a></p>
<p>3) this couldn&#8217;t have been possible without Jason&#8217;s work upon which I built this so kudos go to him while all the mistakes remain mine</p>
<p>4) I&#8217;ve been working on integrating the schema explorer with the upcoming freebase developer&#8217;s documentation hub, so that newcomers won&#8217;t have to hunt down this URL in the mail archives to find a little guidance in the type maze.</p>
<p>Enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/06/26/check-out-stefanos-schema-explorer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Typing authors and books</title>
		<link>http://blog.freebase.com/2009/06/24/typing-authors-and-books/</link>
		<comments>http://blog.freebase.com/2009/06/24/typing-authors-and-books/#comments</comments>
		<pubDate>Wed, 24 Jun 2009 19:51:25 +0000</pubDate>
		<dc:creator>skud</dc:creator>
				<category><![CDATA[Applications]]></category>
		<category><![CDATA[Community]]></category>
		<category><![CDATA[Data mobs]]></category>
		<category><![CDATA[95% typed challenge]]></category>
		<category><![CDATA[freebase]]></category>
		<category><![CDATA[gwap]]></category>
		<category><![CDATA[human intelligence]]></category>
		<category><![CDATA[typewriter]]></category>

		<guid isPermaLink="false">http://blog.freebase.com/?p=968</guid>
		<description><![CDATA[I was just talking to some of the Freebase Data Team, and they asked me to pass on a request.  They&#8217;re working on several large loads of books and authors: the Open Library Project, ISFDB, and an as-yet-unnamed major university library&#8217;s catalogue.  As information comes in from an external source, it&#8217;s important to [...]]]></description>
			<content:encoded><![CDATA[<p>I was just talking to some of the Freebase Data Team, and they asked me to pass on a request.  They&#8217;re working on several large loads of books and authors: the Open Library Project, ISFDB, and an as-yet-unnamed major university library&#8217;s catalogue.  As information comes in from an external source, it&#8217;s important to match it to what&#8217;s already in Freebase.  This is called <b>reconciliation</b>.  </p>
<p>Reconciliation gets better the more we know about the topics in Freebase, and one of the most important things to know is a topic&#8217;s types.  That is, when we&#8217;re trying to match an author &#8220;Jane Smith&#8221;, it helps if we can limit our matching to people named Jane Smith that we already know are authors.</p>
<p>In short, <b>typed topics make our data loads more effective</b>.</p>
<p>We&#8217;re already working on <a href="http://blog.freebase.com/2009/06/23/the-95-typed-challenge/">typing 95% of the topics in Freebase</a>.  If you&#8217;d like to help out, and know that your efforts are going to be multiplied by the data team&#8217;s mass loads, you can help us by typing authors and books.</p>
<p>Using <a href="http://typewriter.freebaseapps.com/">Typewriter</a>, simply choose <a href="http://typewriter.freebaseapps.com/queue/book/author">Author</a> (15,349 items) or <a href="http://typewriter.freebaseapps.com/queue/book/book">Book</a> (9,323 items) from the list of types, and answer &#8220;Yes&#8221; or &#8220;No&#8221; for each topic.  </p>
<p><img src="http://blog.freebase.com/wp-content/uploads/2009/06/typewriter.png" alt="typewriter" title="typewriter" width="515" height="304" class="aligncenter size-full wp-image-971" /></p>
<p>Don&#8217;t forget, there&#8217;s also an iPhone app: great for that long commute or boring meeting!</p>
<p>For more background reading on our data loads, reconciliation, and human intelligence, check out:</p>
<ul>
<li><a href="http://blog.freebase.com/2009/01/06/meet-the-freebase-bots/">Meet the Freebase bots</a>
<li><a href="http://blog.freebase.com/2009/03/05/partial-evidence-and-human-intelligence-a-dispatch-from-the-data-mines/">Partial evidence and human intelligence: a dispatch from the data mines</a>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://blog.freebase.com/2009/06/24/typing-authors-and-books/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
