<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: &#8220;Master&#8221; metadata</title>
	<atom:link href="http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/</link>
	<description>Just another Walker Blogs weblog</description>
	<pubDate>Thu, 08 Jan 2009 02:51:05 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.5</generator>
		<item>
		<title>By: Gordon Paynter</title>
		<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-553</link>
		<dc:creator>Gordon Paynter</dc:creator>
		<pubDate>Wed, 18 Jun 2008 03:57:14 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-553</guid>
		<description>Hi all:



We're having to deal with these issues increasingly for &lt;a href="http://nzresearch.org.nz/" rel="nofollow"&gt;nzresearch.org.nz/&lt;/a&gt; and other metadata aggregation websites that the National Library of New Zealand is working on.



Most of our experience is with machine-augmented metadata. For example, we keep the (OAI-PMH) harvested DC "master" metadata for a record (&lt;a href="http://nzresearch.org.nz/index.php/record/viewSchema/16377/3" rel="nofollow"&gt;example&lt;/a&gt;), plus our internally-generated "cleaned up" form of that metadata (&lt;a href="http://nzresearch.org.nz/index.php/record/viewSchema/16377/4" rel="nofollow"&gt;example&lt;/a&gt;) that we derive at harvest time, plus our "admin" metadata (&lt;a&gt;example&lt;/a&gt;) which you might think of as "meta-metadata", or metadata about the quality of the master metadata (used to feed back to repository admins and to create &lt;a href="http://nzresearch.org.nz/index.php/reports/viewLogFile?section=metadata_quality&#38;location=reports_dir&#38;filename=metadata_quality_2008-06-01.csv" rel="nofollow"&gt;national metadata quality reports&lt;/a&gt;).





Anyway, there's a dependency here, because the two sets of derived metadata are based on the harvested "master" metadata and therefore have to be updated whenever the harvested metadata changes (in other words, when we harvest a new version of a record, we can delete all the metadata and start over).



This is relatively simple because we know all the metadata is either harvested (or "master") metadata or directly derived from the harvested metadata. However, our new prototypes throw user-contributed and admin-contributed metadata into the mix, and suddenly you're in a situation where you want to yupdate a record because the master metadata changes and you have to try and figure out exactly what the provenance of all the metadata are for a record so that you only replace the metadata that needs to be replaced. I'm still experimenting with different ways to do this.



I hope that made sense.

Gordon</description>
		<content:encoded><![CDATA[<p>Hi all:</p>
<p>We&#8217;re having to deal with these issues increasingly for <a href="http://nzresearch.org.nz/" rel="nofollow">nzresearch.org.nz/</a> and other metadata aggregation websites that the National Library of New Zealand is working on.</p>
<p>Most of our experience is with machine-augmented metadata. For example, we keep the (OAI-PMH) harvested DC &#8220;master&#8221; metadata for a record (<a href="http://nzresearch.org.nz/index.php/record/viewSchema/16377/3" rel="nofollow">example</a>), plus our internally-generated &#8220;cleaned up&#8221; form of that metadata (<a href="http://nzresearch.org.nz/index.php/record/viewSchema/16377/4" rel="nofollow">example</a>) that we derive at harvest time, plus our &#8220;admin&#8221; metadata (<a>example</a>) which you might think of as &#8220;meta-metadata&#8221;, or metadata about the quality of the master metadata (used to feed back to repository admins and to create <a href="http://nzresearch.org.nz/index.php/reports/viewLogFile?section=metadata_quality&amp;location=reports_dir&amp;filename=metadata_quality_2008-06-01.csv" rel="nofollow">national metadata quality reports</a>).</p>
<p>Anyway, there&#8217;s a dependency here, because the two sets of derived metadata are based on the harvested &#8220;master&#8221; metadata and therefore have to be updated whenever the harvested metadata changes (in other words, when we harvest a new version of a record, we can delete all the metadata and start over).</p>
<p>This is relatively simple because we know all the metadata is either harvested (or &#8220;master&#8221;) metadata or directly derived from the harvested metadata. However, our new prototypes throw user-contributed and admin-contributed metadata into the mix, and suddenly you&#8217;re in a situation where you want to yupdate a record because the master metadata changes and you have to try and figure out exactly what the provenance of all the metadata are for a record so that you only replace the metadata that needs to be replaced. I&#8217;m still experimenting with different ways to do this.</p>
<p>I hope that made sense.</p>
<p>Gordon</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: George Oates</title>
		<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-558</link>
		<dc:creator>George Oates</dc:creator>
		<pubDate>Fri, 06 Jun 2008 21:29:26 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-558</guid>
		<description>Hi - Sorry I'm a bit late to this party, but, I've been wondering about this idea too... Our experience in The Commons has been really interesting. I think the initial guess was that tags added by the Flickr Community would be able to be mined/harvested for instant value, but it turns out that the real gold is hidden in people's conversations about what they see in the photographs.



Conversations are very difficult for a computer to parse, but easy to read. I was chuffed to hear how the Library of Congress team built a little report that dumped all the comments chronologically into an Excel doc so they could be &lt;i&gt;read&lt;/i&gt;. Turns out the LC has updated around 200 of their catalogues as a result of these conversations. So, rather than "defying" the Master Metadata, it is simply augmented as the photograph's context is broadened. I love the idea that an institution will hold "The Golden Copy" of an object (and its metadata), but that there can be a network of "augmentation channels" that the institution may pick from, or simply link to on the web.



That's not to say that tags are useless though, but their real strength is in the Flickr context, nestled amongst all the other words in the natural language "lexicon" that has grown over the last four years. Now I find myself wondering how we might be able to make &lt;i&gt;that&lt;/i&gt; available, the lexicon I mean. I would love to be able to compare this natural language lexicon against all the controlled vocabularies out there. I know which one I'd find easier to navigate! ;)</description>
		<content:encoded><![CDATA[<p>Hi - Sorry I&#8217;m a bit late to this party, but, I&#8217;ve been wondering about this idea too&#8230; Our experience in The Commons has been really interesting. I think the initial guess was that tags added by the Flickr Community would be able to be mined/harvested for instant value, but it turns out that the real gold is hidden in people&#8217;s conversations about what they see in the photographs.</p>
<p>Conversations are very difficult for a computer to parse, but easy to read. I was chuffed to hear how the Library of Congress team built a little report that dumped all the comments chronologically into an Excel doc so they could be <i>read</i>. Turns out the LC has updated around 200 of their catalogues as a result of these conversations. So, rather than &#8220;defying&#8221; the Master Metadata, it is simply augmented as the photograph&#8217;s context is broadened. I love the idea that an institution will hold &#8220;The Golden Copy&#8221; of an object (and its metadata), but that there can be a network of &#8220;augmentation channels&#8221; that the institution may pick from, or simply link to on the web.</p>
<p>That&#8217;s not to say that tags are useless though, but their real strength is in the Flickr context, nestled amongst all the other words in the natural language &#8220;lexicon&#8221; that has grown over the last four years. Now I find myself wondering how we might be able to make <i>that</i> available, the lexicon I mean. I would love to be able to compare this natural language lexicon against all the controlled vocabularies out there. I know which one I&#8217;d find easier to navigate! ;)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Lynda Kelly</title>
		<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-552</link>
		<dc:creator>Lynda Kelly</dc:creator>
		<pubDate>Mon, 05 May 2008 20:16:18 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-552</guid>
		<description>Hi Nate, there has been some discussion about the Walker Art teens website on my &lt;a href="http://amarclk.blogspot.com/2008/04/designing-for-teens-mw2008-day-3.html" rel="nofollow"&gt;blog&lt;/a&gt;. Do you have any more information about it somewhere here. Am particularly interested in the development and subsequent design of it.</description>
		<content:encoded><![CDATA[<p>Hi Nate, there has been some discussion about the Walker Art teens website on my <a href="http://amarclk.blogspot.com/2008/04/designing-for-teens-mw2008-day-3.html" rel="nofollow">blog</a>. Do you have any more information about it somewhere here. Am particularly interested in the development and subsequent design of it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mia</title>
		<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-555</link>
		<dc:creator>Mia</dc:creator>
		<pubDate>Sat, 19 Apr 2008 20:19:55 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-555</guid>
		<description>Ok, I've found the reference to London's Transport Museum taking UGC directly back into their collections management system in a post on the Museums Computer Group list: http://tinyurl.com/5emc65



And I've blogged a bit about it at http://openobjects.blogspot.com/2008/04/museum-and-claytons-audience.html</description>
		<content:encoded><![CDATA[<p>Ok, I&#8217;ve found the reference to London&#8217;s Transport Museum taking UGC directly back into their collections management system in a post on the Museums Computer Group list: <a href="http://tinyurl.com/5emc65" rel="nofollow">http://tinyurl.com/5emc65</a></p>
<p>And I&#8217;ve blogged a bit about it at <a href="http://openobjects.blogspot.com/2008/04/museum-and-claytons-audience.html" rel="nofollow">http://openobjects.blogspot.com/2008/04/museum-and-claytons-audience.html</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nate Solas</title>
		<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-554</link>
		<dc:creator>Nate Solas</dc:creator>
		<pubDate>Sat, 19 Apr 2008 11:36:19 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-554</guid>
		<description>@Seb - great comment.  I agree in the sense that we shouldn't be judging the user generated content for "rightness", but my real concern was that the UGC is going to be built over a set of harvested data.  The primary collection lives in a remote repository, and I know at some point they're going to come to me and say "hey, can we bring some of the new data back into our main system"?



Still, I think you're given me the answer with the Flickr example: I should just write an API that allows the originating museum to query the system for any UGC.  Now to find the time...  :)</description>
		<content:encoded><![CDATA[<p>@Seb - great comment.  I agree in the sense that we shouldn&#8217;t be judging the user generated content for &#8220;rightness&#8221;, but my real concern was that the UGC is going to be built over a set of harvested data.  The primary collection lives in a remote repository, and I know at some point they&#8217;re going to come to me and say &#8220;hey, can we bring some of the new data back into our main system&#8221;?</p>
<p>Still, I think you&#8217;re given me the answer with the Flickr example: I should just write an API that allows the originating museum to query the system for any UGC.  Now to find the time&#8230;  :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Seb Chan</title>
		<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-551</link>
		<dc:creator>Seb Chan</dc:creator>
		<pubDate>Sat, 19 Apr 2008 08:35:39 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-551</guid>
		<description>Hey Nate



I'm not sure why we need to be concerned with the 'right' metadata. Surely 'right' is just a matter of who the user is - and now that we've moved defiantly away from JUST catering for scholarly users there are a lot of different categories of 'right'.



That's been one of the joys of our implementation of OpenCalais - suddenly we have a whole lot of different metadata that IS 'right' but just not in the ways any of our museum staff - registrars, curators or even the web team - would have thought about our collection. This is because OC is applying a financial markets eye to our content - but this has great value in terms of allowing the browsing of our collection by company name for example.



Likewise, our Commons on Flickr experience (http://www.flickr.com/photos/powerhouse_museum/) has been very much that Flickr users add more tags and different tags to the same content that has been available (and taggable) on our own site for a long time. And yes, we can import them back with the Flickr API.



Our own tagging experience is one of peaceful coexistence . . . . user terms can coexist with formal museum classifications and enhance discovery and cross-search.



Mia - on the LTM example I will be quite surprised if they get UGC at any significant volume. They have set the barriers to participation quite high with requirements for email addresses and user details etc. Remembering that even tagging on the PHM collection - 15million views in 2007, 5thousand tags . . . - and that is without requiring ANY form of login.



Seb</description>
		<content:encoded><![CDATA[<p>Hey Nate</p>
<p>I&#8217;m not sure why we need to be concerned with the &#8216;right&#8217; metadata. Surely &#8216;right&#8217; is just a matter of who the user is - and now that we&#8217;ve moved defiantly away from JUST catering for scholarly users there are a lot of different categories of &#8216;right&#8217;.</p>
<p>That&#8217;s been one of the joys of our implementation of OpenCalais - suddenly we have a whole lot of different metadata that IS &#8216;right&#8217; but just not in the ways any of our museum staff - registrars, curators or even the web team - would have thought about our collection. This is because OC is applying a financial markets eye to our content - but this has great value in terms of allowing the browsing of our collection by company name for example.</p>
<p>Likewise, our Commons on Flickr experience (http://www.flickr.com/photos/powerhouse_museum/) has been very much that Flickr users add more tags and different tags to the same content that has been available (and taggable) on our own site for a long time. And yes, we can import them back with the Flickr API.</p>
<p>Our own tagging experience is one of peaceful coexistence . . . . user terms can coexist with formal museum classifications and enhance discovery and cross-search.</p>
<p>Mia - on the LTM example I will be quite surprised if they get UGC at any significant volume. They have set the barriers to participation quite high with requirements for email addresses and user details etc. Remembering that even tagging on the PHM collection - 15million views in 2007, 5thousand tags . . . - and that is without requiring ANY form of login.</p>
<p>Seb</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mia</title>
		<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-557</link>
		<dc:creator>Mia</dc:creator>
		<pubDate>Wed, 16 Apr 2008 19:19:19 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-557</guid>
		<description>You probably need three labels for tags - institutional, user-generated, and machine-generated (for things like OpenCalais).



There's some work in the UK on this, I'll try and track down a reference.  Also, I'm sure I read that London's Transport Museum are going to incorporate user-generated content back into their collections management system but I can't find it again.</description>
		<content:encoded><![CDATA[<p>You probably need three labels for tags - institutional, user-generated, and machine-generated (for things like OpenCalais).</p>
<p>There&#8217;s some work in the UK on this, I&#8217;ll try and track down a reference.  Also, I&#8217;m sure I read that London&#8217;s Transport Museum are going to incorporate user-generated content back into their collections management system but I can&#8217;t find it again.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Phil Getchell</title>
		<link>http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-556</link>
		<dc:creator>Phil Getchell</dc:creator>
		<pubDate>Wed, 16 Apr 2008 16:41:34 +0000</pubDate>
		<guid isPermaLink="false">http://blogs.walkerart.org/newmedia/2008/04/16/master-metadata/#comment-556</guid>
		<description>Hi,



You are on the right track, as far as I can tell.  We need to plan for the handling of non-authoritative "categories" of meta data.  I think we all agree on the potential for tags, comments, and other user-contributed data to add value - bolstering folksonomy, adding to scholarship, helping to create meaning, etc.  But, I have not done the work of planng for the handling of this.  This time constraint also keeps us from doing the important work of creating our own thesaurus, lexicon, or semantic framework - the kind of foundational requirement for using delphi you elude to.



I'll admit to hoping we will work together to build these blocks.  I feel that such things are finally emerging as useful results of so many ongoing projects.  Also, we need to pay close attention to the methods and tools that the big players, like Flickr, Yahoo!, Google, etc.



more later, but very cool thread, thanx,



- Phil</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>You are on the right track, as far as I can tell.  We need to plan for the handling of non-authoritative &#8220;categories&#8221; of meta data.  I think we all agree on the potential for tags, comments, and other user-contributed data to add value - bolstering folksonomy, adding to scholarship, helping to create meaning, etc.  But, I have not done the work of planng for the handling of this.  This time constraint also keeps us from doing the important work of creating our own thesaurus, lexicon, or semantic framework - the kind of foundational requirement for using delphi you elude to.</p>
<p>I&#8217;ll admit to hoping we will work together to build these blocks.  I feel that such things are finally emerging as useful results of so many ongoing projects.  Also, we need to pay close attention to the methods and tools that the big players, like Flickr, Yahoo!, Google, etc.</p>
<p>more later, but very cool thread, thanx,</p>
<p>- Phil</p>
]]></content:encoded>
	</item>
</channel>
</rss>
