{"id":182,"date":"2017-03-15T15:22:40","date_gmt":"2017-03-15T15:22:40","guid":{"rendered":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/?p=182"},"modified":"2025-02-26T13:18:20","modified_gmt":"2025-02-26T13:18:20","slug":"automated-concept-keyword-generation-for-journal-articles-in-repositories","status":"publish","type":"post","link":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/2017\/03\/15\/automated-concept-keyword-generation-for-journal-articles-in-repositories\/","title":{"rendered":"Automated concept \/ keyword generation for journal articles in repositories"},"content":{"rendered":"<p>I was checking out a research paper in the ACM Digital Library: <a href=\"http:\/\/dl.acm.org\/citation.cfm?doid=2815400.2815411\">SibylFS: formal specification and oracle-based testing for POSIX and real-world file systems<\/a>\u00a0and spotted something interesting: a &#8220;Concepts in this article&#8221; drop-down &#8220;Powered by <a href=\"https:\/\/en.wikipedia.org\/wiki\/Watson_(computer)\">IBM Watson<\/a>&#8221;\u00a0For more on Watson, see <a href=\"https:\/\/learning.acm.org\/webinar\/lally.cfm\">IBM Watson: Beyond Jeopardy! Q&amp;A<\/a><\/p>\n<p>I&#8217;ve been thinking what calls to action a user in our audience might be interested in when they land on a page for a research manuscript in our <a href=\"https:\/\/lra.le.ac.uk\/\">repository<\/a>.<\/p>\n<p>One thing they might want is <strong>show me what this is about<\/strong>\u00a0or <strong>find other articles similar to this paper\u00a0<\/strong>following a thread of interest. A semi-automated way of doing this would be to offer some concepts to choose from. This can shade into data mining, such as <a href=\"http:\/\/openminted.eu\/\">OpenMinTeD<\/a>.<\/p>\n<p>Other automated clustering and keyword \/ concept generation techniques exist. I&#8217;ve only ever seen one attached to a repository (<a href=\"https:\/\/www.elsevier.com\/solutions\/elsevier-fingerprint-engine\">Elsevier Fingerprint Engine<\/a>), I&#8217;ve seem demonstrations of <a href=\"http:\/\/apidemo.pingar.com\/Taxonomy.aspx\">Pingar\u00a0<\/a>and <a href=\"http:\/\/www.flax.co.uk\/blog\/2012\/06\/12\/clade-a-freely-available-open-source-taxonomy-and-autoclassification-tool\/\">Clade (2012 blog)<\/a>\u00a0and I know of some interesting recent commercial work in the area.<\/p>\n<p>I wonder why we don&#8217;t see concepts pulled out automatically from repositories and offered to users for navigation. Is it too difficult, too expensive or just not helpful to users?<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I was checking out a research paper in the ACM Digital Library: SibylFS: formal specification and oracle-based testing for POSIX and real-world file systems\u00a0and spotted something interesting: a &#8220;Concepts in this article&#8221; drop-down &#8220;Powered by IBM Watson&#8221;\u00a0For more on Watson, see IBM Watson: Beyond Jeopardy! Q&amp;A I&#8217;ve been thinking what calls to action a user [&hellip;]<\/p>\n","protected":false},"author":88,"featured_media":185,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[34],"tags":[2,26,22],"class_list":["post-182","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-audience-workflow","tag-leicester-research-archive","tag-tools","tag-usability"],"_links":{"self":[{"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/posts\/182","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/users\/88"}],"replies":[{"embeddable":true,"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/comments?post=182"}],"version-history":[{"count":2,"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/posts\/182\/revisions"}],"predecessor-version":[{"id":186,"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/posts\/182\/revisions\/186"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/media\/185"}],"wp:attachment":[{"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/media?parent=182"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/categories?post=182"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/staffblogs.le.ac.uk\/researcharchiving\/wp-json\/wp\/v2\/tags?post=182"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}