Reporting from the 2013 International Conference on Trends for Scientific Information Professionals Conference (ICIC)

The 2013 International Conference on Trends for Scientific Information Professionals Conference (ICIC) was held last week in Vienna, Austria. More than 120 patent and scientific information professionals, from around the world, gathered to hear three days worth of presentations on the latest trends in field. For those of you who have not heard of the ICIC before, the following is a description from the organizers:

For those in the sci-tech world, sharing best practices, networking and evaluating trends have become matters of considerable importance, and the annual ICIC Meeting & Exhibition is the significant forum for this, attracting attendees from the main sci-tech companies and information, service or software providers.

The traditional focus of the ICIC programmes is subjects of interest to information professionals in a scientific or technical environment. As in previous years, this year’s programme contains a number of presentations examining aspects of patent information retrieval, including drug discovery; patent information presents many challenges and is thus an excellent test-bed when it comes to information technology. The ICIC meeting’s roots in chemical information also ensure that that area features prominently; chemical information was one of the first fields to attract modern information technologies back at the end of the nineteenth century and the very beginning of the twentieth.

I was pleased to participate in the conference this year and at the last minute changed the topic of my presentation from a tactical one on using IBM’s Many Eyes in patent analytics to something a little more philosophical. The new title was A Sea Change is Coming to Patent Analytics – Brought to You by Big Data. A link to my slides from SlideShare can be found below:

As I was putting the original presentation together I was reminded how frequently, especially in the last two to three years, the tools and techniques of Big Data have been applied to the field of patent analytics. I see this trend continuing as the organizations that build databases and search/analysis systems for the field start to incorporate new data structures and multi-processor processes into the tools they provide. I also see the potential for patent analysts to observe what is being done by data scientists working in other fields and applying these techniques, and the tools associated with them, to the work we do. Since the ICIC has always been about forward-thinking ideas, and trends, I felt that the revised topic would fit better with the theme of the meeting.

There were a few of us tweeting notes during the meeting so for those of you who were unable to attend, but are interested in what transpired, a list of the presentations provided on each day, along with the corresponding tweets is provided. A similar format was used in the previous post covering the PIUG NE Conference. Links to download all of the presentations, or view them on SlideShare can be found at the URLs below:

http://www.haxel.com/icic/2013/Programme/monday-14-oct-2013

http://www.haxel.com/icic/2013/Programme/tuesday-15-oct-2013

http://www.haxel.com/icic/2013/Programme/wednesday-16-oct-2013

At the conclusion of this post there are also tweets from the product reviews associated with the meeting. Those of you who are interested in what is new and exciting from the vendors associated with this industry will likely be interested in the items covered in this section.

Day 1 – from last to first presentation

Wolfie Christl – Cuteacute – Personal Data in the Digital Age – Big Brother on Steroids?

Arne Krueger ‏@herrkrueger 14 Oct

@datadealer about personal data now: “much more people should know, how it works” full ack, thank you for this amazing presentation! #icic13

Anthony Trippe ‏@atripper 14 Oct

Christl – a surprising amount of personal data is being collected & made available 4 sale to government & insurance agencies #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Christl: very unusual talk for ICIC #icic13

Anthony Trippe ‏@atripper 14 Oct

Christl – what is to be done with safeguarding personal data and keeping control of your digital self #icic13

Sebastian Radestock – Elsevier – Making Hidden Data Discoverable: How to Build Effective Drug Discovery Engines

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Radestock demos Roche system #icic13

Anthony Trippe ‏@atripper 14 Oct

Radestock – Roche Elec Lab Notebook is also connected to Reaxys data through API but kept separate from in-house data #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

PubChem and eMolecules integration in Reaxys was done by fed model and API #icic13

Anthony Trippe ‏@atripper 14 Oct

Radestock – Warehouse model allows nearly comprehensive view of chem space involving in-house & published substs in 1 view #icic13

Anthony Trippe ‏@atripper 14 Oct

Radestock – Warehouse approach requires an extract transfer & load process involving standardization & normalization of data #icic13

Anthony Trippe ‏@atripper 14 Oct

Radestock – In federated model Reaxys API can be used to retrieve property and reaction data associated with chemical lookup #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Now he is listing lessons learned from Reaxys  API #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Reaxys Tree and automatic indexing #icic13

Anthony Trippe ‏@atripper 14 Oct

Radestock – Chemical integration for in-house projects can be done either via a federated or a warehouse based model #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Sebastian Radestock of Elsevier on effective drug discovery engines #icic13

Josef Eiblmaier – InfoChem – Extraction of structure info from ChemDraw CDX file: Easy, or an underestimated, difficult challenge

Anthony Trippe ‏@atripper 14 Oct

Eiblmaier – Most pubs start with ChemDraw files – it makes sense to work with originally submitted data from publishers #icic13

Anthony Trippe ‏@atripper 14 Oct

Eiblmaier – InfoChem also working with Markush structures to provide enumeration output for these systems #icic13

Anthony Trippe ‏@atripper 14 Oct

Eiblmaier – a hybrid approach using both an algorithmic & templating approach gives the best results when intreprretting CDX files #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

More laughs. Authors are very inventive. Not the fault of the software #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Audience laughs out loud at some examples #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

CDX files are optical illusion. “Perfect” layout very deceiving #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Now Josef Eiblmaier of InfoChem talking about challenges with ChemDraw files #icic13

Jane List – Extract Information – Untangling the scientific information web

Anthony Trippe ‏@atripper 14 Oct

@WendyAnneWarr – I should have remembered that Scirus has always been Elsevier, thought that sounded odd #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Someone asks “Is GoogleScholar available in China?” #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

@atripper Scirus has always been Elsevier #icic13

Anthony Trippe ‏@atripper 14 Oct

List – Scirus will be closing in 2014 as it was acquired by Elsevier, who also purchased Mendelev #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

#icic13 Jane List says Elsevier scirus is closing in 2014

Anthony Trippe ‏@atripper 14 Oct

List – Microsoft Academic Search – a new entrant to scholarly pubs field – has some useful networking & citation functions #icic13

Anthony Trippe ‏@atripper 14 Oct

List – Google Scholar & Esp@cenet – becoming de facto standards 4 searching #patent & non-patent lit using open source sites #icic13

Helmut Berger – max.recall – Open Source Search

Anthony Trippe ‏@atripper 14 Oct

Berger – both commercial and open source tools have advantages and disadvantages when deciding on an enterprise search tools #icic13

Anthony Trippe ‏@atripper 14 Oct

Berger – Many Big Data tools are available for searching large collections including elasticsearch, solr and Lucene #icic13

Colm Carroll – Innovative Medicines Initiative – European Lead Factory – A unique public-private partnership

Anthony Trippe ‏@atripper 14 Oct

Carroll – Submitting participants agree to provide exclusive access for items to Lead Factory for three years #icic13

Anthony Trippe ‏@atripper 14 Oct

Carroll – scaffolds can be submitted but the expectation is that a library of at least 500 compounds can be made #icic13

Anthony Trippe ‏@atripper 14 Oct

Carroll – Quantitative Estimate of Drug-Likeness (QED) was used to evaluate the screening deck available from IMI #icic13

Anthony Trippe ‏@atripper 14 Oct

Carroll – building a compound library as well as a screening center for lead discovery for Europe #icic13

Anthony Trippe ‏@atripper 14 Oct

Carroll – IMI developing Open Innovation model for Lead Discovery to be implemented by European Pharma companies #icic13

ChemConnector ‏@ChemConnector 14 Oct

Colm Caroll is up to discuss the European Lead Factory #icic13, an Innovative Medicines Initiative project

Alex Drijver – ChemAxon – The Pistoia Alliance and HELM – An open info standard for molec representation of bioloogics

ChemConnector ‏@ChemConnector 14 Oct

@egonwillighagen HELM as Open Source project http://www.openhelm.org/Open_Source_project … #icic13

Egon Willighagen ‏@egonwillighagen 14 Oct

not fully open yet, no right to redistribute RT @ChemConnector: Large molecule representation…solved. HELM http://www.openhelm.org/  #icic13

ChemConnector ‏@ChemConnector 14 Oct

#Chemaxon is building a registration system based on #HELM  #icic13

ChemConnector ‏@ChemConnector 14 Oct

Large molecule representation…solved. HELM http://www.openhelm.org/  #icic13

Anthony Trippe ‏@atripper 14 Oct

Drijver – HELM – Hierarchical Editing Language for Macromolecules – a standard to annotate and describe biomolecules #icic13

Antony Williams – Royal Society of Chemistry – Big Data Challenges Assoc with Building a National Data Reposit for Chemistry

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

@ChemConnector #icic13 good session this morning. Lots of questions. Could not tweet – was in chair

ChemConnector ‏@ChemConnector 14 Oct

Big data challenges associated with building a national data repository for chemistry http://www.slideshare.net/AntonyWilliams/big-data-challenges-associated-with-building-a-national-data-repository-for-chemistry … via @SlideShare #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

@ChemConnector Tony said 10 to twentyseventyseventh is brontobyte

Arne Krueger ‏@herrkrueger 14 Oct

@ChemConnector great presentation about your amazing work, thank you very much! #icic13

Anthony Trippe ‏@atripper 14 Oct

Williams – allowing elec lab notebooks to connect directly to a national repository to both supply & receive chemical data #icic13

Anthony Trippe ‏@atripper 14 Oct

Williams – Plum Analytics & other providers of AltMetrics data are providing individual reviews of individual contributions #icic13

Anthony Trippe ‏@atripper 14 Oct

Williams – AltMetrics is more than citations but helps monitor scientific impact and contributions #icic13

Anthony Trippe ‏@atripper 14 Oct

Williams – RSC National Chemical Database will allow scientists to upload own data into repository with sharing options #icic13

Anthony Trippe ‏@atripper 14 Oct

WIlliams – RSC creating a National Chemical Database Service for UK Academics contains chemical info including properties #icic13

Anthony Trippe ‏@atripper 14 Oct

Williams – 95% of his personal scientific discoveries are never published and currently are not available for public use #icic13

Uwe Rosemann – German National Library of Sci and Tech – Text and Non-textual Objects: Seamless Access for Science

Anthony Trippe ‏@atripper 14 Oct

Rosemann – image to structure indexing is also being conducted to capture chemical structures from submitted sources #icic13

Anthony Trippe ‏@atripper 14 Oct

Rosemann – Developing portal for audiovisual media – will attempt to recognize scenes, speakers & topics covered in video #icic13

Anthony Trippe ‏@atripper 14 Oct

Rosemann – Scientific research data from the internet is being made available through the DataCite tool with DOIs #icic13

Anthony Trippe ‏@atripper 14 Oct

Rosemann – Scientific and technical information is available in formats other than text & these should also be considered #icic13

Day 2 – from last to first presentation

Kuramitla Krishnaveni – Molecular Connections – Enriching Content with Semantic Tagging

Anthony Trippe ‏@atripper 15 Oct

Krishnaveni – semantic tagging also provides automatic expansion of queries with associated name synonyms #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Krishna “leveraging” (yuk) linked data #icic13

Anthony Trippe ‏@atripper 15 Oct

Krishnaveni – content enrichment adds value to articles for users and involves a multi-step process to complete #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Krishna: publishers quick to adopt content enrichment. Why not enrich patent content too? #icic13

Anthony Trippe ‏@atripper 15 Oct

Krishnaveni – many publishers are looking to enrich their full-text documents by adding semantic tags to the articles #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Krishna: 1000 people at Molecular Connections now #icic13

Nicolas Lalyre – Syngenta – Do Indexing Systems of Bibliographic Databases meet Today’s Users Needs & Expectations

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Paul Peters (CAS) says journal articles cry out “find me, find me”, patents do not #icic13

Anthony Trippe ‏@atripper 15 Oct

Lalyre – Claim specific indexing would be tremendously helpful for conduction Freedom-to-Operate searching #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Lalyre: increased number of FTO searches because of life cycle management so deep indexing necessary #icic13

Anthony Trippe ‏@atripper 15 Oct

Lalyre – Indexing for mixtures composed of items from two different lists would be very helpful for #patent searching #icic13

Anthony Trippe ‏@atripper 15 Oct

Lalyre – indexing associated with different family members within #patent family should also be performed #icic13

Anthony Trippe ‏@atripper 15 Oct

Lalyre – no additional indexing for #patent documents as they change over time is one of the areas for improvement #icic13

Anthony Trippe ‏@atripper 15 Oct

Lalyre – There are many positive attributes associated w/ indexing systems for #patent documents but there are ways to improve #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Lalyre: challenges specific to patents are broad claims, laundry lists, legal wording #icic13

Anthony Trippe ‏@atripper 15 Oct

Lalyre – Lack of precision is causing the number of documents retrieved by searching to skyrocket #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Lalyre will address indexing for patent publications, but not quality or biblio issues #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Nicolas Lalyre replaces Gerhard Fischer for Syngenta talk #icic13

Jutta Hausser – EPO – Venturing off the Beaten Track: Challenges of patent information from Arabian Countries

Anthony Trippe ‏@atripper 15 Oct

Hausser – providing a country by country profile of #patent systems and information available for Arabian countries #icic13

Anthony Trippe ‏@atripper 15 Oct

Hausser – GCC #patents are subject to post-grant review type proceedings #icic13

Sumair Riyaz ‏@Sumairriyaz 15 Oct

#Patent application is published in a local newspaper in GCC area, #Qatar. Welcome to conducting patent searches via newspapers. #icic13

Anthony Trippe ‏@atripper 15 Oct

Hausser – Many Arabian #patents are filed using GCC or PCT applications – GCC – Gulf Cooperation Council #icic13

David Milward – Linguamatics – Unstructured Text in Big Data: the Elephant in the Room

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Milward: challenge of mining twitter. Flu shots example #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Milward: pattern matching chemicals using ChemAxon N2S software #icic13

Anthony Trippe ‏@atripper 15 Oct

Milward – structures can be distinguished using text mining such as claimed vs exemplified #icic13

Anthony Trippe ‏@atripper 15 Oct

Milward – extracted data can be used to create facets to allow for guided navigation within information filtering and retrieval #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Milward: text mining to annotate concepts to feed into enterprise search  engine #icic13

Anthony Trippe ‏@atripper 15 Oct

Milward – new analytical views can be created by using the data extracted from unstructured text which is now structured #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Milward: embed I2E within Web apps #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Milward: use successful queries in Pipeline Pilot or KNIME #icic13

Anthony Trippe ‏@atripper 15 Oct

Milward – workflow approach can be used to insert structured data from text mining into alerting systems #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Milward: extracting data for a relational database or for a semantic store; or an ad hoc approach #icic13

Anthony Trippe ‏@atripper 15 Oct

Milward – started with protein to protein interactions but now there are many more applications of semantic analysis #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Milward: increasing range of applications for Linguamatics I2E #icic13

Anthony Trippe ‏@atripper 15 Oct

Milward – As more text data is created processes need to be developed to make unstructured data structured #icic13

Jan Baur – FIZ Karlsruhe – The Concept of the New STN Platform

Anthony Trippe ‏@atripper 15 Oct

Baur – will be adding patent family sort across multiple sources feature to the new platform with future release #icic13

Anthony Trippe ‏@atripper 15 Oct

Baur – information professionals wanted to see three items in the interface: query, results and history #icic13

Anthony Trippe ‏@atripper 15 Oct

Baur – New platform is web-based, parallel processed, with pre-compiled analytics and a project based workflow #icic13

Ricardo Elto-Brun – Univ Carlos III de Madrid – Dissemination Patterns of Tech Knowled in IR Industry: Scientometric Analysis

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Eito-Brun: university innovation should not be measured by learned journal articles alone #icic13

Anthony Trippe ‏@atripper 15 Oct

Eito-Brun – Can the h-index be used to study company impact based on citations? G-index or other impact factors may work better #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Eito-Brun has also used H index and G index #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Eito-Brun: impact graphs show impact of MS on IBM and vice versa. #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Eito-Brun: evolution of the most influential; the rise of Microsoft #icic13

Anthony Trippe ‏@atripper 15 Oct

Eito-Brun – applying Bradford’s Law and bibliometric impact factors to the area of patent citations – interesting #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Eito-Brun Productivity and impact shows IBM the leader; MS and Xerox joint second #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Eito-Brun: Bradford’s law applied to authors, univs etc. zone 1 has five cos; zone 5 has many #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Eito-Brun; IBM Xerox Microsoft Oracle the most active in text mining #icic13

Anthony Trippe ‏@atripper 15 Oct

Eito-Brun – Looked at citations associated with clustering and classification as a subset of text mining using US patent data #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Eito-Brun: growing number of patents on text mining since 2005 #icic13

Anthony Trippe ‏@atripper 15 Oct

Eito-Brun – MIT book on patent citations by Adam Jaffe and Manuel Trajtenberg published in 2003, need to have a look #icic13

Anthony Trippe – Patinformatics, LLC – A Sea Change is Coming to Patent Analytics – Brought to You by Big Data

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Trippe: Many network analysis tools. R, weka,  CytoScape, Sci2, Many Eyes, etc. #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Trippe lists all the things you can do with OpenRefine. He will blog #icic13.

Arne Krueger ‏@herrkrueger 15 Oct

‘Patent searcher will shortly be coders again with R and OpenRefine’ says @atripper at #ICIC13 and I agree

Favorited by Mustafa Çakır

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

R is a de facto standard and you HAVE to use OpenRefine #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

New STN platform is built on hadoop. Leading edge. See his blog post. Allows” impossible” Markush search #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Trippe: well over 80 m patents, 70 m substances #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Trippe refers to Ursula Schoch Grubler’s talk on maggots in the garden #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Tony Trippe: more broad title. Sea change brought about by big data #icic13

Monika Hanelt – Agfa Graphics – Why Networking Organizations are so Valuable in Patent Info – Together we are Strong

Anthony Trippe ‏@atripper 15 Oct

Hanelt – Global Legal Status project is an PDG initiative with WIPO & EPO to ensure consistent legal status data within databases #icic13

Anthony Trippe ‏@atripper 15 Oct

Hanelt – new CPC classification system is an example of positive benefits of collaboration #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Cooperative Patent Classification. is a success story #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

trilateral attempts to certify PI professionals #icic13

Anthony Trippe ‏@atripper 15 Oct

Hanelt – The largest patent issuing authorities are also working together to remove duplication between the work of the offices #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Monika Hanelt opens ICIC2013 second day. There are many patent inf user groups in Europe. One is PDG #icic13

Anthony Trippe ‏@atripper 15 Oct

Hanelt – Worldwide patent info networking organizations are working together on important items like certification #icic13

Anthony Trippe ‏@atripper 15 Oct

Hanelt – There are patent information networking organizations all over the world, the oldest of which is likely the PDG #icic13

Day 3 – from last to first presentation

Sumair Riyaz – Dolcera – Finding the Best Patents in your Portfolio

Sumair Riyaz ‏@Sumairriyaz 16 Oct

@atripper  reason; its easy to automate valuing objective parameters and a little difficult to automate valuing subjective #icic13 (2 of2)

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Riyaz: case study of defensive etc. value, invention value index #icic13

Anthony Trippe ‏@atripper 16 Oct

Riyaz – #patent value by looking at strategic alignment, def/off value, subjectiv measures, patent value index & part of standard #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Riyaz lists value indicators for mining model that handles the “gems”  #icic13

Anthony Trippe ‏@atripper 16 Oct

Riyaz – most #patent valuation systems use subjective measures – objective items might also be helpful #icic13

Favorited by Kenichi Yamamura

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Riyaz: Execution and maintenance of your IP strategy. M&A, CI, knowledge management etc. #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Riyaz: classification of patent portfolio into strategic focus area #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Riyaz: Level 4 companies have strategic IP plan #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Riyaz: finding your strategy focus through SWOT analysis. Maturity of strategy model has 4 levels #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Riyaz of Dolcera: 1999 Univ Delaware patent reassigned to … then reassigned to Apple. Could you find the original gem? #icic13

Daniel Bonniot – ChemAxon – Towards Automated Mining of Chemical Structures in Chinese Patents

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Bonniot does not have precision and recall figures for name entity recognition in 2108 patents. Maybe 50% recall? #icic13

Anthony Trippe ‏@atripper 16 Oct

Bonniot – Critical to create name to structure systems with native Chinese as opposed to extract from English  translation #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Bonniot: 2nd validn on 2108 patents, v. few records have name match in Chinese and English; almost half Chinese names have OCR error #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Bonniot: 38,600 Chinese names + CAS RNs in test set converted at rate 59-79% with 91% accuracy #icic13

Anthony Trippe ‏@atripper 16 Oct

Bonniot – First validation has a conversion rate of 60-80% with 91% accuracy on average #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Bonniot: Document to Database makes chem struct database from files, patents, articles, Documentum etc. #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Bonniot: interesting OCR errors from Chinese characters; lack of spaces also a challenge #icic13

Anthony Trippe ‏@atripper 16 Oct

Bonniot – Chinese language required special algorithms for identifying chemical names and convert to a structure #icic13

Favorited by Kenichi Yamamura

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Bonniot: can search structures across multiple patents in different languages #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Bonniot: convert chem names to structs; make database; can do structure searches, predict properties #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Bonniot: enormous growth in number Chinese patents #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Daniel Bonniot: mining chemical structures in Chinese patents #icic13

Kim Zwollo – RightsDirect – Use of Professional Information and Copyright Compliance

Anthony Trippe ‏@atripper 16 Oct

Zwollo – Copyright compliance will have a significant impact for allowing text miners to obtain source material legally #icic13

Marc Tobias – BGW – Assessing Qual of #Patents Based on Quantitative Approaches – the St. Gallen Patent Index

Anthony Trippe ‏@atripper 16 Oct

Tobias – main indicators are technology, market and portfolio based using citations, GDP & relative share of corporate portfolio #icic13

Richard Garner – LexisNexis – Are there any Frontiers Left in Patent Data?

Anthony Trippe ‏@atripper 16 Oct

Garner – Access to Licensing content is also a critical need and is different than #patent assignment #icic13

Anthony Trippe ‏@atripper 16 Oct

Garner – Litigation and access to data associated with them is also a significant challenge for information retrieval #icic13

Anthony Trippe ‏@atripper 16 Oct

Garner – Legal Status is still a significant issue and there is a dire need for standardization and proper reporting #icic13

Anthony Trippe ‏@atripper 16 Oct

Garner – While progress made regarding full-text #patent data there are still significant portions of world were it is not avail #icic13

Tweets from Others

Arne Krueger ‏@herrkrueger 16 Oct

Meine Highlights der #ICIC13 #tagebuch http://wp.me/pg0ze-2zi

Review of the conference – in German

Egon Willighagen ‏@egonwillighagen 16 Oct

yeah, I’ve reached Level 4! RT @WendyAnneWarr: Riyaz: Level 4 companies have strategic IP plan #icic13

Wendy Anne Warr ‏@WendyAnneWarr 16 Oct

Correction to typo in my post yesterday: Alex Allardyce claims ChemAxon to be # 1 global cheminformatics company #icic13

Sumair Riyaz ‏@Sumairriyaz 15 Oct

@atripper would be interesting to know if comparing documents that have images and structure based information is possible. #Icic13

Sumair Riyaz ‏@Sumairriyaz 15 Oct

@WendyAnneWarr @atripper I wanted to let people know how we do it instead of making it a black box.

Cheers! #ICIC13

Chris Taylor ‏@chrisftaylor 14 Oct

MT @atripper @ChemConnector Williams – AltMetrics is more than citations but helps monitor scientific impact and contributions #icic13

Tweets from Product Reviews

Anthony Trippe ‏@atripper 15 Oct

GVK Bio – 500 informatics scientists available to work with customers on items such as structure activity relationships #icic13

Anthony Trippe ‏@atripper 15 Oct

Search Technology – Super Profiles uses templates to combine information and numerous analysis to create a custom report #icic13

Anthony Trippe ‏@atripper 15 Oct

Search Technology – focus on importing many sources, cleaning tools, statistical analytics and reporting #icic13

Anthony Trippe ‏@atripper 15 Oct

Averbis – working on patent analytics, medical records and company data using info extraction, analysis and delivery #icic13

Anthony Trippe ‏@atripper 15 Oct

Averbis – text analytics consultancy with professionals from biomedical and IT backgrounds #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Product review: Averbis at ICIC for the first time. Founded 2007. Based Freiburg #icic13

Anthony Trippe ‏@atripper 15 Oct

Intellixir – Easy Lixir is like a reader application with a simplified interface for sharing with users #icic13

Anthony Trippe ‏@atripper 15 Oct

Intellixir – Easy Lixir can be used to share results using a report building functionality #icic13

Anthony Trippe ‏@atripper 15 Oct

Intellixir – Data clustering and tag clouds can be used to categorize and analyze document collections #icic13

Linguamatics ‏@Linguamatics 15 Oct

RT @wendyannewarr: Milward: increasing range of applications for Linguamatics I2E #icic13

Anthony Trippe ‏@atripper 15 Oct

Stellarix – provide consulting services for patent search and analytics using multiple sources including financial data  #icic13

Anthony Trippe ‏@atripper 15 Oct

Parthys Reverse Informatics – many IP related services including search, claim charts, landscapes and trading center #icic13

Anthony Trippe ‏@atripper 15 Oct

Parthys Reverse Informatics – Answers are everywhere so now we are looking for good questions – Reverse Informatics #icic13

Anthony Trippe ‏@atripper 15 Oct

FIZ Karlsruhe – Numeric Property Search has been added to many of the STN Databases #icic13

Anthony Trippe ‏@atripper 15 Oct

FIZ Karlsruhe – calculated expiration dates in INPADOC help focus on “in-force” patents – based on more than 400 rules #icic13

Anthony Trippe ‏@atripper 15 Oct

CEPT – Document Fingerprints can be generated from concept fingerprints to compare documents to one another semantically #icic13

Anthony Trippe ‏@atripper 15 Oct

CEPT – Semantic Fingerprinting – allows direct semantic comparison & relationships between different concepts #icic13

Anthony Trippe ‏@atripper 15 Oct

CEPT – Cortical Engine for Processing Text – New Technology to improve natural language processing based on neuroscience #icic13

Anthony Trippe ‏@atripper 15 Oct

Wiley – Smart Article extracts chemical substances from journal articles and provides them in a ribbon at bottom of article #icic13

Anthony Trippe ‏@atripper 15 Oct

Wiley – RxnFinder provides many tools for performing retro-synthetic analysis for chemistry #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

InfoChem CLASSIFY classifies reactions in RxnFinder. Reaction schemes importnt feature of RxnFinder. Retrosynth tools #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Wiley: new platform also links with smart article.#icic13

Anthony Trippe ‏@atripper 15 Oct

Wiley – Smart Article initiatives will add many enhancements to the traditional notion of a journal article #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

Wiley product review. New platform, Wiley Science Solutions, has reaction srch, current protocols, spectral libraries.#icic13

Anthony Trippe ‏@atripper 15 Oct

CAS – SciFinder now has an API associated with it. Primarily used for companies to connect with CAS data #icic13

Wendy Anne Warr ‏@WendyAnneWarr 15 Oct

SciFinder product review. Lots of big numbers. Reaction search enhancements emphasized. And new API facilities. #icic13

Anthony Trippe ‏@atripper 15 Oct

CAS – Target and disease based indicators can be used in SciFinder as well as reaction data from many sources #icic13

Anthony Trippe ‏@atripper 15 Oct

CAS – SciFinder has been redesigned and improved, now also includes a JavaScript editor #icic13

Anthony Trippe ‏@atripper 14 Oct

GenomeQuest – 217 million sequences from 437,000 documents in 182,000 INPADOC families available in database #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Genome Quest product review now – IP sequence search. #icic13

Anthony Trippe ‏@atripper 14 Oct

ChemAxon – http://chemicalize.org  is an open source name to structure service where 60% of pages indexed are patents #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Biomol regn is for HELM – remember talk this morning #icic13

Anthony Trippe ‏@atripper 14 Oct

ChemAxon – Marvin for JavaScript now available as well as a Biomolecule Registration toolkit #icic13

Anthony Trippe ‏@atripper 14 Oct

ChemAxon – Markush search and enumeration with Thomson Reuters coming to the cloud in early 2014 #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

ChemAxon was ChemAxon still ChemAxon will always be ChemAxon #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Alex Allardyce – 15 years of ChemAxon #icic13

Anthony Trippe ‏@atripper 14 Oct

RightsDirect – solving common copyright challenges for companies of all sizes in the digital age #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

Dolcera: More a demo than a product review #icic13

Anthony Trippe ‏@atripper 14 Oct

Dolcera – offering a new patent to product mapping solution for studying companies and technologies #icic13

Anthony Trippe ‏@atripper 14 Oct

Minesoft – PatBase Analytics Module now works with up to 100,000 PatBase families including text clustering #icic13

Anthony Trippe ‏@atripper 14 Oct

Minesoft – Legal Status timeline takes INPADOC PRS data and provides family based status events and visualize the results #icic13

Anthony Trippe ‏@atripper 14 Oct

BizInt – Summary tables using Reference Rows allows data from multiple databases to be organized, cleaned and visualized #icic13

Wendy Anne Warr ‏@WendyAnneWarr 14 Oct

@bill_town @PistoiaAlliance SPRESIweb mobile app is on Pistoia @PistoiaAlliance app store #icic13

Anthony Trippe ‏@atripper 14 Oct

InfoChem – structure and reaction editor is now written in Javascript, synthesis planner has also been updated #icic1

Anthony Trippe ‏@atripper 14 Oct

Linguamatics – providing patent solution in the cloud using CLAIMS Direct data and ChemAxon functionality #icic13

Anthony Trippe ‏@atripper 14 Oct

Linguamatics – API available for connecting i2e to a variety of different services including Knime and Pipeline Pilot #icic13

Anthony Trippe ‏@atripper 14 Oct

LexisNexis – comparison tool creates a Venn diagram using up to three search queries and see the overlap between approaches #icic13

Anthony Trippe ‏@atripper 14 Oct

max.recall – users can upload their own collections to http://quantalyze.com  for analysis of numeric content #icic13

Anthony Trippe ‏@atripper 14 Oct

max.recall – Quantalyze – numeric data extracted from tables and text – makes it available for searching & visualization #icic13

 

Comments 4

Leave a Reply

Your email address will not be published. Required fields are marked *