Collaborative Librarians

Data don't tell the whole story.

Is “Big Science” better? February 4, 2011

Filed under: Uncategorized — Betsy Rolland @ 6:00 am

I attended a seminar on Monday (1/31/11), hosted by Sage Bionetworks, called Establishing a ‘TCP/IP’ for Human Biology: A Summit on Human Data Interoperability. There were several interesting presentations and intriguing ideas presented. But I left feeling vaguely dissatisfied. So much money is being invested in building huge data repositories of related (and sometimes unrelated) data, but is it really the best way forward? Is there evidence that the best way to answer questions about human health is through large-scale genetic analyses? How do we know the science is good when data are stripped of their context and dumped into a repository? Is personalized medicine really achievable and worthwhile?

As the funding situation gets more and more difficult, is spending billions on large data repositories more cost-effective than focusing on smaller projects? Would it be better to focus more on prevention and less on curing preventable diseases? In short, is Big Science really better?

 

CI Article: “Coordination costs and project outcomes in multi-university collaborations” February 3, 2011

Filed under: CI Article,Coordinating Centers,Information Problems,Librarians — Betsy Rolland @ 7:00 am

Cummings, J. N. and S. Kiesler (2007). “Coordination costs and project outcomes in multi-university collaborations.” Research Policy 36(10): 1620-1634.

I stumbled across this article while making my way through last year’s Science of Team Science Conference. I listened to Jonathon Cummings present an overview of some of what’s in this paper. I highly recommend getting a copy of the paper if you’re at all interested in supporting collaborative research. While I agree with the authors that their results aren’t necessarily generalizable to all domains (they focused on a single grant program in area of interdisciplinary IT research and education), I appreciate the focus on coordination. It seems as though funding agencies and even the institutions themselves underestimate the difficulties inherent in multi-institutional collaborative research. Adding in the complexity of interdisciplinarity and coordination gets even more difficult.

This is one of the first articles I’ve seen that correlates successful research to specific activities of the collaboration, such as co-authorship, student exchanges, having a web portal and email lists, etc. I think the suggestion that Cummings and Kiesler make at the end, that perhaps all large collaborations should first have a small exploratory grant to support the development of the collaboration, is an excellent one. Such support would allow groups to work together to develop trust and establish a group identity. Cummings and Kiesler also suggest that funding agencies invest in training scientists on *how* to collaborate and coordinate large research projects. I would argue that this would be an excellent task for the institutions themselves to take on, in coordination with funding agencies. I would also argue that this is yet another area where librarians, with expertise in user needs assessment and community development, could make a huge impact.

 

Science of Team Science Conference February 2, 2011

I registered recently to add the 2011 Science of Team Science Conference, hosted by Northwestern University’s NUCATS Institute and its Research Team Support & Development office. I couldn’t be more excited. I wasn’t able to attend last year’s conference because of my heavy travel schedule for the SLA research grant, so I’m thrilled to be able to attend this year.

I’ve been virtually attending last year’s conference via the PPT and MP3 recordings they’ve posted for each session. This is a treasure trove of information and worth perusing. I’ve listened to several presentations so far and have read the minutes, which are well done and really capture the essence of each conversation. They even captured the Q&A sessions!

I think it will be especially interesting to attend in my dual role as social science researcher and practitioner, as this doesn’t seem to be very common. I have to admit, I’ve been a little disappointed about the lack of discussion about libraries, librarians or even information management. I may submit a poster on that topic, just to make sure it makes it onto the radar.

 

CI Article: “New Knowledge from Old Data : The Role of Standards in the Sharing and Reuse of Ecological Data” February 1, 2011

Filed under: CI Article,Curation,Cyberinfrastructure,Data,eScience — Betsy Rolland @ 7:00 am

Zimmerman, A. (2008). “New Knowledge from Old Data.” Science, Technology, & Human Values 33(5): 631-652.

Zimmerman interviewed 13 ecologists about their use of secondary data (i.e., data they did not collect themselves) in order to tease out the role standards might play in the process of re-using data for new analyses. She found that the primary determinant in an ecologist’s decision to use the data was the researcher’s own ability to understand the data. This understanding was heavily contingent upon the researcher’s field experience and knowledge of collecting similar data. If the ecologist considered the data to be generally difficult data to collect or the kind of data that was frequently poorly understood, the data were not used. A second consideration was the reputation of the data collectors themselves or a personal relationship with the data collectors.

Zimmerman concludes that standards, while potentially useful, would be difficult to develop because the collection of data is so context-dependent. In short, the research questions determine how the data are collected and which data points are important. It would be a staggering task to try to develop standards that would cover every context and approach. Even if that were possible, science moves so quickly that the standards would likely be obsolete by the time they were approved.

There was no mention in this article about the potential for others to help with the curation or development of understanding of the data. Does the individual investigator need to be involved or is this a question that can be delegated to graduate students or a data manager? Was it a collective decision or one made by the lead researcher? The participants described a process of repeatedly going back to the journal article where the secondary data are described. I would have liked to know more about what types of information they were looking for when they did that. Which types of contextual information were most important to them? Could they even tell us or is that another form of tacit knowledge they find difficult to articulate?

 

 
Follow

Get every new post delivered to your Inbox.