DLF Fall Forum Notes
What is the going on in the future of DLF? (new members on p.6 of Rachel's presentation, p. 11 describes DLF's role compared to other conferences)
Continue to focus on the technological aspects of developing, sustaining, and federating digital libraries. DLF will promote standards, protocols, and best practices; it will evaluate and promulgate digital library projects and programs, and support digital architectures that most effectively allow for interoperable and extensible digital resources and tools.
Not attempt to build or develop projects in its name; rather, it will aggressively promote the digital production of its constituent institutional members, work with them to facilitate collaborations and cooperative efforts, and build a registry of projects and programs of wide interest and applicability.
Promote the concept of an international/ global digital library in which all major projects and future developments need to be understood as interrelated components. DLF will seek more partnerships with major libraries and institutions overseas.
work with the major funding agencies, private and public, to assist them in better understanding the interconnectedness of digital library development and to encourage them to fund projects in concert with one another to avoid redundancy and lost opportunities that can result from isolated, competitive funding schemes.
Build stronger ties to corporations, especially those investing heavily in digital resources and large-scale digital assets, and connecting the corporations to the communities that can best advise and complement the corporate investment.
Articulate qualities and characteristics of leadership in the coming decade, requisite for the changes and transformations ongoing.
http://www.clir.org/dlf/forums/fall2010/speakerslides.html
General Session Keynote highlighted promising projects:
Promising Projects
•WEST (distributed print journal repositories) - see attached word doc
•Digital Public Library of America (Harvard Robert Darnton)
•CRL print archives coordination
•HathiTrust
•2CUL
•OAPEN (EU academic publishing)
•CDL; Cal State
All of these projects also share an important tactical approach. They mitigate the concern of loss of individuality--loss of ‘brand’ in the common parlance--by keeping the level of tradition, history and idiosyncrasy of the institutions involved intact and building their interdependent alliances and collaborations within and among the services and programs that underpin research and teaching
Session #4 "Archiving" Digital Lives
Data Accessioner
library.duke.edu/uarchives/about/tools/data-accessioner.html
guiding principles - honor the unity of the collections, the needs of the donor for privacy, needs of the researcher, and the concern of the content with the digital context
JAVA based data recovery - GUI, mirror directory service, changes the last modified date to the original
First triage is to get the data off the hardware, forensic disk image - raw bit for bit, and make a logical copy
Change creation dates so copying is not good since it changes the context and relationships also the permissions change
Logical - capture deleted files
FRED- Forensic Recovery Embedded Device
Let the donor use the software
Hydra
p. 7/38 - Tailored applications and workflows for different content types, contexts and user interactions, A common repository infrastructure, Easily skinned UI, One body many heads
Technical Framework -Components
•Fedora provides a durable repository layer to support object management and persistence
•Solr, provides fast access to indexed information
•Blacklight, a Ruby on Rails pluginthat sits atop solrand provides faceted search & tailored views on objects
2010Technical Framework -Components
•Hydra Plugin, a Ruby on Rails library that provides create, update and delete actions against Fedora objects
•Services, providing discrete, reusable Lego bricks of functions (e.g., indexing, checksumming, MD transforms, etc.)
•Hydrangea, a web application that bundles all the RoRcomponents, user interaction and hooks to services into a single, adaptable package
•ActiveFedora, a Ruby gem for modeling, creating and managing Fedora objects
•Opinionated Metadata, tools for mapping domain specific vocabularies in application code to XML structures
•Solrizer, provides generic utilities for mapping content into a solrindex
•JavaScript Libraries, for Hydra-specific, rich user interactions with repository content
The Hydra project wiki: https://wiki.duraspace.org/display/hydra/The+Hydra+Project
VIRGO - UVA Manuscripts and images
Standford ETDs:https://lib.stanford.edu/sulair-news/electronic-theses-and-dissertations-available-searchworks-and-socrates-and-google-books
Stanford EEMS: http://lib.stanford.edu/everyday-electronic-materials-eems/about-eems
Code: http://github.com/projecthydra/hydrangea
Poster of interest -
Reading Session #4: Digital Content and Infrastructure Needs of Research Faculty
Led by Jennifer Schaffner, Susan Kroll, Mackenzie Smith
Links: A Slice of Research Life available at http://www.oclc.org/research/publications/library/2010/2010-15.pdfhttp://www.oclc.org/research/publications/library/2010/2010-15.pdf.
Scholarly Information Practices from the Online Environment available at http://escholarship.org/uc/item/0kr8s78vhttp://escholarship.org/uc/item/0kr8s78v.
MacKenzie's objections to Kroll's Report:
- How valid is this research at your individual institution (each university is a snowflake...)
- There is tension but still trying to move ahead even though administration is telling you that you can't
- Senior Faculty behave differently than rank and file researchers
- Mission and marketing - how do we market what we are already doing
Other notes:
Coeus - MIT http://coeus.org/
Faculty want help with: structuring their notes and datasets, negotiating publication licenses, funding requirements, authority work, customizable personal webpages, and keeping up to date with their iterations.
Review Ithaka's faculty survey
www.rin.ac.uk - patterns of info use and exchange case studies
"pre-print is a more authentic representation of their work" - Kroll's study found that this is the feeling of faculty
Staffing and Workflow Solutions:
http://www.clir.org/dlf/forums/fall2010/15DLF-UIowa.pdf
Sustainability and Revenue Models
for Online Academic Resources
An Ithaka Report
May 2008
http://www.clir.org/dlf/forums/fall2010/sca_ithaka.pdf
Reimagining METS An Exploration
http://www.clir.org/dlf/forums/fall2010/METSNextGeneration.pdf
U.S. Government Printing Office’s Federal Digital System (FDsys)
http://www.clir.org/dlf/forums/fall2010/FDsysFactSheet.pdf
Closing the Digital Curation Gap
http://www.clir.org/dlf/forums/fall2010/CDCGflyer.pdf
Twitter summary:http://wiffiti.com/screens/41933summarized: http://summarizr.labs.eduserv.org.uk/?hashtag=dlf2010
Themes of DLF wrap-up:
Digital Library Federation
- To focus on the technological aspects of developing, sustaining, and federating digital libraries:
- Promote standards, protocols, and best practices
- Evaluate and promulgate digital library projects and programs
- Support digital architectures that most effectively allow for interoperable and extensible digital resources and tools
- Facilitate relevant formal and informal education opportunities
Ideas covered -
Ideas Expressed
• Key Performance Indicators‐ develop a community of practice around assessment. #kpi
• Economic Case Studies #econcase
• Develop models for determining value, ROI, return on mission #valueassessment
• Small forum for digital archivists, working w/ electronic records in manuscript repositories, technologically neutral
solutions (esp microservices) #digarchforum
• Hot issues group, identify hot issues and share policies/plans/practices (one first one: data management plans) #dlfhot
• Microservices as way to migrate project‐>program (hackathon or similar) #mshack
• Facilitate cross domain collaboration (however) #collab
• Action Learning Groups for project managers, #dlfalg
• Talk about failures and brainstorm, #failstorm
• Developers talk and managers really listen #dlfdevtalk
• Emerging DL leaders #dlfprofdev
• Training session/workshop on software testing, good development practices, #devcraft
• Methods for counting DL objects in ARL etc environment #dlfstat
• 1/mo conf call with rotating convenor. 5 people talk for 10 mins on what they’re doing, #confcall
• Exhibit systems, how to disseminate content #accesssystems
• (make dlf.org resolve) lots better web site for DLF, #dlfweb
• tools dashboard, open source and homegrown to see what other inst’s are using #tools
• find way to allow more people to go to more sessions at the forum – don’t overlap so much. Record sessions for later
viewing. Collaborative scheduling/voting. #DLFPPC
• Supporting research methods for specific research communities (dig humanities, science data) #dlfdh, #dlfscience
• Exploring barriers to collaboration (strategic, organizational, technical) – how to convince administrators #collab
• How to deal with artificial grant cycles – allow to innovate within that structure #grantsuccess
Project Managers Meeting
Agile Project Management:
working software, customer collaborations, responding to change, individuals and interactions
Characteristics of agile:
plan as you go, feature-breakdown structure, user stories, release plan, story boards, deliver as you go, learn every iteration, adapt every thing, manage team
"Information radiators"
empowering and trusting people, providing a constant feedback
change happens frequently
requirements are not well-defined
new technology or public domain
customer isn't sure of what is desired
Delphine uses agile/scrum at Penn State - calls some members of her team "ingesters"
NCSU - refer to powerpoint for books to read
resources:
blog.mountaingoatssoftware.com
Books: http://library.mit.edu.ezproxyberklee.flo.org/item/001270790
http://mv.ezproxy.com.ezproxyberklee.flo.org/toc.asp?site=bbbga&bookid=8392
notes from Emily's presentation:
Product backlog is supposed to be prioritized. But library staff have a difficult time prioritizing issues against each other. They work best talking generally about what they think should be part of a given iteration. Then we negotiate with them about what we think we can accomplish. The IT product manager plays a large role in determining final priorities here.
Testing – “the weakest link”. For smaller projects w/o massive codebase, when is it faster and better to automate testing? No QA experts, so try to utilize product teams with regular updates throughout the cycle. In the end, developers and IT product managers have to do first pass at QA testing. Have not done a good job defining acceptance tests for each piece of functionality.
Our group is responsible both for real-time support and development. How to handle? Working toward documenting time spent on real-time support emergencies in JIRA and track how much effort spent on that vs. development. This feeds our estimation of how many hours are available for each person for each cycle; assume some time goes to support.
Timeboxing of efforts within a 6-week development cycle helps keep us from getting bogged down in large implementations that go on for months and months and keep team from working on other projects. Instead, focus on the outset on a simplified set of work that can be accomplished in 1 cycle and then re-evaluate institutional priorities at the end of that cycle. This reduces feeling of black box stall on any given project.
Users hate seeing nothing. Using these methods, more people have seen some results faster. Incremental improvements that happen regularly produce much more positive feedback and energy both within the development team and out to our customers in the library.
Adapt to changing priorities. For example, move our EZProxy Administrator tool to the top of the list as needed for Shibboleth integration. We would not have been able to do this without causing problems if we had been already committed to months of development and a huge feature list for ReservesDirect, our course reserves system. Constant re-evaluation also always priorities within a project to change – since we haven’t committed beyond the current cycle this is easy to accommodate.
Laura at Emory Library:
- Not have a set plan at the outset
- Testing in smaller chunks
- Not having to shield the developers from the stakeholders
p12/27 of pdf - "data dictionary" great idea - we should have that in DSpace/Dome Operations
Greenhopper & JIRA
Google docs & JIRA
Jira Studio...services in the cloud (http://www.atlassian.com/hosted/studio/). Cost is reasonable (free 30 day trial) and nobody has to be convinced to install the tools. It includes Greenhopper, Jira and Confluence...other options can be added.
My development team embraces agile, yet the management team prefers a plan-driven methodology. Resources I've found useful are _Agile Estimating and Planning_ <http://bit.ly/dyPv1S> and _The Software Project Manager's Bridge to Agility_ <http://bit.ly/b2RWGy>.
Happy reading, Barrie Howard, Project Manager, CACI-ISS, Inc., 202-507-0387
The Art of Project Management (http://oreilly.com/catalog/9780596007867/), has been a useful resource for us as well--not in the "template" sense, but as an orientation, especially for new project managers.
ProjectsatWork.com
http://www.projectsatwork.com/content/Articles/256677.cfm
http://www.jamasoftware.com/contour/
Zepheira uses Redmine for their ticketing system for the Recollection development. They call their system "Foundry," but it's Redmine underneath.
- Redmine http://www.redmine.org
- Feng Office http://www.fengoffice.com
- Open Workbench http://www.openworkbench.org
- OpenProj http://www.openproj.org
- ProjectPier http://www.projectpier.org
Starr, Joan. Basic Time Estimation (2010) http://www.slideshare.net/joanstarr/basic-time-estimation-3127687
Wrona, Vicki. "Project Risk Management: A Practical and Effective Approach." http://www.continuitycentral.com/feature0547.htm
Rick Johnson and Banu Lakshminarayanan of Notre Dame presented at Code4Lib Midwest ( http://wiki.code4lib.org/index.php/Midwest ) about their use of Taverna ( http://www.taverna.org.uk/ ) to build digitization workflows. It reminded me of SEASR ( http://seasr.org/ ), which has been promoted for building text processing workflows.