All posts by Phil Reed

About Phil Reed

Digital Humanities Project Officer and Data Specialist at The University of Manchester Library. Research Assistant at Alliance Manchester Business School, The University of Manchester.

Exploring digital collections at the first Digital Humanities Library Lab

A new pilot workshop, the first Digital Humanities Library Lab, ran on 3 March 2017. This engaging and informative cross-discipline event offered a dozen researchers the chance to explore and discuss new tools and digital text collections from The University of Manchester Library, inspiring the development of future Digital Humanities computational research methods.

Exploring digital collectionsThe afternoon comprised of three activities.

  1. Spelling and printing variations when searching Jisc Historical Texts
  2. Visualising themes in longform scholarly outputs using the JSTOR Topicgraph tool
  3. A UK-first, beginning to use an API to access previously unavailable content from Adam Matthew Digital’s Mass Observation

The workshop instructions are available online for all to view, and the Library is looking to run a similar event again in May. What would you like to see covered next? Please get in touch with DH@Manchester or the Library’s DH Project Officer Phil Reed directly, or leave a comment below.

Support and seedcorn funding for Faculty of Humanities researchers

The Digital Humanities Project Call 2016-2017 has just been announced. This year DH@Manchester are focusing on developing new projects in two specific areas:

  • innovative projects driving out of the Library’s extensive electronic collections
  • cutting-edge research which can be developed in partnership with colleagues in the School of Computer Science (including text mining, linked data, image processing, and data visualization).

The closing date is Wednesday, 22 March 2017. View the Project Call page for more information.

Rivers merging, metaphor of data

Library data-merging pilot grows to exciting partnership opportunity

A technical prototype I developed for the Business Data Service has been used as the driving force behind a new and exciting research project post, bringing together partners from outside The University of Manchester Library.

What is the basic premise?

To develop a collection of tools to bring together commercially available databases from separate suppliers for use in leading, innovative research, using specialist knowledge of the field for accurate and efficient execution.

Who are the partners?

Why is this new post useful?

After spending money on expensive data sets, we need to make the most out of them. It is critical to use them together in order to unlock their full research value. In the case of some specialist resources, this activity is non-trivial.

Why is joining these datasets difficult?

databases with no join
There is no easy way to use the data from these different sources together, no common index.

Identifying companies across different databases is difficult as the codes used within each platform usually do not correspond to those used in another. There are good reasons why a platform will do this (their intellectual property is one), but this makes work harder for researchers, sometimes resorting to checking company name matches by eye, one at a time!

Writing code to map these where cross-checking is available requires the software developer to be aware of the various identification codes used such as CUSIP, ISIN, SEDOL and various ticker symbols, some of which can change with time or be further complicated in other ways. A close relationship to the curators of these databases at the University is required; this is found in the Library’s Business Data Service team whose expertise is well respected and appreciated by its users.

How will it happen?

As part of the project funding application, a new post was created. It sits outside the Library but is dependent on the library staff’s curating skills and knowledge of the library’s specialist financial databases. Under this post I will use my skills as a software developer and experience working in the Library to write new tools to combine access to various datasets within the project, as the products become available and as the researchers need them.

I’ll still be working my usual job in the Library as well, so nothing is lost from the Business Data Service.

Where might it lead?

The primary objective is to publishing new research on topics covering institutional investors, financial innovation and the “real economy”.

Once the research is published, we can develop new teaching topics and further broaden access to the University’s data sets with these tools, introducing them to new audiences in other subject areas.

Specialist financial databases visualised as a tube map

Visualisation of library provision: a worked example using specialist financial databases

At The University of Manchester Library we subscribe to many database resources, containing vast amounts of structured data, organised by further descriptive or meta data. These descriptions can be considered as many dimensions or variables, and it is important to focus on just a few to begin with.

Many research students frequently need to consult our large and rich selection of specialist business and financial databases to collect data and to shape their studies. There are over fifty databases that I would consider particularly relevant to that field, which are also of interest to a wider audience. It would be beneficial to these new researchers to have a better way to begin to answer these queries quickly, saving potential hours of trawling through the wrong resource.

As an experiment, I created this diagram of specialist financial databases in the style of a topological tube network:

Specialist financial databases visualised as a tube map
Specialist financial databases visualised as a tube map

I will explain the process I took to planning and constructing this diagram below, but first I will briefly explain what it shows. Seven research areas that require the use of specialist financial and business databases are represented as tube lines. The viewer can follow each of these lines through the various database products, which are shown as stations. The places where researchers must be to use each database are shown as zones.

Identifying the content

With so many factors to consider, I focused on the most important or first answered:

  1. Research subject area (such as corporate governance, or economics)
  2. Geographical coverage
  3. Access location (in the Library or through the web).

Further factors that I would like to consider include:

  • Historical coverage
  • Type of companies or equities covered (quoted, private, banks)
  • Consideration of survivorship bias (active or dead companies)
  • Type of data provided (numerical, reports).

These seven questions still only scratch the surface when choosing a business data source, but it is a start. I had already created a table with a list of the 50-plus relevant databases and columns for each of those factors (Figure a) which I used to gain a better understanding of the resources I work with when I came into post.

I worked with my colleague Xia Hong to reduce this table to the 21 most important databases and the three most important factors listed above (Figure b). The research areas were marked against databases just as yes or no matches, preparing for a decision of which lines will go through which station.

Business database visualisation planning
Planning the visualisation: (a) original table, (b) reduced table, (c) sketch by hand, (d) sketch in PowerPoint

Designing the structure

I decided to use good old pen and paper when it came to drawing out the layout. (See Figure c.)  Network building software exists but I decided that the learning curve for these would be too steep for the benefits, since the hand-drawn approach worked for me. I started with the database that matched the most research areas (ThomsonONE.com) and drew outwards from there.

Next, I entered the structure into PowerPoint, as it was the fastest tool available that I knew how to use (Figure d). This clearer format was used for checking the content for accuracy and omissions. The layout of the objects was refined in this tool, before employing CorelDRAW for the final markup.

The final design has these features:

  • Stations: database products, with symbol “W” for those with WRDS portal entry
  • Lines: research subject areas, coloured with University branding
  • Zones: access location, with inner zone 1 for databases you need to come into the Library to use; zone 2 for web access only on-campus; and zone 3 for web access from anywhere
  • Position: the very top is North American coverage, the left China, above the middle is Europe, and the rest is international.

Sadly, there is no river, which I could have used to separate North America from the other continents!

Where next?

This diagram is busy enough that no more information could be added without compromising its readability. There is more information on the Library website subject guide page covering these databases, which is the first port of call for a student enquiry. After that, all current students and staff of The University of Manchester are welcome to attend a research consultation session, where an expert from the Research Services team will be available.

Summary

It is difficult to convey lots of structured information. If we focus on just the initial or most important factors, we can produce something that is helpful and appealing.

See also earlier post: Why are there so many business databases?

Mind map workshop

A great conference for a newbie

Group workshops at the BLA Conference 2014
Group workshops at the BLA Conference 2014 in Leicester

I recently attended my first conference representing The University of Manchester, for the annual Business Librarians Association (BLA). It was a particularly good conference for a first-timer as the themes were interesting and relevant to most academic libraries across the country, not to mention the friendly and welcoming atmosphere from fellow delegates and sponsors.

The major themes were:

  1. Customer Service Excellence
  2. Employability and Information Literacy
  3. Marketing

Customer Service Excellence

Many academic libraries are thinking about Customer Service Excellence (CSE) status, which The University of Manchester Library achieved in 2013. Helen Loughran from Leeds Metropolitan University (soon to be known as Leeds Beckett University) spoke about her experiences with her institution’s application, which is the largest university in the country to achieve accreditation so far.

While Helen’s talk was of most benefit to people thinking about or currently working towards CSE status, there were certainly ideas to take away for those with it already, to maintain and continuously improve standards. She posed some thought-provoking questions:

  • Are customers engaging with a library’s social media platforms, asking questions? If so, are there staff allocated and trained to answer these questions?
  • Do all staff across the institution know who to pass online queries on to, including when received in error?
  • When a customer has expressed dissatisfaction, should we consider inviting them in for a chat?

To this I would add my own thought:

  • Could we use a computer-aided text analysis program for determining the tone of free-text (typed words) feedback? There are many processes for qualitative data summation, but putting qualitative text data through a tool like Diction could give better analysis.

How do librarians improve student employability?

The position of a careers service will vary between institutions, in some cases being closely tied to the library. But I think everyone whose work is related to students will have some concern for those students’ employability. Paul Chin from the University of Hull quoted that 79% of students go to university to improve their job opportunities (NSS, 2011). He went on to explain that people don’t always realise all the skills that they have already, asking this question:

Can students articulate their graduate attributes?

Mind map workshop
Image-enhanced mind map of information literacy and employability at academic libraries

In addition to developing and offering excellent training, if we can help students to be able to explain why they studied and what it will enable them to do, this will ultimately improve their interview performance.

Kaye Towlson from De Montfort University ran a breakout session titled How do librarians improve student employability? In groups, we produced an image-enhanced mind map to visualise a student’s journey through the library. Our group’s mind map (pictured) includes a customer-centric hub with spoke roads leading to books, social media, technology, skills portfolio, meeting spaces and global citizenship.

Jane Secker from LSE spoke about how information literacy relates to digital literacy. It’s not just about technology, but extends out of the library into the curriculum for all students. She also made this suggestion:

Just because young people are mostly “tech-savvy”, this does not mean that they can “just do” academic research without guidance.

Tuning out the white noise

Ned Potter from the University of York gave a great presentation on library marketing, explaining how unfocused communication is often lost or not seen at all (noticeboards, websites, some email). He suggested ways to counter it. We may well be offering services beyond the traditional (book lending) remit of a library:

Promote non-library specific expertise via non-library specific channels.

This could include ensuring your content is searchable via Google, which is used must more than a library’s website as a starting point (much more). Ned also encouraged the use of direct, tailored and targeted communications to ensure the message about library value really hits home.