Google is Bad. Let's be like Google
Yesterday I attended a plenary on the topic of seamless library service access at the University of Michigan Library. I was excited at the prospect of discussion about making digital library services less chunky, just as I am excited about discussion about making digital publishing less chunky. The speaker was ProQuest’s VP of Discovery Services, John Law, and his talk was titled “Attracting and Keeping Net-Gen Student Researchers.”
The first part of the talk was a pretty dry but informative overview of an ethnographic study ProQuest did to better understand the research habits of its primary users: students. Primary research included:
- field observation of mostly undergraduate student researchers, either in person in their “natural habitat” or remotely
- online chat-based focus groups with end-user researchers
- focus groups with librarians
- end-user researcher surveys
Some results included the fact that participants overwhelmingly thought that libraries have the best information over the Internet at large, but they often start their searches in search engines because they don’t now where to begin on library sites. Library sites usually have no clear/compelling starting places, and it’s difficult to identify useful and appropriate resources without prior knowledge. Users generally satisfice (what Law called a compensatory behavior) by going to search engines instead, sometimes even returning to their or another library through search results. (And when I say he talked about search engines, I mean that he was talking about Google. No other search engines were really discussed. Bing was mentioned in passing, but not in the context of its seeming ability to be relatively good at getting deep web stuff.)
To sum up this portion of the talk, he cited the Ithaka Report to say that usage of the library as gateway is decreasing, and library disintermediation from the research process is increasing.
I was totally with him up to this point: Libraries are not being used as they used to be. Check. People are using other online tools, tools thought by librarians to be inferior. Check. Librarians want people to use their tools. Check.
I thought we were on the cusp of talking about getting web tools and library tools to work better together. But we were not.
Law views Google as competition. Instead of making library databases and catalogs and what-have-you play nicer with search engines, he has headed up the creation of a new product called Summon (taglined “Web Scale Discovery”).
It’s:
- is a single, pre-harvested unified index, including full text when available
- makes results available without authentication
- is customized for a given library’s content
- is vendor and resource neutral (but still dependent on vendor buy-in)
He referred to it as being “like Google” but for library websites.
I do think the product would do a few things well. Its core mission is to provide a way for people to search across content types in a timely fashion, replacing slow and clunky federated search. It also provides the opportunity for the user to do some vetting before they even look at a resource, if useful metadata is provided. This would allow users to look at only a few really pertinent results and then go off to use whatever resources are at their disposal. It would also be a great discovery tool for figuring out what databases and tools are useful for a given topic, especially since once people figure out what tools to use for that topic, they tend to return to them again and again. It’s a true gateway model, in that sense.
However, as you can probably gather because I only write information science-related posts when I’m angry, I have a few problems with Summon. I think Summon makes some assumptions, which I will put in quotes so someone doesn’t think these are ideas I espouse:
- “Google is bad for research.” In an increasingly cross-disciplinary world, where traditional media and traditional publishing are no longer the sole, or even primary, sources of research, I think it’s problematic to say that Google is not a research tool. Discouraging people from using Google is just going to make your library look even more out-of-touch than people already think it is.
- “People start their research at the library.” Some people do. I know I start with search engines and Wikipedia to get a grasp on what terms and names I should look for when I go into a research database. During the Q&A period I asked Law about what suggestions he had for addressing users starting their research in Google, and trying to get them to Summon. His reply, which I think was a good one in general but not specific to the tool, was that libraries should have keyword rich landing pages that pull in hits. I guess I was hoping that you could make canned searches in Summon and make landing pages with them.
- “Librarians should decide what resources are legit and which aren’t, even on the open web.” Someone in Q&A asked about whether open web content could be included in Summon. The answer was yes: whatever librarians deem important. I do think the opportunity to include open web content with scholarly sources is a good one, but, as I said before, decisions about what’s important on the Internet are made by completely different metrics, especially as resources become more cross-disciplinary. I imagine this capacity for the tool would generally be extremely limited to a few scholarly sites, populated on some sort of as-used basis, or ignored all together in the face of the enormity of the information that would have to be vetted by librarians to make it a truly integrated search. (This was tried once. It was called the Internet Public Library, and it was outgrown by the post-2000 web and made obsolete by, yes, search engines.)
- “We can change user behavior.” This is the We Know Best Approach. Instead of following users’ lead and meeting them where they’re starting, we’re trying to force them to start where we want them to start. That doesn’t work, and it’s generally a waste of time and money.
Ultimately, I think the “Google is bad, but let’s build a tool like Google” business model is not going to work out. Because, as long as there is Google, people are going to use Google. I think a better use of library resources is figuring out how to get scholarly work visible in search engines. Take that same metadata and work with search engine companies to get it into search engines. If a resource is restricted, let those listings tell a person if they can access the resource or not via their institution, given IP range or current authentication or a cookie or a plugin or whatever their library’s method is. And let them get straight at the content from there. The library doesn’t have to be invisible, but it also doesn’t have to be an unmoving wall.

January 13th, 2010 at 1:48 pm
I left when it started to look like a sales pitch, but I guess I should have stayed, because I’m still not sure how this is different than Google Scholar. Yeah, this does the auth piece for you so you don’t have to authenticate to individual resources. What else?
January 13th, 2010 at 2:04 pm
Interesting. These are clearly people who do not *do* research as their primary activity. Library web sites are often so lousy, they’re actually the research resource of last resort when it comes to actually doing the searching part of research. Putting a search box in the middle of the site does not solve the issue. Making me choose a resource type is totally misguided, especially because libraries put stupid names on the resource types and I can never tell if I need to access a “database” or an “e-journal” – whatever the heck that is supposed to be.
What’s really useful for me is Google Scholar, with the preferences modified so that it provides links to my library’s catalog and services right there in the search results. Now that is useful! Google Scholar is my portal to my library’s catalog, because they do it better than my library can. I do a fair mix of both known-item and exploratory searching, and Google Scholar is better for both of those. It finds the known items more accurately and with much less effort. It finds lots of things that my library doesn’t have when I do exploratory searching. Now tell me, when is my library going to point me at great resources that it doesn’t have? Yeah, I thought so…
Most online catalogs just aren’t very good discovery tools, and after awhile, most users just give up on them. I agree that libraries should stop trying to replicate what Google already does better, and should definitely find ways to make content more visible to search engines that actually work.
January 13th, 2010 at 2:43 pm
IF WE JUST RESTRICT ACCESS ENOUGH PEOPLE WILL HAVE TO COME TO US!
January 13th, 2010 at 4:59 pm
I have to agree about Google Scholar. It’s not perfect, but it’s the beginning of a (so far) free semantic citation network that does provide a lot of access and source info. I’ve recently been doing some experimenting to see the connections between regular Google and Google Scholar; the latter definitely informs the former as far as page rank goes.
And, god, I can’t remember the last time I used an OPAC as a discovery tool.
Joshua, I’m sorry that my stylesheet has cramped your all-caps style. Your comment is definitely best shouted.
January 15th, 2010 at 3:45 pm
I attended a talk by this dude at my last library and came away with a very similar impression – that the possibilities for Summon were exciting and all, but what’s the point of making yet another discovery tool in order to discover resources that are incredibly difficult (and expensive) to use? Where’s the added value? And how does it benefit users to have to learn yet ANOTHER platform – much less one that won’t follow them once they leave academia (as, what, 90% or more of them will)?
February 9th, 2010 at 6:16 pm
you are all missing what makes summon a killer app. the number one thing students want is immediate, one-click access to full text, every time. they hate seeing citations that look good, but no way to get to the full text. even if you are at an ip address for a university like u-m and so can click through to a huge collection of paid content, still about half the hits of any google scholar search do not lead to full text. summon restricts the search to only the stuff that your library is paying for, so you will get one-click access to full text nearly every time. as less and less full content is free on the web–and thanks to terrified publishers that, alas, is the current trend nowadays for much content needed by students and researchers–google scholar is going to become less popular compared to tools like summon.
February 9th, 2010 at 6:33 pm
Scott, my concern with products like Summon is that they introduce another functional and financial layer. If you’re paying ProQuest for Summon, you’re paying them for Summon and all the databases you already subscribe to from them.
In my view it would be more prudent for libraries to work directly with search engine companies and open access publishers to optimize and customize search tools people (not just students) are already using, to provide search results that fall within access restrictions (something Google Scholar already nods to) but show as much content as possible. There’s no need to reinvent the search wheel, and every opportunity to collaborate with technologists.
I don’t want to leave the impression that I think Google Scholar is teh most awesome thing evar. It’s extremely imperfect, and yes, it’s owned by a corporation, but then so is a lot of content. Google Scholar and academic search engines like it may not be the ultimately answer, but putting even more control into the hands of big publishers doesn’t seem the way to go, to me.
February 9th, 2010 at 7:05 pm
I don’t think there is any question of Summon being a “bad idea” or “a good idea,” per se. The functionality makes good sense for students/researchers/librarians working within a wealthy university library. The questions are more to do with why a new company needs to be in charge of doing little more (ultimately) than providing a filter. Why not work on an elaboration of the already present relationship with Google. Google can hardly be unaware of Summon, for one thing. Surely, they have the incentive to improve the experience and reach of their product. I’d, at least, like to see how Google reacts to this before I heeded PQ’s summons…*cough* *cough*