Skip to main content

The Internet Archive Will Digitize & Preserve Millions of Academic Articles with Its New Database, “Internet Archive Scholar”

Open access publishing has, indeed, made academic research more accessible, but in “the move from physical academic journals to digitally-accessible papers,” Samantha Cole writes at Vice, it has also become “more precarious to preserve…. If an institution stops paying for web hosting or changes servers, the research within could disappear.” At least a couple hundred open access journals vanished in this way between 2000 and 2019, a new study published on arxiv found. Another 900 journals are in danger of meeting the same fate.

The journals in peril include scholarship in the humanities and sciences, though many publications may only be of interest to historians, given the speed at which scientific research tends to move. In any case, “there shouldn’t really be any decay or loss in scientific publications, particularly those that have been open on the web,” says study co-author Mikael Laasko, information scientist at the Hanken School of Economics in Helsinki. Yet, in digital publishing, there are no printed copies in university libraries, catalogued and maintained by librarians.

To fill the need, the Internet Archive has created its own scholarly search platform, a “fulltext search index” that includes “over 25 million research articles and other scholarly documents” preserved on its servers. These collections span digitized and original digital articles published from the 18th century to “the latest Open Access conference proceedings and pre-prints crawled from the World Wide Web." Content in this search index comes in one of three forms:

  • public web content in the Wayback Machine web archives (web.archive.org), either identified from historic collecting, crawled specifically to ensure long-term access to scholarly materials, or crawled at the direction of Archive-It partners
  • digitized print material from paper and microform collections purchased and scanned by Internet Archive or its partners
  • general materials on the archive.org collections, including content from partner organizations, uploads from the general public, and mirrors of other projects

The project is still in “alpha” and “has several bugs,” the site cautions, but it could, when it’s fully up and running, become part of a much-needed revolution in academic research—that is if the major academic publishers don’t find some legal pretext to shut it down.

Academic publishing boasts one of the most rapacious legal business models on the global market, and one of the most exploitative: a double standard in which scholars freely publish and review research for the public benefit (ostensibly) and very often on the public dime; while private intermediaries rake in astronomical sums for themselves with paywalls. The open access model has changed things, but the only way to truly serve the “best interests of researchers and the public,” neuroscientist Shaun Khoo argues, is through public infrastructure and fully non-profit publication.

Maybe Internet Archive Scholar can go some way toward bridging the gap, as a publicly accessible, non-profit search engine, digital catalogue, and library for research that is worth preserving, reading, and building upon even if it doesn't generate shareholder revenue. For a deeper dive into how the Archive built its formidable, still developing, new database, see the video presentation above from Jefferson Bailey, Director of Web Archiving & Data Services. And have a look at Internet Archive Scholar here. It currently lacks advanced search functions, but plug in any search term and prepare to be amazed by the incredible volume of archived full text articles you turn up.

Related Content:

The Internet Archive Makes 2,500 More Classic MS-DOS Video Games Free to Play Online: Alone in the Dark, Doom, Microsoft Adventure, and Others

Libraries & Archivists Are Digitizing 480,000 Books Published in 20th Century That Are Secretly in the Public Domain

The Boston Public Library Will Digitize & Put Online 200,000+ Vintage Records

Josh Jones is a writer and musician based in Durham, NC. Follow him at @jdmagness

The Internet Archive Will Digitize & Preserve Millions of Academic Articles with Its New Database, “Internet Archive Scholar” is a post from: Open Culture. Follow us on Facebook, Twitter, and Google Plus, or get our Daily Email. And don't miss our big collections of Free Online Courses, Free Online Movies, Free eBooksFree Audio Books, Free Foreign Language Lessons, and MOOCs.



from Open Culture https://ift.tt/360eaL8
via Ilumina

Comments

Popular posts from this blog

Board Game Ideology — Pretty Much Pop: A Culture Podcast #108

https://podtrac.com/pts/redirect.mp3/traffic.libsyn.com/secure/partiallyexaminedlife/PMP_108_10-7-21.mp3 As board games are becoming increasingly popular with adults, we ask: What’s the relationship between a board game’s mechanics and its narrative? Does the “message” of a board game matter? Your host Mark Linsenmayer is joined by game designer Tommy Maranges , educator Michelle Parrinello-Cason , and ex-philosopher Al Baker to talk about re-skinning games, designing player experiences, play styles, game complexity, and more. Some of the games we mention include Puerto Rico, Monopoly, Settlers of Catan, Sorry, Munchkin, Sushi Go, Welcome To…, Codenames, Pandemic, Occam Horror, Terra Mystica, chess, Ticket to Ride, Splendor, Photosynthesis, Spirit Island, Escape from the Dark Castle, and Wingspan. Some articles that fed our discussion included: “ The Board Games That Ask You to Reenact Colonialism ” by Luke Winkie “ Board Games Are Getting Really, Really Popular ” by Darron Cu

How Led Zeppelin Stole Their Way to Fame and Fortune

When Bob Dylan released his 2001 album  Love and Theft , he lifted the title from a  book of the same name by Eric Lott , who studied 19th century American popular music’s musical thefts and contemptuous impersonations. The ambivalence in the title was there, too: musicians of all colors routinely and lovingly stole from each other while developing the jazz and blues traditions that grew into rock and roll. When British invasion bands introduced their version of the blues, it only seemed natural that they would continue the tradition, picking up riffs, licks, and lyrics where they found them, and getting a little slippery about the origins of songs. This was, after all, the music’s history. In truth, most UK blues rockers who picked up other people’s songs changed them completely or credited their authors when it came time to make records. This may not have been tradition but it was ethical business practice. Fans of Led Zeppelin, on the other hand, now listen to their music wi

Moral Philosophy on TV? Pretty Much Pop #32 Judges The Good Place

http://podtrac.com/pts/redirect.mp3/traffic.libsyn.com/partiallyexaminedlife/PMP_032_2-3-20.mp3 Mark Linsenmayer, Erica Spyres, and Brian Hirt discuss Michael Schur's NBC TV show . Is it good? (Yes, or we wouldn't be covering it?) Is it actually a sit-com? Does it effectively teach philosophy? What did having actual philosophers on the staff (after season one) contribute, and was that enough? We talk TV finales, the dramatic impact of the show's convoluted structure, the puzzle of heaven being death, and more. Here are a few articles to get you warmed up: "The Good Place’s Final Twist" by Karthryn VanArendonk "The Good Place Was a Metaphor All Along" by Sophie Gilbert "The Two Philosophers Who Cameoed in the Good Place Finale on What They Made of Its Ending" by Sam Adams "5 Moral Philosophy Concepts Featured on The Good Place" by Ellen Gutoskey If you like the show, you should also check out The Official Good Place Podca