Announcing a new release of the Perseus Art & Archaeology Artifact (A&A) Browser

The Perseus Digital Library is pleased to announce an initial release of the new Perseus Art & Archaeology Artifact (A&A) Browser (Figure 1) that is based on CollectionBuilder and uses AirTable to manage the extensive metadata found within the A&A collection. 

Figure 1: The new Perseus Art & Archaeology Browser (with randomly selected images on the home page)

An Art and Archaeology Browser has been an integral part of Perseus since the release of Perseus 1.0 in 1992, then described as “a multimedia interactive library” and a legacy instantiation has been available in Perseus 4.0 since 2004.  While work has moved ahead on the Perseus textual collections with the development of the Scaife Viewer starting in 2018, the initial release of Beyond Translation in 2023 and the continuing development of Perseus 6, these efforts have been concentrated on textual data, annotations and related reading tools rather than images of the ancient world and its material objects. 

This update was long overdue. As happens with all software, the current version of the A&A Browser has become outdated, and it can no longer be updated or maintained. The new version addresses several issues:

  1. The images have been converted to a common, standard format (TIF) and stored on an institutional server, where they are available to all through the IIIF Image API. Previously, images either could not be viewed at all due to rights restrictions or were only available as thumbnail images.
  2. The metadata has been migrated from an ad hoc record format to CIDOC-CRM, an ontology commonly used in cultural heritage institutions to describe their holdings. CRM is highly expressive, allowing us to expand upon the descriptions and relations already represented in the old XML data. Even more important, it is an RDF representation, which means the Perseus A&A metadata will become part of the linked open data network of metadata about cultural heritage materials already being built by cultural heritage institutions world-wide. In addition to this conversion, the metadata can also now be corrected, updated and enhanced. 
  3. The new web interface has been built using minimal computing principles.  This paradigm is rapidly being adopted by digital humanities projects and digital libraries (see for example, https://lib-static.github.io/

As in the P4 browsing environment, the collection features hundreds of descriptive catalog entries composed by Perseus editors, who also created descriptive keywords. Most entries (see Figure 2) include corresponding images (see Figure 3 ) and/or illustrations (many of these images were collected via custom photography for Perseus). Basic information about the objects has been taken from a number of different standard sources, each of which is cited in the object entries. The new interface also makes use of the IIIF image API to access the images on a IIIF image server (see Figure 4) for individual images. In the short run, this provides users a greatly enhanced experience with the ability to zoom in on details. More significantly, we can integrate images from any collection that makes its data available with a IIIF server. That means that the Perseus viewer could be expanded to include materials from many different collections. 

Figure 2: A sample catalog entry for a vase with a large number of images. Clicking on the image thumbnail launches a new image window as seen in Figure 3

Figure 3:  A new image window opens up with image level metadata and an option to “Click to view Full Screen” at the bottom of the image. Clicking this calls the image from an IIIF server and opens it in a new window as illustrated in Figure 4.

Figure 4: The image in a new viewing window with zooming capability.

An extensive bibliography, available in earlier versions of Perseus, will eventually be published online as part of this new release with links (where available) to bibliographic sources that are either open-access or in the public domain.

In addition to the significant work of the dedicated Perseus editors over the years, all of the descriptions and images in these collections were also produced through collaboration with scholars, researchers, museums and cultural heritage institutions over 25 to 30 years ago. With the release of this new interface we are hoping to again embrace greater collaboration as we look to better integrate the metadata and images into both the Perseus textual collection and beyond.

We call this an initial release because it is only an early version of the system we plan to build. While all the metadata is freely available in our GitHub sites in several formats, including CSV and RDF, our use of CRM is limited. There are also extensive bibliographical references in the metadata that must be resolved into modern digital citation formats, including those that enable them to be linked into the new Perseus Digital Library and potentially other digital collections. And of course the graphical user interface needs extension and refinement (one new feature for example is the ability to browse the collection by subject word cloud in Figure 5). We are making it available now for two reasons: 1) the computing platform on which the current browser runs is failing and must be retired soon; and 2) we want user feedback to guide us as we refine the new version.

Figure 5: A Word Cloud for Subjects.

We welcome your opinions on the new A&A browser. Please email the Perseus webmaster  with your comments and suggestions.

Posted in Announcement, Release, Technology | Tagged , | Leave a comment

Update on Recent Service Interruptions

June 3, 2025 Update:

We have seen reports of persistent issues. Additional measures have been implemented to block the source of the problems, but further restrictions may impact core site functionality.

The server nodes will automatically reset to clear the errors you see. Sometimes the error messages persist in your local browser even if the servers are all responding normally. This makes it hard for us to see the same issues here, as we can be accessing different servers at different times.

Typically a refresh will help as this can force a new connection to a different node. If the site goes down completely and is unresponsive for more than an hour or two, please do report it. We do not have the ability to continuously monitor the site so we rely on reports to inform us of problems.

For further information, this report from Nature details what we believe we are experiencing.


March 17, 2025 Update:

The 503 errors appear to have returned. We believe that this may be related to data capture for training models (such as AI LLMs). Unfortunately, there are so many bots at work trying to access our servers at the same time, this overwhelms our server nodes and makes it impossible to maintain service. Manual blocking of sets of IP addresses has been attempted, but, as soon as one set of addresses is blocked, the access will resume from different addresses, and so on. We are working on ways of addressing this and trying to stay ahead of the problem as the situation evolves.


The Perseus servers are currently being flooded with requests from overseas servers. Tufts Research Technology is endeavoring to block the bad actors and maintain operations, but there may be disruptions until all of the relevant domains can be identified.

We thank you for your patience.

Note that the Scaife Viewer is available: https://scaife.perseus.org
With a Quick Start Guide here: https://perseus.pubpub.org/pub/bv43kvmy/release/1

Please reach out to the Perseus webmaster with questions or concerns.

Posted in Announcement | Comments Off on Update on Recent Service Interruptions

New publication: “Greek, Latin and Augmented Intelligence: the other AI”

Gregory Crane
February 26, 2025

On behalf of myself and my co-authors, Alison Babeu and Farnoosh Shamsian, I am pleased to announce that our article on “Greek, Latin and Augmented Intelligence: the other AI” has appeared online as part of the Classical Review. The DOI is https://doi.org/10.1017/S0009840X25000356. The online publication occurred today. The article has not yet been assigned a volume number but it is available as part of the Cambridge Core.

The article is, unfortunately, not available under an open access license. We felt that, by publishing in a well-established journal, we would reach a different audience. By publishing in a traditional journal such as Classical Review, we also place this paper in a space that has its own mechanisms for long term preservation. The description of treebanks and aligned translations as they appear in the early 2020s may be of interest to future readers as they reflect on the technological evolution of scholarship.

The opening and concluding paragraphs from the piece, quoted below, serve as an abstract.

“This Profile looks at two technologies that were developed to make source texts in the original Greek, Latin and, indeed, any language directly accessible to audiences who have not yet studied – and may never study – the language itself: (1) translations aligned at the word and phrase level with the original text and (2) rich linguistic annotations explaining the part of speech, regularised dictionary form and syntactic function of each word in a corpus (typically called treebanks, because the syntactic structure is commonly visualised as an inverted tree).”

CONCLUSION – TOWARDS THE OTHER AI, AUGMENTED INTELLIGENCE

An enormous amount of work remains to be done improving our ability to generate alignments and treebanks automatically, refining the results of automatic methods for manageable amounts, and on using new refined data to generate better models for automatic analysis. Nevertheless, we finally have in place the basic services that enable new forms of reading and make source texts in the original language intellectually accessible to new audiences. This addresses a critical challenge for linguistically constrained subjects such as ancient Greek and Roman culture where only a relative handful will develop mastery of ancient Greek or Latin, much less mastery of the research publications in languages such as French, German, Italian and Spanish (to mention only traditional European languages of publication). But if we have the basic tools, we are only now starting to understand how these tools are used and how users do and do not benefit. The most important research area for students of the past might not be how to produce new articles and monographs but to learn how we can exploit digital methods to make the results of our work advance the intellectual life of society as a whole.

Posted in Uncategorized | Comments Off on New publication: “Greek, Latin and Augmented Intelligence: the other AI”

Progress towards Perseus6

Gregory Crane
November 15, 2024

A great deal of work towards the new Perseus (Perseus 6) has been going on behind the scenes and that work is now beginning to become visible. James Tauber of Signum University has led the development, with collaboration from Charles Pletcher and Clifford Wulfman. In most cases, the “we” in this narrative means “James Tauber did the work with the rest of us making comments from the peanut gallery.”

 

Figure: the ATLAS backend architecture

We are still not in the final stage of implementation but we are finishing up a major, and probably the biggest, task: creating a sustainable backend to manage a growing range of machine actionable textual data from a widening set of open data projects in multiple countries. As we refine the backend data, we can begin to move to the final stage of work: implementing within the Scaife Viewer the frontend services that support treebanks, grammatical annotations, aligned translations, metrical analyses, automatic mapping, links from edition to manuscripts and that were developed in Beyond Translation.

For now I point out work done to make the existing Scaife/Perseus Greek and Latin accessible in the ATLAS architecture.

Every Greek and Latin text available in Scaife is now available in the new ATLAS architecture. If you start with the Scaife ATLAS homepage (https://atlas.perseus.tufts.edu/) drill down into the CTS library (https://atlas.perseus.tufts.edu/library/), you can find your way into a list of all editions currently available in Scaife (a dozen versions of Thucydides in various languages are available https://atlas.perseus.tufts.edu/library/urn:cts:greekLit:tlg0003.tlg001/), and then down to each of the lowermost citable notes (e.g., https://atlas.perseus.tufts.edu/library/passage/urn:cts:greekLit:tlg0003.tlg001.perseus-grc2:1.1.1/). At this point, you can call for either plain text or xml by adjusting the final argument: https://atlas.perseus.tufts.edu/library/passage/urn:cts:greekLit:tlg0003.tlg001.perseus-grc2:1.1.1/text/ vs. https://atlas.perseus.tufts.edu/library/passage/urn:cts:greekLit:tlg0003.tlg001.perseus-grc2:1.1.1/xml/.

If you explore the Github repository, you will not not only the full text library stored in the simpler ATLAS form (identifier+textchunk) but also complete morphosyntactic analyses for all 40 million words of Greek (using GreCy) and 16 million words of Latin (using LatinCy). Navigating to particular authors is a bit cumbersome because we had too much data and it was necessary to split the data into multiple Github repositories. The information you need to find the right repository for any given Scaife Greek or Latin author is at https://github.com/scaife-viewer/tagging-pipeline/tree/main/data.

We now have three layers of morpho-syntactic data.

  1. Actively curated Treebanks (e.g., Francesco Mambrini’s Daphne).
  2. The growing repository of curated and automatically generated treebanks from Alek Keersmaeker’s Glaux Trees (currently 20 million words of Greek and including some works that are not yet in Perseus)
  3. The comprehensive morpho-syntactic data generated for any texts added to Perseus.

The opening sections of Thucydides (source here):

The identifier+textchunk format is all that we need to add new texts into Perseus. While the greater structure of the TEI XML format that we have used in Scaife has advantages, we can now add much more content much more quickly and work with larger corpora that has been practical in the more demanding XML framework. The goal is not to abandon XML but to provide the XML as time allows while also being able to work with larger, less structured collections.

The morpho-syntactic analysis (link here):

A draft of a fuller write-up for this work is available in Zenodo: The Sixth Generation of the Perseus Digital Library and a Workflow for Open Philology.

Posted in Uncategorized | Comments Off on Progress towards Perseus6

August 5, 2024: update on Perseus 4

Gregory Crane

As far as I can tell, things have settled down on Perseus 4. As I understand it, there were two issues.

First, Tufts Technology Services moved Perseus 4 to a new hardware platform. In the older platform, various checks had been added so that we would automatically reboot Perseus servers if they hung. This automatic rebooting feature was added a long time ago — probably at least ten years in the past. Replacing this automatic feature seems to have slipped our attention (and I should have checked that). It looks like we have made progress on this. It is possible we will need to tweak this reboot feature but for now I am finding that the site works (at least for me). We monitor email to the Perseus webmaster (and try to respond in a timely fashion).

Second (as noted before), we were hit with scraping and of high volume use that differed from previous patterns over the many years that Perseus 4 has been running. Measures have been put in place to mitigate against these behaviors. We will monitor this situation and respond as needed.

I would like to thank Tufts Technology Services for their on-going help.

Posted in Uncategorized | Comments Off on August 5, 2024: update on Perseus 4

Perseus Digital Library: May 2024 Updates

Gregory Crane

May 4, 2024

Updates on the conversion to Perseus 6 are available here. The update itself is published on the PubPub platform and the shift from WordPress (which is the engine behind the Perseus Updates blog) to PubPub reflects a major component of the overall change.

Posted in Uncategorized | Comments Off on Perseus Digital Library: May 2024 Updates

Towards a new Perseus: Update

Gregory Crane
Editor-in-Chief, Perseus Digital Library
Update as of January 19, 2024.

We have now completed work on Beyond Translation (a draft white paper on this work that has been submitted to the NEH Office of Digital Humanities is available here) and are focused on using the Beyond Translation work as part of an update to the Scaife Viewer. The resulting system will finally allow us to replace Perseus 4. We are calling the new version Perseus 6 (rather than 5) to reflect the amount of work embedded in the Scaife Viewer and now Beyond Translation (which we view collectively as Perseus 5). A grant from the NEH Collections and References program in the NEH Division of Preservation and Access for Perseus on the Web — preparing for the next thirty years provides the primary support for this phase of work, with additional support from the Tufts Data Intensive Studies Center, the School of Arts and Sciences, Tufts Technology Services and Google.

Our main collaborator in this phase of development is James Tauber, who is now working with Signum University. We also are waiting on Tufts administrative paperwork to finalize a contract with another group to help us reorganize the Perseus home (and associated sub pages) and replace this WordPress-based blog with the Pubpub Publishing Platform (which we already began using in documenting Beyond Translation.

A draft outline of the work that we are doing is now available here.

For now, the focus of work is to fold the services visible on Beyond Translation into the Scaife Viewer. The first results from that work will probably documentation, with changes to the Scaife Viewer following.

Posted in Uncategorized | Comments Off on Towards a new Perseus: Update

Draft NEH White Paper for Beyond Translation

GREGORY CRANE

The Office of Digital Humanities at the National Endowment for the Humanities asks its projects to submit white papers after completing a project. We have posted a draft of the White Paper for Beyond Translation here and invite suggestions as we finalize our work.

dNext up: a formal description of plans to replace Perseus 4. We secured the last chunk of funding needed shortly before the winter break. Once we finalize this report on what we have done, we will provide more details on what we are doing.

Posted in Uncategorized | Comments Off on Draft NEH White Paper for Beyond Translation

Towards the Next Perseus: mid-fall update

A lot of work continues to go on behind the scenes as we move to replace the current Perseus 4.0 (the Hopper). Our goal is to finish the transition by fall 2024, with new functionality folded into the Scaife Viewer until this can fully take the place of the now venerable system. Those who are more technically inclined can follow much of what is being done by tracking issues on the Beyond Translation Github site.

New functionality includes support for new kinds of annotations such as treebanks, translations aligned at the word and phrase level, automatic mapping, visualization of meter etc. You can see a summary of these new features and enhancements here.

New functionality for Scaife also includes addition of services to which users have been long accustomed in Perseus, with support for commentaries been at the top of the list. We are also finishing a long-term backlog of texts for which the structural markup requires some manual intervention(as well as programmatic reformatting).

At the moment we are preparing to sign a contract to replace the Perseus home page and associated data. Our plan is to replace the Word Press platform (which I am currently using) with a different publishing platform, which is much better suited to academic publication. It supports not footnotes, automatically generates citation information and allows us to include interactive visualizations. We will have more to say as soon as the contract is signed.

Our plan is to have more substantive information about what we are doing by the end of the December (i..e., a few weeks after a busy semester starts).

Posted in Announcement, General, Release, Scaife Viewer, Technology | Comments Off on Towards the Next Perseus: mid-fall update

Yet more Lucian: translations by Emily James Smith

In 1892, at the age of 27, while serving as teacher of Greek at the Packer College Institute, Emily James Smith published translations for selected works of Lucian. She later served as dean (1894-1900) and then trustee (1900-1905) of Barnard College. She provides with readable translations for a number of Lucian’s works. We added the section numbers and attentive readers will note missing sections. Smith chose to leave out those passages that could not be translated in the standards of the time because of their sexual nature.

Her translations have the identifier perseus-eng5 (e.g., tlg0062.tlg029.perseus-eng5 for “the Dream”). She includes both works that have been ascribed to Lucian (with the identifier tlg0061although some are clearly not by him) and two that are labelled as “Pseudo-Lucian” (tlg0061).

tlg0062.tlg029 The Dream
tlg0062.tlg018 Zeus the Tragedian
tlg0062.tlg024 The Sale of Lives
tlg0062.tlg019 The Cock
tlg0062.tlg016 The Ferry
tlg0062.tlg012 A True History
tlg0062.tlg044 Toxaris; Or, Friendship
tlg0061.tlg001 Loukios; Or, the Ass
tlg0061.tlg004 The Halcyon
Posted in Release | Comments Off on Yet more Lucian: translations by Emily James Smith