Skip to content

Conversation

@mjpost
Copy link
Member

@mjpost mjpost commented Dec 16, 2025

The following are the tasks that should be done on this branch (see Hugo template changes):

  • Add icons indicating whether a person is verified
  • Add documentation describing what these icons mean
  • Visually distinguish verified from unverified papers

The following are important tasks, but need not block merging:

  • Modify the ingestion script to use the library and match ORCID iDs on ingested papers
  • Add tooling to show people how to become verified
  • Adapt metadata editing UI and submission pipeline
  • Fix bulk metadata script to use library

Administrative tasks:

  • Make sure submitting conferences use ORCID iDs on all papers (or as many as possible)

@mjpost mjpost changed the base branch from master to master-new-author-system December 16, 2025 19:52
@nschneid
Copy link
Contributor

Why is the build failing? "No such file or directory" error for people.yaml

@mbollmann
Copy link
Member

Because it was created before the transitioned data was merged into master-new-author-system.

@mbollmann
Copy link
Member

Weird, but why did restarting the jobs not help... let's see if merging the base in helps

@mbollmann
Copy link
Member

I also assume this PR should be a draft at this stage, and not ready for review?

@mjpost mjpost marked this pull request as draft December 17, 2025 13:32
@mjpost
Copy link
Member Author

mjpost commented Dec 17, 2025

Okay, I created a TODO list at the top in the description. Please feel free to edit it or argue with the priorities.

My thinking is that our goal should be the minimal set of changes needed to put the new author system into production. Many pieces can be done after the transition even if they are important.

@nschneid
Copy link
Contributor

  • Mark verified entries with some icon, e.g. a green checkmark which shows a popup on-hover to explain what it means.
  • Mark unverified entries with some icon, e.g. a grey question mark which shows a popup on-hover to explain what it means.
  • Make unverified papers have subtly greyed-out/desaturated colors.

Not sure what you mean by "entries"—can you be more specific?

I was thinking that, as a first step, it would be OK just to signal (un)verified status at the top of people pages. We should have a process in place for verifying authorships (i.e. person-paper pairs) before displaying authors of a paper differently.

@mbollmann
Copy link
Member

The following are important tasks, but need not block merging:

  • Modify the ingestion script to use the library and match ORCID iDs on ingested papers

The ingestion scripts do not need to match ORCIDs, the library implements all of that already. Modifying these scripts to use it is a blocker for future ingestions, though, after we merge this.

@mjpost
Copy link
Member Author

mjpost commented Dec 17, 2025

@nschneid Sorry, I copied blindly from the wiki: these were options for how to implement the distinction between verified and unverified papers. I've updated the text.

@mbollmann Yes, it will be a blocker for future ingestions, but that's a good pressure to have.

@mbollmann
Copy link
Member

I was thinking that, as a first step, it would be OK just to signal (un)verified status at the top of people pages. We should have a process in place for verifying authorships (i.e. person-paper pairs) before displaying authors of a paper differently.

In the case where we name-match papers to an author page, the ORCID at the top would lend credibility to a paper assignment that was just done based on the name matching (and might be wrong). I thought that was a reason for considering this more important.

@mjpost
Copy link
Member Author

mjpost commented Dec 17, 2025

I was thinking that, as a first step, it would be OK just to signal (un)verified status at the top of people pages. We should have a process in place for verifying authorships (i.e. person-paper pairs) before displaying authors of a paper differently.

In the case where we name-match papers to an author page, the ORCID at the top would lend credibility to a paper assignment that was just done based on the name matching (and might be wrong). I thought that was a reason for considering this more important.

I would like to talk this through. My basic worry is that adding visual distinctions to paper assignments is going to create an unsatisfying aesthetics, and that this will trigger fastidious people to manually verify those assignments—in order to fix the aesthetics and the sense of "incompleteness"—creating lots of work for us.

For example, almost no paper before ~2020 will be verified, since there were no ORCID iDs at ingestion. Do we really want this to stick out, even subtly?

From the user perspective, what does it matter what the paper's status is? It only matters that the assignments are correct, and next, that there is a mechanism to correct mistakes.

This line of reasoning to me suggests we should do away with visual distinctions on papers altogether.

@nschneid
Copy link
Contributor

It certainly is a goal for the final design—in any listing of a paper's authors, the authors who are verified for that paper are displayed differently from the ones who are unverified. But as soon as we make this distinction salient to users, we will get lots of requests to update the info. And most pre-2023 authorships will be unverified, except for papers of manually disambiguated authors, right? Maybe there is a subtler temporary solution (e.g. a popup saying verified vs. unverified on hover)?

I don't think it would hurt to modify individual paper pages so the ORCID icon occurs next to any author with an explicit ORCID. I think users will understand that it means the ORCID was provided at submission time.

@mjpost
Copy link
Member Author

mjpost commented Dec 17, 2025

I don't think it would hurt to modify individual paper pages so the ORCID icon occurs next to any author with an explicit ORCID. I think users will understand that it means the ORCID was provided at submission time.

This is a good distinction. On listing pages (author, volume), we would make no distinction. But on the paper page itself, we could add a field noting its status: one of {matched by ORCID, matched by name, explicitly verified}.

@nschneid
Copy link
Contributor

And it would be good to develop a longer-term roadmap. E.g. maybe we prioritize getting as many authors as possible to submit ORCIDs in the next couple of major venues, so most active authors will have an ORCID in the database, then start nudging authors to verify their older authorships (with an appropriate UI for doing that).

@mbollmann
Copy link
Member

For example, almost no paper before ~2020 will be verified, since there were no ORCID iDs at ingestion.

In my mental model, a paper assignment is verified if an explicit author ID is assigned in the XML, which doesn't mean it has to have an ORCID.

@mbollmann
Copy link
Member

From the user perspective, what does it matter what the paper's status is? It only matters that the assignments are correct, and next, that there is a mechanism to correct mistakes.

That mechanism is not entirely unrelated to the paper's status, I think. A verified assignment should normally not need to be corrected, for example, since "verified" implies either "assigned via ORCID" or "manually checked at some point in the past". But I would have to think through the updated correction process in more detail to have a good idea here of the implications.

@nschneid
Copy link
Contributor

^ What about #6518?

@weissenh
Copy link
Contributor

  • Add tooling to show people how to become verified
  • Adapt metadata editing UI and submission pipeline

I guess in the task list above, it is implicitly covered that the issue template for author page requests should be adapted too (cf. "Update issue templates once new author representation/URL is in place" #6414). We might want to change terminology and think about what kind of requests we'd like to distinguish and how the workflow should look like.

@weissenh
Copy link
Contributor

weissenh commented Dec 17, 2025

Two comments I have while navigating through the preview:

(1) Why do some author page URLs in the preview end in people/author-name/author-name ? two random examples: Yan Qu, Anna Korhonen

(2) Imagine author A has one or more namesakes (all verified), so that disable matching is true: is there any hint on verified author pages to the unverified page (if it contains papers?). My thinking is, that a verified author A (or someone interested in what papers author A has published) will wonder why their paper X is not on the verified page? Do we want such a link similar to the "other authors with the same name: list-here"?
EDIT: Example "Lei Li" has been observed with multiple ORCIDs (e.g. this author page with ORCID), but there are also papers on an unverified catch-all page.

Sorry if this has been discussed somewhere else already, I'm starting to get lost in the different issues with discussion threads and closed PRs on author representation changes...

@mbollmann
Copy link
Member

mbollmann commented Dec 17, 2025

(1) Why do some author page URLs in the preview end in people/author-name/author-name ? two random examples: Yan Qu, Anna Korhonen

Huh, that looks like a bug, I will investigate tomorrow soon. Thanks!

(2) Imagine author A has one or more namesakes (all verified), so that disable matching is true: is there any hint on verified author pages to the unverified page (if it contains papers?).

This should happen automatically via the "similar names" functionality (same as we can specify via the "similar:" key in the YAML; and identical names should automatically be considered "similar"), but if this doesn't consistently happen on the preview that may be a bug also.

@weissenh
Copy link
Contributor

weissenh commented Dec 21, 2025

To me, it looks like all unverified pages actually have the pattern people/name/name. I don't see unverified/name URLs as the Author page plan suggested.

For the case of linking verified and unverified pages, here are some more examples:

All of them have in common that unverified page (e.g. people/yang-liu/yang-liu) and verified pages (people/yang-liu/, people/yang-liu-wl/ , people/yang-li-6011 , ...) are not linked.

How do we want to link verified pages and unverified page?
On a verified author page, do we really want to collapse links to verified namesakes and to the unverified page in one line starting with "Other people with similar names:" ?
If we do that, it would be helpful to add a comment to the unverified page link so it is more easily identified among the potentially many namesake links. Before the transition we had a "May refer to several people" comment for this, maybe this can be automatically added if the link contains unverified?) Alternatively, we can add another line saying something like "More papers published by name, but not associated with a verified author (yet): link.". On the unverified page, if it exists and there are verified authors at all, we could place a hint saying "Verified authors with this name: list-here".
However, if all papers for a name belong to defined verified authors, there is no need for an unverified page and a non-functional link to it. Examples: the name Patrick Haller is ambiguous, but has no unverified author papers. Same for Yifan Peng).

EDIT: revisiting the mockup slide set I like the phrasing there (slide 17ff) "This author may have additional unlinked papers."). For the unverified page it is suggestion to link to a disambiguation page instead of listing all verified Yang Lius? And the disambiguation page will have as last point "Other papers by 'Yang Liu' (that have not been linked to a specific individual)"

@mbollmann
Copy link
Member

To me, it looks like all unverified pages actually have the pattern people/name/name. I don't see unverified/name URLs as the Author page plan suggested.

Ah, then this probably comes from my changes to switch this around to name/unverified, as per discussions we had in previous issues, and introducing a bug along the way. Will investigate (when I have the time).

@weissenh
Copy link
Contributor

Ah, then this probably comes from my changes to switch this around to name/unverified, as per discussions we had in previous issues, and introducing a bug along the way. Will investigate (when I have the time).

Just as you posted this, maybe I've found the PR (closed without merging: #5471 ) you're referring to.
In that PR links to unverified pages did contain /unverified/ instead of duplicating the name, but then a discussion emerged on the position of unverified part in the URL. Also, the missing link between verified and unverified was observed and commented on in the same PR too (discussion on both started around #5471 (comment) )

@weissenh
Copy link
Contributor

In #5991 it was discussed to maybe have a three-way distinction when it comes to adding a symbol to the canonical name: verified with ORCID, verified but without ORCID (id in the yaml file), unverified.
From the discussion there, I am not sure whether a consensus was reached and whether the current preview is in line with it.

Here are some examples for verified but without ORCID and unverified so you can decide whether the preview for them should be different somehow or not:

(As you probably noticed, I picked examples from the list of ACL Lifetime Achievement Award recipients)

I've also just revisited the Mockup Slide set, slide 33 also mentions the quasi-verified (id, but no ORCID) case as separate from unverified.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants