pith. sign in

arxiv: cs/0608004 · v1 · submitted 2006-08-01 · 💻 cs.DL · cs.IR

Separating the articles of authors with the same name

classification 💻 cs.DL cs.IR
keywords articlesauthornamesameauthorsdistancemethodthen
0
0 comments X
read the original abstract

I describe a method to separate the articles of different authors with the same name. It is based on a distance between any two publications, defined in terms of the probability that they would have as many coincidences if they were drawn at random from all published documents. Articles with a given author name are then clustered according to their distance, so that all articles in a cluster belong very likely to the same author. The method has proven very useful in generating groups of papers that are then selected manually. This simplifies considerably citation analysis when the author publication lists are not available.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.