All posts by Damien

Searching

Seaching is an important part of Paperscape, since it allows you to find papers on the map. When you enter a search term in the box, all papers that match the search result have a large white halo drawn around them.

At the moment our search can handle arXiv identifiers (eg 1207.7214, hep-ex/9807003), author names (eg E.Witten), titles, keywords (the most common words in the title and abstract of a paper), and new papers (those that appeared on the arXiv today, eg ?n hep-th).

If you type in a list of words in the search box, we do a “boolean and” search for all those words using the authors and keywords of each paper. This gives decent results in a lot of the common cases. For example, searching for "witten qcd" finds papers written by Witten that are about QCD, and also finds papers written about QCD that mention Witten in the abstract.

It is not at the moment possible to construct your own boolean search phrases. For example "?au witten ?ti qcd" does not work, at least not yet!

We are still developing search. If you have any suggestions for how searching should work, please leave a comment.

Some teething issues

Paperscape has been getting quite a bit of traffic in the past 12 hours. Thanks for your interest!

With all the traffic, we have encountered one mild bug. When you click on a paper your browser sends the location of the click to our servers, which then return the associated paper id, if one exists at that location. On rare occasions it is possible to request a paper at a (NaN,NaN) location (yes, I know, that’s strange!), and this was causing issues with our server looking for that location. Consequently, search and clicking on papers was down for a few hours.

It should be fixed now. Please, let us know if you run into any problems.

Labelling regions of the map

The labels on the map are generated mostly automatically. When zoomed out, arXiv categories are displayed, and the position of the category label is computed as the average of all papers in that category. As you zoom in, these category labels disappear, and are replaced by individual labels on top of each paper, so long as that paper is “big enough” on screen. The labels for each paper are determined by analysing the title and abstract, looking for common keywords.

We have now added a third layer to this labelling process: we identify by eye regions of the map that have a definite theme, and give these regions a generic, but not too generic, label. For example, we can identify cleary the “neutrino” area in the north, and the “inflation” area at the interface of hep-th and astro-ph.

These new labels make the transition from arXiv category to keyword labels a bit easier to follow, and also allows you to more easily understand where you are on the map.

In the future we plan to implement a more sophisticated way of labelling that transits smoothly between zoom level, much like in a map of the geographic world. If you have any suggestions for this, please leave us a comment.