Jennifer Smith

Cat Scat DNA

The EOL Profile Newsfeed contains comments left for its owner by other members and EOL Community invitations, and gathers updates associated with the items in the owner's EOL watch list.

  • Jeff Holmes commented on "EOL API Discussion Group":

    @Yan Wong: You can use the "exact" parameter with the Search API. That should work by only returning pages that exactly match the search string.

    about 1 year ago
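
A minimal sketch of the call described above, for Yan's "Platanista gangetica" example. The thread only confirms the "q" and "exact" parameters; the base URL and the value "true" are assumptions, so check them against the API docs before relying on this.

```python
from urllib.parse import quote, urlencode

# Assumed endpoint: the thread names the Search API and its "q" and "exact"
# parameters, but not this base URL or the value "true".
SEARCH_BASE = "http://eol.org/api/search/1.0.json"


def build_search_url(name, exact=True):
    """Return a Search API URL for a scientific name."""
    params = {"q": name}
    if exact:
        params["exact"] = "true"
    # quote_via=quote encodes spaces as %20, as in the calls quoted in the thread.
    return SEARCH_BASE + "?" + urlencode(params, quote_via=quote)


print(build_search_url("Platanista gangetica"))
```

Whether this returns only the species page, rather than the genus page or the 45 extra matches, still has to be checked against the live service.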

  • Yan Wong commented on "EOL API Discussion Group":

    @Jeff Holmes:

    Great thanks Jeff.

    Another thing, I'm trying to find an exact match for species names, e.g. "Platanista gangetica". If I call the search API with q=Platanista%20gangetica, I get the genus page before the more precise species one. If I call it with quotes, q=%22Platanista%20gangetica%22, I get the most precise one first, but I also get 45 other matches.

    Is there a call I can make to get the most precise match first without all the other stuff? Or does the order of exact search results need tweaking by EoL?

    about 1 year ago

  • Jeff Holmes commented on "EOL API Discussion Group":

    @Yan Wong: Hi Yan, I think "unreviewed" objects are being returned as "unknown". I have seen untrusted objects returned as "untrusted" though. Take a look at the results of this call: http://eol.org/api/pages/1.0/914512.json?images=75&videos=0&sounds=0&maps=0&text=2&iucn=false

    about 1 year ago

  • Barb Banbury commented on "EOL API Discussion Group":

    Is there any way to increase the limits on the page parameters (i.e., images, videos, etc.)? Is 75 the upper limit here? What if we are interested in the 76th image? Or all media of Animalia?

    about 1 year ago

  • Yan Wong commented on "EOL API Discussion Group":

    More random questions, sorry - I feel I should start a FAQ! The Creative Commons seem to have retired the public domain licence that is returned in the "license" field of PD dataObjects (http://creativecommons.org/licenses/publicdomain/). Are there plans to start classifying public domain items listed in EoL under the replacement CC-Zero or CC-PD-mark licences?

    about 1 year ago

  • Cyndy Parr commented on "EOL API Discussion Group":

    @Yan Wong: Not all providers use the "source" agent. They probably will have a "supplier" though.

    about 1 year ago

  • Yan Wong commented on "EOL API Discussion Group":

    @Jeff Holmes:

    My fault slightly, I don't think I'm getting "Unreviewed" as a return value for vettedStatus at all, just "Trusted" and "Unknown". Can I assume that there might be some objects labelled "Untrusted" in there as well?

    Another minor point: should all images have a "Source" field? DataObjectID 19887625 doesn't seem to.

    about 1 year ago • edited: about 1 year ago

  • Jeff Holmes commented on "EOL API Discussion Group":

    Actually, I'm not sure why the API is returning all content by default with the pages API. Content that has been specifically "untrusted" by curators should not be returned by default in my opinion.

    about 1 year ago

  • Cyndy Parr commented on "EOL Discussion Group":

    @Yan Wong: Okay, I'll contact you by email and we can pick this up there. Nice work!

    about 1 year ago

  • Jeff Holmes commented on "EOL API Discussion Group":

    @Yan Wong: Hi Yan, I agree that it is a bit confusing since the vetted parameter documentation doesn't say that the parameter will be referred to as "vettedStatus" in the returned data. I think it's working though since setting the vetted parameter to "2" in an API call does return all content except those that have been marked "untrusted" specifically by a curator. Not sure where the "unknown" is coming from but I imagine that is equal to "unreviewed". In other words, content that has come in from a source that is not "trusted" by default. This would include Flickr, Wikipedia, etc. Some of the other values you mention (e.g. roles) are hard to predict because they depend on what content partners have provided in their data transfer. Cheers, Jeff

    about 1 year ago
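
A rough sketch of a pages API call with the vetted parameter set to "2", as described above. The URL pattern and the media parameters are copied from the example call to page 914512 posted elsewhere in this thread; the helper name is made up for illustration, and only the meaning of vetted=2 is taken from the comment.

```python
from urllib.parse import urlencode

# URL pattern taken from the example call quoted in this thread.
PAGES_BASE = "http://eol.org/api/pages/1.0/{page_id}.json"


def build_pages_url(page_id, vetted=None, **media):
    """Build a pages API URL (hypothetical helper, not part of any EOL client)."""
    params = dict(media)
    if vetted is not None:
        # Per the comment above, vetted=2 returns everything except content
        # that a curator has specifically marked "untrusted".
        params["vetted"] = vetted
    return PAGES_BASE.format(page_id=page_id) + "?" + urlencode(params)


url = build_pages_url(914512, vetted=2, images=75, videos=0, sounds=0,
                      maps=0, text=2, iucn="false")
print(url)
```

The returned dataObjects carry the result in a text field called "vettedStatus", not "vetted", which is the source of the confusion discussed here.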

  • Yan Wong commented on "EOL API Discussion Group":

    Thanks Cyndy. Yes - I've been working from the API doc page. The only ref I can see to the vetting is describing the "vetted" parameter (0, 1 or 2) when calling the API. The actual field returned is a text field called "vettedStatus" which (as far as I have found) can be either "Trusted", "Unreviewed" or "Unknown". But I can't see that documented anywhere. I've been bitten when assuming certain text values in the API (e.g. sometimes instead of returning role="photographer", you get role="fotograaf" or various other translations). So it might be helpful to list the possible return values for these fields (as well as specifying the sort order).

    I see what you mean about discouraging excess html. But when I started posting it wasn't even clear to me that I could use html to break up paragraphs, etc.

    about 1 year ago • edited: about 1 year ago

  • Cyndy Parr commented on "EOL API Discussion Group":

    @Yan Wong: Hi Yan, did you look at http://eol.org/api/docs/pages/1.0? I agree that the default sort order isn't explained in the documentation. I think that they are sorted in the order of dataRating although we also have a concept of "exemplar image" which is set by curators and can override (on EOL pages) the ratings (which could be vulnerable to user whimsy). I believe the exemplar image is supposed to be returned first by the API. But let's see what the developers say, and ask them to add it to the API documentation.

    I'm pretty sure we haven't documented what HTML is allowed because we don't want to necessarily encourage highly HTML-formatted text in general (it can break pages in various ways). But I agree it would be helpful to have a whitelist somewhere.

    about 1 year ago • edited: about 1 year ago

  • Yan Wong commented on "EOL Discussion Group":

    @Cyndy Parr:

    I've updated http://yanwong.me/MapImageTest.pdf with 225 new (green) data points picked haphazardly from monocots, composites, and pinophyta. The few green dots among the cloud of magentas are (I think) line drawings of plants. You can't see it in this plot, but the png dimension completely separates out these plant line drawings from the maps, so yes - it seems to work almost better for plants. Dunno about microscopy images, though.

    Cyndy, I'll sort out a data file for you, so you can have a look in 3D yourself, if you like. Do you have an email address or whatever to which I can send the file? It won't be that big.

    about 1 year ago • edited: about 1 year ago

  • Cyndy Parr commented on "EOL Discussion Group":

    @Yan Wong: Very promising approach! If you did some plants would you get the same results? I'm almost convinced we could use this to do some batch type changes.

    about 1 year ago

  • Yan Wong commented on "EOL Discussion Group":

    Just looking for good images that are used in WikiSpecies, but not on EoL. Is there any reason why, for example, the nice image at http://commons.wikimedia.org/wiki/Condylura_cristata isn't being picked up? As far as I can see it meets the criteria at http://eol.org/info/curate_wiki#commons

    about 1 year ago

  • Yan Wong commented on "EOL Discussion Group":

    @Cyndy Parr:

    I've written a little blog post about my first look at the problem of identifying maps. The plot on that page shows that just using the compression metrics seems to work extraordinarily well, at least for my limited dataset. http://yanwong.me/?p=709

    p.s. not sure this "map" that I picked up should be in EoL anyway: http://eol.org/data_objects/5853197

    p.p.s. Cyndy: I've just done a plot with all the images in your "Retag as map type" collection added (in magenta) - with this larger dataset just using jpg vs gif compression seems to work best, although adding PNG compression data as well would probably help remove a few of the false positives. See the plot at http://yanwong.me/MapImageTest.pdf - the numbers on each point are dataobject IDs, so you can find the corresponding pictures on EoL

    about 1 year ago • edited: about 1 year ago

  • Yan Wong commented on "EOL API Discussion Group":

    Hi - new here. I've a few specific questions about the page API - perhaps there is some detailed documentation somewhere?

    1) what are the possible values for the "vettedStatus" parameter on dataObjects? I'm guessing Trusted, Unreviewed, then Unknown?

    2) what order are the dataObjects returned in? I'm trying to grab a single public domain or CC-BY image for each species, and I'd like the one with the most trustworthy vettedStatus, and, if several have the same value, the one with the highest dataRating. I imagine this is the most likely order to want. At the moment I'm just grabbing the first 10, and sorting them myself, but it would be less strain on EoL bandwidth to simply ask for a single image, with the appropriate sort order.

    p.s. - it's not clear to me what HTML formatting is allowed in posts on this group. Might it be an idea to put a link in to a page detailing how to use the discussion groups appropriately? I see I'm allowed <small> :)

    Cheers

    about 1 year ago • edited: about 1 year ago
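
The client-side sort described in question 2 could look something like the sketch below. The field names "vettedStatus" and "dataRating" come from the thread; the ranking of the status values is an assumption (the replies suggest "Unknown" may simply mean unreviewed, so the two are ranked equally here), and the sample records with their "id" keys are invented for illustration.

```python
# Assumed ranking, lower is better. The thread does not document the full
# set of vettedStatus values or their order.
VETTED_RANK = {"Trusted": 0, "Unreviewed": 1, "Unknown": 1, "Untrusted": 2}


def best_first(data_objects):
    """Sort by most trustworthy vettedStatus, then by highest dataRating."""
    return sorted(
        data_objects,
        key=lambda obj: (
            # Unrecognised statuses sort after every known one.
            VETTED_RANK.get(obj.get("vettedStatus"), len(VETTED_RANK)),
            -float(obj.get("dataRating", 0)),
        ),
    )


# Invented sample records, not real EOL data.
sample = [
    {"id": "a", "vettedStatus": "Unknown", "dataRating": 4.5},
    {"id": "b", "vettedStatus": "Trusted", "dataRating": 3.0},
    {"id": "c", "vettedStatus": "Trusted", "dataRating": 4.0},
    {"id": "d", "vettedStatus": "Untrusted", "dataRating": 5.0},
]

print([obj["id"] for obj in best_first(sample)])
```

Taking the first element of the sorted list gives the single image wanted per species, at the cost of still fetching several candidates.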

  • Yan Wong commented on "EOL Discussion Group":

    @Cyndy Parr: I've just had a quick play with classifying map images on EoL - since I've downloaded 738 pd/cc-by images to play with, of which 25 are maps (one for each placental mammal, as it happens). A first pass is to see how well they compress using different methods. If you plot the compression ratios achieved using jpg and png compression, a good whack of the maps cluster in an obvious group. Have a look at http://www.yanwong.me/placentalmapclassification.pdf (images that are maps are marked in red). The EoL image name is written in tiny letters on each point, if you want to check up on them, but I don't know if these are unique names. It might take a few minutes longer for me to figure out how to put the EoL object id there instead. I've got a quick perl script that will calculate this in-memory, without saving jpg and png files to disk - you could try it on your maps collection. I've also got a few ideas for improving the clustering, but it seems a reasonable start.

    about 1 year ago
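
To illustrate the idea behind the compression metric, here is a small in-memory sketch. It is not the perl script mentioned above: it swaps in zlib (DEFLATE, the same family of compression used inside PNG) for the jpg and png encoders, and it runs on synthetic byte strings rather than real EOL images, so it only shows why flat-coloured, map-like data separates from noisy, photo-like data.

```python
import random
import zlib


def compression_ratio(raw):
    """Original size divided by zlib-compressed size (higher = more compressible)."""
    return len(raw) / len(zlib.compress(raw, 9))


# Synthetic stand-ins for decoded RGB pixel data, not real images.
# Map-like: two large flat-colour regions.
flat_pixels = bytes([255, 255, 255] * 5000 + [0, 0, 200] * 5000)
# Photo-like: every byte differs unpredictably from its neighbours.
rng = random.Random(0)
noisy_pixels = bytes(rng.getrandbits(8) for _ in range(len(flat_pixels)))

print("map-like ratio:  ", compression_ratio(flat_pixels))
print("photo-like ratio:", compression_ratio(noisy_pixels))
```

With real images one would compute a ratio per encoder (jpg and png, as in the plots linked above) and look for the cluster of maps in that two-dimensional space; where to draw the boundary is an empirical question this sketch does not answer.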

  • Yan Wong commented on "EOL Discussion Group":

    @Cyndy Parr: Ah - thanks.

    about 1 year ago