April 5, 2024

New Computer Vision Model (v2.12) with 1,983 new taxa

We released a new computer vision model today. It has 86,861 taxa up from 84,878. This new model (v2.12) was trained on data exported on February 02, 2024.

Here's a graph of the models release schedule since early 2022 (segments extend from data export date to model release date) and how the number of species included in each model has increased over time.

Thanks to some work by the team described here, we are going to start posting accuracy estimates with these model releases estimated against 1,000 random Research Grade observations in each group not seen during training time. The paired bars below compare average accuracy of model 2.11 with the new model 2.12. Each bar shows the accuracy from Computer Vision alone (dark green), Computer Vision + Geo (geen), and Computer Vision + Geo + Cropping Change (light green). "Cropping Change" is a slight modification to the way images are prepared before they are sent to the CV model that resulted in an average 2.1% improvement.

Overall the average accuracy of 2.12 is 89.1%. You can see the average accuracy varies by taxonomic group and continent from as low as the 60s for Africa and as high as the 90s for Europe and North America. Also note that on average 2.12 is 1.1% more accurate than 2.11 which is consistent with our goal of keeping model accuracy roughly stable as we continue to add thousands of taxa each month (as described here we probably expect <2% variance all other things being equal).

Here is a sample of new species added to v2.12:

Posted on April 5, 2024 12:20 AM by loarie loarie | 39 comments | Leave a comment

April 3, 2024

3,000,000 Observers!


This week, we achieved a significant milestone on iNaturalist: passing 3 million observers!

A crucial 5%

While it might not appear so at first glance, the observer community on iNaturalist actually represents a small percentage of the total audience. For instance, this March, approximately 4.5 million people visited the iNaturalist website or used the iNaturalist or Seek mobile apps. Out of these, 600k (13%) had iNaturalist accounts, and only 245k people (~5%) posted observations.

Mapping observers

The maps show the location of first observations for a sample of about 15% of all observers around the globe. About 60% of all observers are in North America. In contrast, South America has about 6% of all observers.


Europe has the second largest number of observers followed by Asia. Africa has the least number of observers of any continent except Antarctica with most observers in the southern part of the continent.



Outsized Oceania

With an area of around 8.5M square kilometers, Oceania is the smallest continent. But it has an outsized number of observers thanks to the active observer communities in Australia and New Zealand.


Thank you!

If you're one of these 3 million observers, thank you for helping us reach this milestone! And if you'd like to join the even smaller, crucial group of iNaturalist supporters (0.05%), you can become one by clicking the link below!


Donate to iNaturalist



Posted on April 3, 2024 06:40 AM by loarie loarie | 29 comments | Leave a comment

March 31, 2024

iNaturalist March News Highlights

Happy Easter everyone! We had a hard time narrowing down our highlights this month from so many great stories. If you missed last month’s highlights, you can catch up here.

Species Discoveries

Here are three of the most amazing species discoveries on iNaturalist from this month:

A. In New Zealand, @pav_johnsson hung a mothlight from his hotel balcony while on a birding trip and became the only living person known to have seen the holy grail of New Zealand moths, the Frosted Phoenix moth.

B. In California, @lcollingsparker posted an observation that, with the help of scientists @easmeds, @bugsoundsjc, and @willc-t, turned out to be the rediscovery of Western Red Cicada which was thought to be extinct.

C. In Brazil, @birdernaturalist played an important role in the rediscovery of the Tananá, a katydid used as an example in Darwin’s writings on sexual selection that hasn’t been seen for 150 years.


Distributions and Range Extensions

These three examples illustrate why iNaturalist is such a powerful tool for monitoring changing species distributions in real time:

D. As shown in this observation by @pondgators, the sudden appearance of thousands of By-the-wind-sailors covering beaches documented by the iNaturalist community made headlines in California and Oregon.

E. In Arizona, @lynnharper, @hootyowl52, and @cameronramsey discovered the first U.S. record of a Mexican Beetle.

F. In Brazil, @rogerriodias contributed the northernmost record of an extremely rare Lance Lacewing to this study published by @calebcalifre.



Invasive Species Science

G. @brendaramirez and @jmccorm’s Free-Flying Los Angeles Parrot Project was featured in this article on how these non-native parrots like this one are adapting to urban forests of California.

Similarly, iNaturalist data is revealing how Swinhoe’s white-eye is expanding in California by using these urban forests according to a new study in Biological Invasions by @devonderaad and colleagues.


Conservation Science

iNaturalist is scaling how the conservation community is monitoring and protecting endangered species.

H. Thanks to observations like this one by @renae_mermaid, the iSeahorse project on iNaturalist improved International Union for the Conservation of Nature Red List assessments for 35 of the 46 known seahorse species.

I. Florida is home to the critically endangered Smalltooth Sawfish such as this one observed by @rangeldiaz. Scientists like @dean167 are trying to understand why they are mysteriously dying and are asking the public to post observations to iNaturalist.

J. The collision death of Flaco this month, the beloved owl that escaped from the Central Park Zoo in New York, has refocused attention on bird deaths by building collisions and the many iNaturalist projects seeking to better understand and prevent this phenomenon by gathering observations like this one by @jtmerryman.

Climate Change Science

This study in PLoS from March used iNaturalist data to project the extent to which climate change will shuffle the species found in North American cities in the next few decades. Here are two more examples of how scientists are using iNaturalist to understand climate change impacts.

K. Climate change is causing earlier springs and warmer winters which is taking a toll on species. In Massachusetts, @maryhannah are monitoring how native hemlock trees are under increased pressure from the hemlock wooly adelgid pest which thrives in warm winters.

L. Similarly, @fuzzybumblebee explains how warmer winters in Minnesota are stressing native bees like this one observed unusually early by @daniellehudson by interrupting their winter hibernation.

iEcology

We continue to be amazed at all the ecological insights scientists are gleaning from information contained in iNaturalist images (iEcology). For example, this month we saw of studies on:

M. Understanding color aberrations - such as this white Rufous-collared Sparrow observed by @myerssusan included in this publication from Ecuador. @allisonshultz is also using iNaturalist to understand bird plumage patterns. This PNAS study on neon colored sea anemones and this study on wasp pigmentation in Italy use iNaturalist to explore the link between climate and these changes in pigmentation

N. Discovering new hosts species for spider wasps - such as this tarantula hawk observed parasitizing Dotted Wolf Spider by @rochalita that was included in a new study by @rickcwest and @frankkurczewski.

O. Understanding life histories of gall wasps - such as this observation of the gall made by a Dandelion Gall Wasp by @earley_bird included in this study by @louisnastasi.



AI Naturalist

P. In Germany, @jchiavassa and colleagues published a study on an automated malaise trap that leverages iNaturalist's Computer Vision AI to demo autonomously recording and identifying biodiversity records.

iNaturalist continues to be a standard benchmarking dataset used in AI studies around the globe like these five this month from Zhang et al., Bu et al., Cong et al., Hogeweg et al., and Daroya et al.


Bioblitzes and Events


Three big upcoming events dominated the news in March:

iNaturalist and Social Sciences

In a Nature Sustainability paper titled “Mobile Apps for 30×30 Equity”, @duanbiggs and colleagues recommend building on iNaturalist to link data collection to payments in order to simultaneously fill data gaps and provide financial support for people in developing economies. This study dovetails with another study in People and Nature this month by @cesarestien and colleagues on how socioeconomic factors can drive participation gaps on iNaturalist.



iNaturalist’s Education Impact

R. This article describes an "Exploring Nature using iNaturalist" training in Indonesia that @naufalurfi helped teach.

Other articles on using iNaturalist in the classroom included this one on bioblitzes by @issacvshl, this one on elementary school climate change education in Portugal, and this one by @katetilly and colleagues on using iNaturalist in the college classroom setting.


iNatters in the News

S. Don’t miss this great profile on @dj_maple from Australia who has posted over 15,000 observations to iNaturalist including many rare moths.

T. In British Columbia, this article describes @stephbrulot’s encounter with a 2-foot deep sea Giant Siphonophore

U. This article by @underwaterpat describes his encounter with a ribbon worm hunting a goby off the California coast.

V. This article tells the story of @srichakrapranav and @vimalraj’s encounter with a group of Sea Swallows along the Bay of Bengal in India



Thank you to everyone who participated in iNaturalist this March. We're amazed by all the impact this incredible community is having around the world. You can become an iNaturalist supporter by clicking the link below:


Donate to iNaturalist


Posted on March 31, 2024 07:08 PM by loarie loarie | 27 comments | Leave a comment

March 27, 2024

Of Isopods and Whelk Algal Gardens - Observation of the Week, 3/27/24

Our Observation of the Week is this isopod (potentially Exosphaeroma kraussi) on a Smooth Plough Shell snail (Bullia rhodostoma), seen in South Africa by @penel1!

“I happened to be climbing the mountains above Llandudno Beach (about 5 km from where I live in Hout Bay, Cape Town) on the Southern Apostles,” recalls Penelope Brown. 

The day was hot so afterwards I dropped down for a swim. The tide happened to be very low and the sea calm and very cold (due to intense upwelling). I had time before going home for lunch so I did a bit of iNaturalisting in the intertidal zone…see other photos on iNat at the time! And that is when I noticed the isopod / sea louse on the shell of one of the Bullia (plough shells). I was intrigued, so I took a few photos, posted them and then promptly forgot about them until the smiley sandy beach ecologist, Linda Harris (I don't know her), got all excited and proposed it as the Observation of the Day…To be honest, not being a very skilled iNaturalist participant, I did not even know that there was something like that!!  

Linda Harris (@linda_harris) is a South African marine scientist who specializes in coastal ecosystems, and as Penelope said she was quite excited by this observation. I reached out Linda for more information about the species involved, and any insights she might have into what was documented in the photos. She tells me that the snail is a type of whelk which can be found in the intertidal areas of sandy beaches, migrating along the beach as the tides change. 

These plough shells are scavengers with an exceptional olfactory system that allows them to detect even the faintest scents of beach-cast fish, jellyfish and bluebottles. They are so well adapted to the erratic supply of food that they can consume enough from one meal to last them for 18 days. However, they also have “algal gardens” on their shells on which they can forage when food is scarce. Because of their lifestyle of burrowing in and emerging from sand in the swash zone, with constant, relatively fast water movement around them, only the snails themselves can graze on their algal gardens. No other animal has previously been observed foraging on the algal gardens growing on these snails’ shells. That’s what makes this observation so unique. 

The plough shells must have been in the sheltered, shallow pool of water long enough for the isopod to detect the source of food and to start grazing on the algal garden. I haven't had a chance to key the isopod out yet, but I suspect this is one of the rocky shore isopods because I don't recognise it as a (at least, common) beach species. It’s the first time an interaction like this has ever been observed.

Penelope (above, on Table Mountain) grew up on a farm in South Africa’s Eastern Cape and credits the freedom she had to explore her surroundings for instilling her an interest in nature. “Walking with my father, she says, “we used to  sometimes listen to the ‘grass grow’ or sit quietly watching a buck nibble the vegetation, and when having tea in the forest with my mother, we'd leave little bits of chocolate cake for the fairies…it was magical!”

She ended up studying zoology at university, “on the advice of some senior students at my residence, because that department was ‘way more fun’ than the botany department (not really a good reason, but good enough as it turned out)” but ended up spending a lot of time with plants anyway, researching phytoplankton production and bloom dynamics in the the southern Benguela off the Cape Peninsula and diving in kelp beds. Then,

being increasingly intrigued by the ‘fynbos’ in of the Cape Floral Region while exploring the very different (cf Eastern Cape) mountains, the vlakte (plains) and coast of the Western Cape, my interests veered more into the terrestrial biota ... and this is where, later in life, iNaturalist eventually provided a space to 'formalize' my interactions with the incredibly biodiverse region in which we are privileged to live. (I actually started with iSpot with Tony Rebelo, a SANBI botantist friend, and dedicated local curator of iSpot and now on iNat, and a wonderful advocate for it!) 

I enjoy using iNat especially when I am on my own, and can immerse myself in it and bumble along happily in nature. I use it wherever I am, on and off, as it is a good record of where I have been and what we saw there. However, more specifically, I am tending to use it more and more for recording the locally indigenous plant species, and also for invasive alien plants, in our catchment (the Baviaans river catchment between Skoorsteenkop and Constantiaberg) which our community group is systematically clearing of invasive alien vegetation.

(Photo of Penelope taken by Judy Jepson. Some quotes have been lightly edited for clarity.)


- Linda says she based a lot of what she wrote on research by A.C. Brown, which you can find on Google Scholar. She also recommends checking out page 14 of this paper [PDF] about whelk algal gardens

- way back in 2016, @oryzias‘s observation of an isopod attached to a fish was an Observation of the Week!

Posted on March 27, 2024 07:01 PM by tiwane tiwane | 12 comments | Leave a comment

March 26, 2024

New Evidence of Presence Values Added

Separate from the new Established annotation, we’ve also added four new values for the Evidence of Presence annotation to address some requests from the community. They’re listed below, along with their definitions. Mousing over an annotation value on an observation page will bring up its definition.

  • Leafmine: Evidence of feeding between the dermal layers of a leaf [within Pterygota, with a long list of exceptions found here] Keyboard shortcut: e + l
  • Hair: Hair that is no longer attached to the organism from which it originated [within Mammalia, except Homo] Keyboard shortcut: e + h
  • Egg: Whole egg or part of an egg [within Animalia except Placentalia] Keyboard shortcut: e + e
  • Construction: Something created by an animal, made with or excavated from other materials [within Animalia except Homo] Keyboard shortcut: e + c

“Leafmine” should get more observations of leafmines in front of the eyes of experts, and also be useful in the taxon photo browser.

“Hair” is similar to the current “Feather” annotation for birds, but for mammals. It’s for hair that’s found on the ground or on another object.

“Egg” allows one to annotate an observation of a broken egg or piece of an egg without resorting to the Life Stage annotation for “Egg”, which isn’t usually accurate for observations of egg pieces. Often, by the time you find egg pieces, the organism is likely at a different stage of development, so using the Life Stage “Egg” annotation can cause inaccuracies with phenology data.

“Construction” is meant for observations of anything an animal has constructed or excavated out of other materials, such as nests, burrows, hives, spider webs, beetle galleries, egg sacs, and the like. Rather than make separate values for all of those things and assign them to specific taxa, we decided to make a more general annotation for ease of translation and maintenance. It’s not meant for coral reefs, galls, or mollusc shells, which are either part of the animal’s body or are instigated but not built by the animal. Note that an observation can have multiple Evidence of Presence annotation values, so a bird in a nest could be annotated as both “Organism” and “Construction”.


We understand that these don't address every request or need - it's a balance of trying to find terms that work for a fairly broad set of observations and help with parts of the site like the taxon page and taxon photo browser.

Finally, these new values will not be immediately available in the taxon page graphs, it will require extra design work.

Posted on March 26, 2024 08:00 PM by tiwane tiwane | 49 comments | Leave a comment

Piloting an Establishment Annotation with Amphibians and Reptiles


We’re piloting a new Establishment annotation with Amphibians and Reptiles to track observations of organisms that are wild but likely don’t belong to an established population.

Why are we doing this?

There is ongoing confusion about whether observations of non established individuals such as escaped pets should be marked as wild or captive/cultivated. Here’s an example.

What does Establishment mean?

The Establishment annotation has a single value: “Not Established”. Observations of individuals belonging to established populations should not have the “Not Established” value. Established populations are native populations or introduced populations that are self-sustaining.


For Amphibians and Reptiles, the Not Established value should be set for observations of escaped pets, released pets, hitchhikers on nursery plants, populations confined to within greenhouses, or (for sea turtles and sea snakes) extreme vagrancy (e.g. an asian sea snake washing up in California). Long-lived escaped/released pets such as turtles should be assumed to be Not Established unless there is evidence that the population is reproducing.


There is no value to mark observations as Established, only as Not Established. We did this because we wanted to replicate the voting between binary states behavior that occurs in the Data Quality Assessment using Annotations without impacting the existing user interface.


Why just Reptiles and Amphibians?

Reptiles and amphibians are a good place to start because they account for a large portion of controversial escaped / released pet observations but they aren’t prone to less defined states (e.g. extreme vagrancy in birds, pinioned ducks, managed ungulates, garden volunteer plants, etc.) which may require more discussion about the definition of Not Established before rolling this annotation out to other taxa.

How does this impact Captive/Cultivated?

We’d like to clarify that observations of individuals not in captivity at the time they were encountered should not be marked as Captive/Cultivated. For example:

A note on gray areas

We’ve tried to be as clear as possible on where the line between Captive and Wild and Established and Not Established states should lie. But there are broad gray areas between these states and many states are simply unknowable. We therefore don’t expect there to always be agreement for every gray area observation. Please use comments on the observation to work towards a shared understanding, but please make an effort to be respectful and polite in your comments and remember we can disagree without being disagreeable.

How do I exclude Not Established Reptiles and Amphibians from my searches?

We added some new Explore parameters that allow you to manually construct URLs to exclude Not Established observations from your searches. Here are some examples:

  • a search for reptiles in the US state of Colorado excluding any that are not established
  • a search for not established reptiles in Colorado
  • a search for not wild reptiles in Colorado

We’re excited that by applying the Species Tab view to search number (1) above iNaturalist can now filter the set of reptiles typically found in field guides which are generally restricted to established species only.

It also makes it easier to surface newly established populations, for example Chihuahuan Spotted Whiptail (Aspidoscelis exsanguis) is not in the field guide above but is has been documented on iNaturalist as a newly established species in Colorado.

What about the escapee/non-established observation field value?

Many have been using the Escapee/Non-established Observation Field. While this annotation serves the same purpose, we did not migrate over data from the Observation Field. But please feel free to use that as a resource when adding your own annotations. It’s currently impossible to construct searches like number (1) above using Observation Fields so we encourage you to use the Annotation for Amphibians and Reptiles.

Posted on March 26, 2024 08:00 PM by loarie loarie | 34 comments | Leave a comment

A larger experiment to learn about the accuracy of iNaturalist observations

We launched our 3rd Observation Accuracy Experiment (v0.3) today. Thank you to everyone helping us conduct these experiments as we continue to learn how to improve accuracy on iNaturalist.

Changes in this Experiment

The only change is that this experiment uses a sample of 10,000 observations whereas the previous two experiments (v0.1 and v0.2) each used samples of 1,000 observations. We hope this larger sample size will allow give us more insight into more nuanced subsets of iNaturalist observations than the first two experiments.

The page for this experiment is already live and the stats will update once a day until the validator deadline at the end of the April 1st. You won’t be able to drill into the bars to see the sample observations or the validators until the deadline has passed.

Thank you!

Thank you to everyone contacted as a candidate validator for participating in this experiment. This is a more ambitious experiment than the first two and very much appreciate your participation with this larger sample size - especially for those of you asked to review up to 100 observations! As always, please share any feedback or thoughts you may have on the topic of data quality on iNaturalist.

Results (added 04/02/2024)

The results for our third observation accuracy experiment and our first one with a sample size of 10,000 observations are in. We’re so grateful to everyone who participated!

Results

The results of this experiment were very similar to the first two experiments. The average Research Grade accuracy (fraction correct) was 95%. You can explore the results including clicking through bar charts to observations here.

Other issues

Logistics
With over 2,750 participating validators, coordinating this was no small task. We’re mainly hearing three issues about the experiment logistics:

1. Concerns about mismatching validators and observations

We can’t perfectly predict which observations people are comfortable IDing, but we can probably improve upon the current methods by further constraining matching observations and candidate validators by location.

We want to emphasize adding a coarse, ancestor ID to an observation you are not able to comfortably ID does not interfere with the experiment design. But we realize that it is not a great experience for validators to receive a batch of observations they aren’t familiar with and may result in less participation in these experiments. Next time, we’ll try some new techniques to further constrain things by location.

2. Issues with communicating via messages

In v0.1 we contacted people via a no-reply email but realized that many people don’t receive or open emails from iNaturalist. So for v0.2 and v0.3 we used the messaging infrastructure to send messages from an Admin user account. Two issues are that:

  • Many people are responding to these messages since it’s not clear that it's a no-reply message. We don’t have the capacity to handle the hundreds of replies to the messages and our apologies if those messages go unanswered.
  • Many people interact with messages on the Android mobile device in which the URL doesn’t work which is causing confusion

In the next Experiment, we’ll explore other ways to address these issues with contacting and communicating with candidate validators.

Design
We’re mainly hearing two concerns about the design of the experiment

3. Distrust of the validators

There’s some continued distrust of the validators’ capacity to not incorrectly validate an observation. For example, there is concern that a validator would knowingly add an agreeing species level ID to an observation that they aren’t able to independently identify rather than follow the instructions and add a coarser family or order level ID.

Some of this distrust probably stems from discomfort with our methods of selecting semi-anonymous validators based on their reputation earned on the platform (number of improving identifications etc.) as opposed to externally credentialed validators selected based on their reputation earned elsewhere. We understand these concerns and we’d love to learn more about ways to vet, coordinate, and incentivize validators at scale that improves upon our methods here and increases the level of trust.

4. Interest in measurements other than average accuracy

Our main question is “how accurate are iNaturalist observations?”. To answer this, our methods are to draw a random sample form iNaturalist and estimate the average accuracy.

Several people have expressed more interest in knowing the accuracy of different subsets of the iNaturalist dataset such as their taxa or place of interest. We’re also interested in this and may adjust the sample structure in future experiments to more efficiently focus on different subsets. But we're excited that with the larger random sample size of this experiment (10,000 vs 1,000) we can start getting reasonably certain estimates for some less common subsets.

For example, here's a graph of the subsets by continent, taxon group, and rarity (<100 observations is rare) sorted by the uncertainty (95% confidence intervals) in accuracy (fraction correct) from Research Grade results from v0.3. We're more certain for the groups you expect (common North American and European plants insects and birds). Those groups also have high accuracy in the 90th percentile. But there are groups like common European Fungi with 0.81 (0.68-0.91) where we are nearly certain that the RG fraction correct is <90%. For other groups, we'll need much larger sample sizes or targeted samples to reduce the uncertainty.

Thanks again for participating in this experiment. It's a huge opportunity to be able to conduct experiments like this at this scale that wouldn't be possible without such a skilled, engaged, and generous community of identifiers.

Posted on March 26, 2024 01:56 AM by loarie loarie | 125 comments | Leave a comment

March 19, 2024

Getting a Jelly out of Salp Soup - Observation of the Week, 3/19/24

Our Observation of the Week is this Flag-mouth Jelly (in the genus Diplulmaris), seen off of New Zealand by @millamuck!

“I studied marine biology and although I didn't end up working in that area, I continue to love the water,” says Camilla Caton. “I joke that you could put me in a bathtub with some sea monkeys and I'd be happy.”

Camilla credits her father for her instilling in her a passion for nature. 

From when I was little we'd hang on the couch and watch nature documentaries, get up at 4am to go to a local estuary and watch the native birds arrive, visit aquariums, zoos and reptile parks.

On March 3rd of this year, Camilla and her diving club took to the water, which is when she spotted the jelly you see here.

Our local dive club had a lot of students that day, so a dive buddy and I decided to toddle off and try a different spot. I'd done a couple of night dives there previously but hadn't been there during the day. The whole dive was absolutely incredible. Some parts were salp soup, not too many jellies, but the Diplumaris was the first of three jellies I'd not seen before.

If you take a look at the observation page, you’ll see that Camilla’s find generated some excitement. Luca Davenport-Thomas (@luca_dt), a marine biology student in New Zealand and avid iNatter, was one of the identifiers, and I contacted him for some insight into the find. It looks like the jury’s still out on a confirmed species ID, but Luca said the observation could be significant.

I had thought it could be Diplulmaris antarctica since that habitat isn't too far away. And I don't think they have been seen beyond Antarctic waters, which is quite exciting. But I never realised there is actually a different species of Diplulmaris, D. malayensis which is apparently very rare. As part of the plankton bloom @millamuck saw (and I sadly missed) there were also Phacellophora jellyfish (I found a dead one at the same spot the next day after most of the plankton had gone. Which is a very southern record since they aren't known from Antarctica). It makes me think the rare organisms in the bloom like the Diplulmaris could instead have been from up North so could actually be Diplulmaris malayensis! 

Speaking of Luca, Camilla (above with her diving club, all the way on the left with the tattooed arm) jokingly says she joined iNat partly because of his encouragement. “I kept getting pressured to upload my photos onto here! Mostly by @luca_dt

I like posting unusual stuff or sometimes if I'm just stoked with a photo!! I use iNaturalist for ID a lot, and will quite often research what species I could find when I travel to new spots.

I only got my open water in February last year, and am up to 180 dives already! Best thing I've ever done. Loving taking photos at the moment. Mostly macro but am only just starting on that journey.

(Photo of Camilla and the diving club taken by Claire Murphy. Some quotes have been lightly edited for clarity.)


- you can follow Camilla on Instagram here!

- “salp soup” is a real thing. Here’s an article (and some video) about a bloom off of New Zealand back in 2015. 

- @luca_dt’s amazing siphonophore photo was an Observation of the Week back in 2022!

Posted on March 19, 2024 06:28 PM by tiwane tiwane | 5 comments | Leave a comment

March 12, 2024

So a Snake and a Centipede Meet on a Trail... - Observation of the Week, 3/12/24

Our Observation of the Day is this meeting of a Chinese Red-headed Centipede (Scolopendra mutilans, 왕지네 in Korean) and an Ussuri Mamushi pit viper (Gloydius ussuriensis, 쇠살모사 in Korean) in South Korea! Seen by @lhurteau.

Originally from Vancouver, British Columbia, Leslie Hurteau has been teaching English in South Korea since 2017 and takes daily morning birding trips on Jeju Island, where he currently lives. 

I always keep my eyes open for any wildlife, especially reptiles and interesting arthropods during the warmer months. I was walking along a trail in a local city park, and some movement caught my eye so I looked down to see a snake (Gloydius ussuriensis) and a Chinese Red-headed Centipede (Scolopendra mutilans). It seemed the snake was interested in the centipede and was eyeing it down. The centipede scuttled under a nearby leaf, at which point the snake prepared itself to strike. 

Unfortunately the excitement ended there, as some other people walked by and caused both the snake and centipede to move off trail and into the side vegetation. So, I didn't see who won the standoff. A friend (@j-j) previously found another Gloydius species consuming the same type of centipede on the mainland, so I think it's possible the snake would have won this, although given it appeared to be a smaller juvenile, it wouldn't surprise me if the centipede would have a strong chance against it.

A fun side note, Gloydius ussuriensis is the only Gloydius species (a group of venomous pit vipers in East Asia) found on Jeju, and its colouration is so different from the mainland populations that it has attracted some potential interest from local researchers.

Not only do Gloydius snakes live in mainland Korea, they range west to the Ural Mountains, into South Asia, and east to Japan. Like other pit vipers, they inject venom through hinged fangs and sport heat-sensing pits on their snouts.

Chinese red-headed centipedes occur in East Asia, including Japan, and average about 20 cm (8 in) in length. Like other centipedes, they’re predatory and they subdue prey by injecting their venom through forcipules, which are modified front legs. 

Leslie (above, scanning for birds on Jeju Island), credits his parents for his interest in nature. They’d take him to parks, especially Stanley Park, in Vancouver, “and my dad would point out the names of different birds and plants that he knew.” He eventually volunteered with the Wildlife Rescue Association of BC and the Stanley Park Ecology Society.

Leslie started using iNat back in Canada, but really got into it after his move to Korea.

After moving to Korea and seeing how scarce information was in English on local fauna and flora, I decided to go all in and document nearly everything I find, with a focus on birds. Uploading every bird photo I could take greatly improved my bird identification skills, helping me learn how to spot various features even in not-so-ideal photos. 

It's been a fun journey, having now seen over 370 different species of birds in Korea, including one species that was a national first for the country. I'm happy to say it's a different situation today in 2024 with many observations in Korea from many different people. After moving to Jeju Island, my favourite part of Korea, I have been working even harder to upload observations to document the wildlife here. With development happening so fast in this country, it can be important to document what is here now in case habitat changes or is lost (which happens often here), or as species ranges change to deal with development and climate change.

iNaturalist has helped improve my identification skills, as well as make connections with other like minded individuals, here in Korea and elsewhere. It's a wonderful globally minded website that I recommend to anyone with an interest in nature.

(Some quotes have been lightly edited for clarity. Photo of Leslie was taken by Jiwone Lee.)


- some giant centipedes eat bats!

- two previous Observations of the Week document centipedes as prey: this one by @msone, and another by @magdastlucia!

Posted on March 12, 2024 09:24 PM by tiwane tiwane | 10 comments | Leave a comment

March 7, 2024

Some thoughts on ML accuracy

The folks on the iNat team who work on the iNaturalist Computer Vision Model and the iNaturalist Geomodel spend a lot of time thinking about how accurate the Machine Learning (ML) models are, and how we can improve them. It's a particular challenge since our computer vision training dataset grows by over a thousand taxa and by over a million photos every month.

We tend to visualize accuracy using charts like you'll see below, and we use these charts to validate our models internally: are they ready for prime time? Is this month's model good enough for release?

Sometimes our accuracy numbers would drift from month to month, and recently we set out to explore why that might be.

Some of this could be due to test set sampling differences - if we test using a batch A of photos for January and batch B for February, then we can expect to get slightly different results. It's also a little tricky to compare models directly to each other since the taxonomies are different. The Feb model knows about taxa that the January model didn't, for example.

Another theoretical reason might be some combination of ML training run differences: random initialization, optimizer, and variance in batch selection. In our experience this doesn't look like a big source of model accuracy difference from month to month but we'll keep an eye on it.

Another reason why our models might show different accuracy numbers from month to month is training set sampling error. We previously described the transfer learning strategy that we are using to train these monthly models. Models share the same foundational knowledge, but we use new photos each month to fine-tune the base model. For commonly observed taxa, we don't train on every image we have. Instead, we select at most 1,000 photos for training. For a taxon like Western Honey Bee, which our users have upload more than 400,000 observations of, every month we select a different fraction of photos to train on. Some months we might get better or worse photos than others, and that might result in better or worse performance.

In order to start exploring this last reason, when we made the export for the 2.11 model, we also made an alternate version of the export with the same taxonomy but selecting a different batch of photos. Then we trained an alternate model on this alternate export, and we did some comparison.

For each of the major groupings shown in the chart below, we selected 1,000 random RG observations not seen by the model during training process and compared the species ID'd by the community to the species predicted by the model suggestions. If the model agreed with the community, we considered the model correct, while if the model disagreed, we considered it incorrect.

Here's a chart showing the two models, accuracy performance compared. The y-axis shows top1 accuracy as a percentage, the x-axis shows major clade or geographical groupings, and the colors represent the different models compared against each other. The *-vision colors are computer vision model accuracy only, and the *-combined colors show accuracy when combining the computer vision model with our geo model.

(Note: I'm red-green colorblind and I have some difficulty distinguishing the 2.11-alternate-vision and 2.11-alternate-combined colors, but they are in the same order in each major grouping, so I can still interpret the chart. If anyone is struggling to interpret the chart, please let me know and I'll see if I can make a more accessible variant.)

For someone who hasn't seen one of these charts before, I'd like to point out a few interesting interpretations or conclusions. First, combining computer vision and geo modeling performance helps a lot, sometimes as much as 20%. Second, we can see our dataset bias in this result: our models know a lot more about taxa in North America and Europe than it does about taxa in South America or Africa. This mirrors our observer community and our dataset of photos to train on. When we have more images from an area, we have better performance. There are some other interesting questions to explore about why our models perform better or worse on some taxonomy groupings like herps or mollusks compared to birds or plants.

However, in the context of understanding this alternate experiment, we can takeaway that two models trained on the same taxonomy but different photos will see a small degree of variance in both computer vision and combined performance on the same test set.

Here's a look at how the 2.11-alternate model improved (or didn't) when compared directly against the 2.11 model:

We can see from this that we can probably expect less than a 2% variance from model run to model run based on sampling.

From this, we can do a better job of characterizing our model accuracy month-to-month: are things improving, getting worse, staying the same? It looks to us like our models are staying about the same in average accuracy while adding 1,000 new taxa a month. We think that's a pretty great result.

Another great way we can use the this experiment is to judge how much a change to model training or technology is really improving model accuracy. If we try out a supposedly better piece of technology to train a future model, but we see less than a 2% accuracy bump, then we should probably be a little suspicious.

We're excited to share the results here with you, and we plan to share accuracy metrics about our models when we release them from now on.

Posted on March 7, 2024 04:26 PM by alexshepard alexshepard | 15 comments | Leave a comment