Statistics on pictures created with MapComplete: licenses and a top 50 of authors

Posted by Pieter Vander Vennet on 1/10/2023

What licenses are used?

Now that MapComplete is two and a half years old, it’s a good time to see which licenses people are using when they upload their images.

Why do I care?

The first reason to do this research is curiosity. How many pictures are uploaded under which license?

The second reason is very practical and UX-driven: if a significant portion of contributors doesn’t bother to change the license, then the license picker can be moved from the ‘infobox’ into the ‘user settings’, freeing up space in the infobox. User tests have pointed out that this space is valuable.

Methodology

MapComplete uploads images to imgur.com and then links to the image with a tag such as image=https://i.imgur.com/aBcDeF123.jpg. Some metadata (most notably the author and the chosen license) is added as the ‘description’ of the image on Imgur. If multiple images are added, then the keys image:0, image:1, image:2… are used.

Finally, themes can also add images under a specific key. For now, only the etymology-map does this with image:streetsign.
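The key scheme described above can be sketched as a small filter over the tags of one feature. This is only an illustration: the function name, the regular expression and the sample tags are made up for this example and are not taken from the MapComplete script.

```python
import re

# Keys described in the text: "image", "image:0", "image:1", ... plus the
# theme-specific "image:streetsign" used by the etymology map.
IMAGE_KEY = re.compile(r"^image(:\d+|:streetsign)?$")
# Assumption: MapComplete always stores the direct i.imgur.com link.
IMGUR_PREFIX = "https://i.imgur.com/"


def imgur_links(tags: dict[str, str]) -> list[str]:
    """Return the Imgur URLs stored under MapComplete-style image keys."""
    return [
        value
        for key, value in tags.items()
        if IMAGE_KEY.match(key) and value.startswith(IMGUR_PREFIX)
    ]


# Hypothetical tags of a single feature.
tags = {
    "amenity": "bench",
    "image": "https://i.imgur.com/aBcDeF123.jpg",
    "image:0": "https://i.imgur.com/gHiJkL456.jpg",
    "image:streetsign": "https://example.org/sign.jpg",  # not Imgur, skipped
    "wikimedia_commons": "File:Example.jpg",
}
print(imgur_links(tags))
```

Only the first two links are kept: the third value is not an Imgur URL and the last key is not an image key.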

Overpass was used to download all features with a tag matching one of the described keys and matching an imgur-url.
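A query of this kind could look roughly like the sketch below. The exact query used for this research is not shown here, so the regular expressions, the timeout and the function names are assumptions. A planet-wide query like this is heavy and may need to be split up in practice.

```python
import json
import urllib.parse
import urllib.request

# Public Overpass endpoint; any other instance would work as well.
OVERPASS_URL = "https://overpass-api.de/api/interpreter"


def build_query(timeout_s: int = 900) -> str:
    """Overpass QL for every node/way/relation that has an image-like key
    whose value looks like a direct Imgur link."""
    key_regex = "^image(:.*)?$"  # image, image:0, image:streetsign, ...
    value_regex = "^https://i[.]imgur[.]com/"  # [.] avoids backslash escaping
    return (
        f"[out:json][timeout:{timeout_s}];\n"
        f'nwr[~"{key_regex}"~"{value_regex}"];\n'
        "out tags;"
    )


def run_query(query: str) -> list[dict]:
    """POST the query and return the matched elements (needs network access)."""
    body = urllib.parse.urlencode({"data": query}).encode()
    with urllib.request.urlopen(OVERPASS_URL, data=body) as response:
        return json.load(response)["elements"]


# Only print the query here; call run_query(build_query()) to execute it.
print(build_query())
```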

Then, the description of all those images is downloaded and parsed, yielding the needed metadata.
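The parsing step might look like the sketch below. It assumes that the description holds one key:value pair per line, with an author and a license key. That layout and the sample text are assumptions for illustration, so check the actual script in the MapComplete repository for the real format.

```python
def parse_description(description: str) -> dict[str, str]:
    """Split a 'key:value' per line description into a dictionary.

    Assumption: each metadata entry sits on its own line and the key
    never contains a colon; lines without a colon are ignored.
    """
    metadata = {}
    for line in description.splitlines():
        key, separator, value = line.partition(":")
        if separator:
            metadata[key.strip()] = value.strip()
    return metadata


# Hypothetical description as it might be stored on Imgur.
sample = "author:Example Mapper\nlicense:CC0"
print(parse_description(sample))
```

An image whose description has no license entry would simply yield a dictionary without that key, which makes it easy to skip images that were not uploaded with MapComplete.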

Even though some people had already added images to Imgur to link them to OpenStreetMap, we assume that (nearly) none of those images have their license information encoded the way MapComplete does. Furthermore, this analysis does not take images of now-deleted features into account, nor images that have been deleted in the meantime. I don’t think this makes a big difference, though.

The resulting datasets are here. The script to download all of this is in the MapComplete repository. Keep in mind that running this script will exhaust the daily Imgur rate limit, so please use a different access token or spread the download over two days, as was done for this research.

Results

In total, 12516 images with a parsable license were found - this is a huge number of pictures, which I did not expect! They were uploaded by 439 contributors in total.

Unsurprisingly, the vast majority were uploaded with the default license, CC0/public domain. This accounts for 10635 pictures (or 84.9% of all pictures), taken by precisely 400 different contributors - 91.1% of contributors.

The second most popular license is the Creative Commons Attribution-ShareAlike license (CC-BY-SA), with 1707 images in total, or about 13.6% of all images. However, only 32 authors chose this license, or 7.2% of the photographers. Strikingly, those authors are far more active, with an average of 53 images/person!

Finally, the Creative Commons Attribution license (CC-BY) is not popular at all. Only 117 pictures in total - 0.9% of all pictures - used this license. Only 10 authors picked this option, and they are also below average in number of pictures taken, with 11 images/contributor.

When the authors who used CC-BY and CC-BY-SA are summed, only 42 are found. This indicates that there is a big overlap between contributors who used the CC-BY license. Personally, I contributed under CC0 first, then a bit under CC-BY, and then switched to CC-BY-SA for most of my pictures. Other contributors probably followed a similar trajectory.

Oh, and due to a bug, the license of some images got saved as "undefined" instead of the actual license. This bug only impacted 57 pictures (0.4% of all) taken by 20 authors. As we don’t know which license they chose, anyone reusing those images should stick to the most restrictive of the available licenses.
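The shares above can be recomputed from the raw counts with a few lines of Python. The sketch below only uses numbers quoted in this post; the variable and function names are made up. Rounding to the nearest tenth can differ by 0.1 from the figures in the text, which appear to be truncated rather than rounded.

```python
TOTAL_IMAGES = 12516
TOTAL_CONTRIBUTORS = 439

# license -> (number of images, number of authors), as quoted in the text
PER_LICENSE = {
    "CC0": (10635, 400),
    "CC-BY-SA": (1707, 32),
    "CC-BY": (117, 10),
    "undefined": (57, 20),
}


def shares(images: int, authors: int) -> tuple[float, float, float]:
    """Return (% of images, % of contributors, images per author)."""
    return (
        round(100 * images / TOTAL_IMAGES, 1),
        round(100 * authors / TOTAL_CONTRIBUTORS, 1),
        images / authors,
    )


for name, (images, authors) in PER_LICENSE.items():
    image_share, author_share, per_author = shares(images, authors)
    print(f"{name}: {image_share}% of images, "
          f"{author_share}% of contributors, "
          f"{per_author:.1f} images/author")

# The image counts add up to the total, but the author counts do not:
# an author who used several licenses is counted once per license.
print(sum(images for images, _ in PER_LICENSE.values()))    # 12516
print(sum(authors for _, authors in PER_LICENSE.values()))  # 462
```

The per-license author counts add up to 462, more than the 439 distinct contributors, which suggests that some contributors used more than one license.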

Averages and medians

On average, a contributor with at least one image uploads about 28.5 pictures! However, this is a typical power-law curve, with a few powerhouses that add tons of images. The median contributor with at least one image contributes only two images.
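To see how an average of 28.5 and a median of two can go together, here is a small sketch. The first line uses the real totals from this post. The ten-person sample is entirely made up and only chosen so that it shows the same effect.

```python
from statistics import mean, median

# Real totals from this post: 12516 images by 439 contributors.
print(round(12516 / 439, 1))  # 28.5

# Hypothetical sample of ten contributors: most upload a picture or two,
# while two "powerhouses" upload far more.
toy_counts = [1, 1, 1, 2, 2, 2, 3, 5, 40, 228]

print(mean(toy_counts))    # pulled up by the two heavy uploaders
print(median(toy_counts))  # the typical contributor
```

In the toy sample, the two heavy uploaders pull the mean far above what the typical contributor does, while the median is not affected by them at all.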

Conclusion

First of all, I’m absolutely flabbergasted by the total number of pictures taken! I knew it had to be in the thousands, but never realised it would be over 10k!

As only 42 contributors ever contributed under a different license, I feel comfortable moving the license picker into the user settings panel. Freeing up this space will improve the experience of thousands of people at the cost of a few clicks that only a handful of people have to make - even though this handful of people are the most active contributors.

I’m also very positively surprised by the high average number of pictures per person - even though the median is a bit more modest.

And the fact that someone has uploaded more than twice as many pictures as I did is really cool too. They are also the only contributor (so far) to go over 1000 pictures and are even getting close to breaking the 2000 boundary… Congratulations, Awo!

The second place is for me (Pieter Vander Vennet), with 859 pictures added. (Damn, this many already?)

The third place is for vjyblauw, another power mapper in Belgium with 746 pictures. Congratulations as well!

Finally, I’ve attached the top 50 contributors below.

But before showing it to you, I’d like to tell you all one more thing:

Thank you for contributing!

This wouldn’t be possible without all of you.

Position | Username | Total number of pictures
1 | Awo | 1953
2 | Pieter Vander Vennet | 859
3 | vjyblauw | 746
4 | JLZIMMERMANN | 645
5 | Thierry1030 | 622
6 | L’imaginaire | 589
7 | Jose Luis Infante | 575
8 | Toni Serra | 446
9 | APneunzehn74 | 439
10 | joost schouppe | 310
11 | Maarten O | 301
12 | 5R-MFT | 254
13 | Wolfram Hoppe | 250
14 | Koen Rijnsent | 234
15 | WimBau | 229
16 | dentonny | 212
17 | Stijn Matthys | 137
18 | Polardfront | 126
19 | TauvicR | 119
20 | Locatus_Jori | 109
21 | Locatus_Raf | 100
22 | Robin van der Linde | 98
23 | wjtje | 88
24 | Marival | 75
25 | Pieter Nuytinck | 71
26 | Vincent Bombaerts | 68
27 | Rober castro | 65
28 | 349499 | 58
29 | Frans_Napaters | 57
30 | Thibaultmol | 57
31 | philippec | 56
32 | StefDeGreef | 52
33 | borgofumo | 52
34 | ClarissaWAM | 48
35 | jospyck | 48
36 | escobrice | 44
37 | KaiPankrath | 43
38 | Ninopiña10 | 43
39 | Niels Elgaard Larsen | 42
40 | RodrigoKiger | 41
41 | MAGONA | 39
42 | sjokomoeske | 37
43 | ccasado | 36
44 | Piotr Barczak | 34
45 | lololailo | 34
46 | Manuel C Arco Martos | 33
47 | reginaldc | 33
48 | Hilde OSM | 32
49 | paunofu | 32
50 | Gruppe 24(2) | 30