Dataset Views and Downloads

The latest dataset reports are available at https://metrics.fairdata.fi/reports/DATASETS-this-month.html. In the future, the latest metrics for an individual dataset can be seen on the dataset landing page in Etsin.

What do you track in Fairdata Etsin?

We track two primary types of events pertaining to published datasets in Fairdata Etsin:

  1. Views of (visits to) a dataset landing page or its subpages.
  2. Downloads of dataset packages or individual dataset files.

For each type of event, we record (where applicable):

  1. Visitor: A fully anonymized visitor identifier
  2. Dataset ID: The unique internal identifier of the dataset
  3. Event scope: A normalized, path-structured title defining the particular scope and nature of the event
  4. Content: For download events, whether the download includes the complete dataset, partial dataset, or an individual dataset file
  5. Success: For download events, whether the download was successful or not
  6. User environment: For view events, key characteristics of the user environment, including operating system, browser, language, and display resolution
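
As an illustration only, the fields listed above could be modeled as a single record type. This is a minimal sketch: the class name, field names, and sample values are assumptions made for this example and do not describe the actual schema used by the service.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DatasetEvent:
    """One recorded event. All names here are illustrative assumptions."""
    kind: str                           # "view" or "download"
    visitor: str                        # fully anonymized visitor identifier
    dataset_id: str                     # internal identifier of the dataset
    scope: str                          # normalized, path-structured title
    content: Optional[str] = None       # downloads only: "complete", "partial", or "file"
    success: Optional[bool] = None      # downloads only: did the download succeed?
    environment: Optional[dict] = None  # views only: OS, browser, language, resolution


# Hypothetical sample events; the identifiers and scope strings are made up.
view = DatasetEvent(
    kind="view",
    visitor="3f9a1c",
    dataset_id="example-dataset-id",
    scope="example/view/scope",
    environment={"os": "Linux", "browser": "Firefox", "language": "fi"},
)
download = DatasetEvent(
    kind="download",
    visitor="3f9a1c",
    dataset_id="example-dataset-id",
    scope="example/download/scope",
    content="partial",
    success=True,
)
```

The optional fields reflect the "where applicable" wording: download-only fields stay unset for views, and the user environment stays unset for downloads.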

What is an event?

An explicit request by a user (human or machine) for dataset-related information from the service, either a web view or a download of dataset data via Fairdata Etsin.

What is a view?

The presentation of a dataset-specific web page in Fairdata Etsin, either the primary landing page for the dataset, or a secondary presentation providing more detailed or specialized information pertaining to the dataset.

What is a download?

A user (human or machine) successfully downloading a dataset package (complete or partial) or individual dataset file via Fairdata Etsin. Failed or incomplete download attempts are also recorded.

How do you track?

Events are recorded as they occur, and monthly reports are regularly generated from the recorded events. View and download totals are calculated for the month to date, the year to date, and the total to date since recording of dataset events began (2021-08).
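
The three totals can be illustrated with a short sketch. It assumes each event carries a calendar date and that totals are computed relative to a chosen reporting date; the function name and the sample dates are hypothetical, and the actual report generation may work differently.

```python
from datetime import date


def period_totals(event_dates, as_of):
    """Count events for the month to date, year to date, and total to date."""
    totals = {"month_to_date": 0, "year_to_date": 0, "total_to_date": 0}
    for d in event_dates:
        if d > as_of:
            continue  # ignore events after the reporting date
        totals["total_to_date"] += 1
        if d.year == as_of.year:
            totals["year_to_date"] += 1
            if d.month == as_of.month:
                totals["month_to_date"] += 1
    return totals


# Hypothetical view events for one dataset.
views = [date(2021, 8, 15), date(2022, 1, 10), date(2022, 3, 2), date(2022, 3, 20)]
totals = period_totals(views, as_of=date(2022, 3, 31))
# totals == {"month_to_date": 2, "year_to_date": 3, "total_to_date": 4}
```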

Where can I find the reports?

In the future, each dataset will show its own metrics on its landing page in Etsin. Until that is implemented, the latest dataset reports are available at https://metrics.fairdata.fi/reports/DATASETS-this-month.html, which provides the dataset reports for the current month, to date. To view reports for another month, click on “SELECT PERIOD” in the upper right corner and select the desired month from the listing on the left, then select “DATASETS” from the top level navigation menu to view the dataset reports for that month.

How can I see metrics for a particular dataset?

In the future, the latest metrics for an individual dataset can be seen on the dataset landing page in Etsin. The page will show the total and unique views of a dataset and the number of downloads of the dataset’s data.

Until the Etsin implementation is ready, you can search by dataset identifier in the report tables to limit the report view to matching table entries. Note that the dataset identifier here is Etsin’s internal identifier (not the URN or DOI), which is visible in the URL of the dataset’s landing page.

How often do you update reports?

Reports for the current month are updated once per hour, to continually reflect newly aggregated data.

How do you deal with robots?

Requests made by robots (aka crawlers, spiders, bots) are blocked by the service and therefore do not result in recorded events.

What is the difference between a machine and a robot?

A machine request is an automated request initiated by a human user, e.g. a script downloading data and running an analysis on the data. A robot request is an automated request made by e.g. a search engine crawler.

How can I see the most viewed datasets?

In the view report tables at https://metrics.fairdata.fi/reports, datasets are ordered from most viewed to least viewed.

How can I see the most downloaded datasets?

In the download report tables at https://metrics.fairdata.fi/reports, datasets are ordered from most downloaded to least downloaded.

How can I see dataset metrics for a particular month?

The latest dataset reports, containing all datasets, are available at https://metrics.fairdata.fi/reports/DATASETS-this-month.html, which provides the dataset reports for the current month, to date. To view reports for another month, click on “SELECT PERIOD” in the upper right corner and select the desired month from the listing on the left, then select “DATASETS” from the top level navigation menu to view the dataset reports for that month.

If the selected month is prior to the current month, the month-to-date totals will be complete for that month; otherwise they will be the partial totals to date for the current month.

How can I see dataset metrics for a particular year?

From any report view, click on “SELECT PERIOD” in the upper right corner of the report view, select the latest month for the desired year from the listing of available months on the left, and from the initial report presented, click on “DATASETS” in the top navigation menu. If the selected month is December (12) of a year prior to the current year, the year-to-date totals will be complete for that year; otherwise they will be the partial totals to date for the selected month and year.

How can I obtain metrics in a format suitable for automated processing?

Below each report table, there are links to both JSON and CSV encoded representations corresponding to the data included in the table.

In addition, the total views and downloads to date for all datasets are available in JSON format from https://metrics.fairdata.fi/reports/datasets.json
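
A minimal sketch of processing these representations with the Python standard library follows. The source does not describe the structure of the JSON or CSV data, so the column names in the sample (`dataset_id`, `views`, `downloads`) are hypothetical, the identifier column is passed in as a parameter rather than assumed, and UTF-8 encoding is also an assumption.

```python
import csv
import io
import json
import urllib.request

DATASETS_JSON_URL = "https://metrics.fairdata.fi/reports/datasets.json"


def fetch_text(url, timeout=30):
    """Download a report representation and return it as text (UTF-8 assumed)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


def fetch_json(url=DATASETS_JSON_URL):
    """Download and parse a JSON report; its structure is not assumed here."""
    return json.loads(fetch_text(url))


def rows_for_dataset(csv_text, dataset_id, id_column):
    """Return the CSV rows whose id_column value equals dataset_id."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row for row in reader if row.get(id_column) == dataset_id]


# Hypothetical CSV content; real reports may use different columns.
sample_csv = "dataset_id,views,downloads\nabc,12,3\nxyz,5,0\n"
matches = rows_for_dataset(sample_csv, "abc", id_column="dataset_id")
```

In practice, the CSV text would come from `fetch_text` called with one of the CSV links below a report table, and the identifier column would be taken from the header row of that file.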

How do you anonymize users?

For each view event, we record an anonymized visitor identifier. This anonymized visitor identifier changes for a user every 24 hours, hence a user viewing the same record on two different days will have two different anonymized visitor identifiers.

The anonymized visitor identifier is generated from a “fingerprint” of user identifying information such as IP address, browser agent, etc., which is combined with a randomly generated text value (a salt) and hashed using a one-way cryptographic hash function. The salt is irretrievably discarded and regenerated every 24 hours. None of the information constituting the “fingerprint” used to generate the anonymized visitor identifier is stored persistently.

Discarding the randomly generated salt every 24 hours, such that it can never be retrieved, ensures that the anonymized visitor identifiers are fully anonymized, not merely pseudonymized.
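
The scheme described above can be sketched as follows. This is an illustration under stated assumptions, not the service's implementation: SHA-256 stands in for the unnamed one-way hash function, the fingerprint is reduced to an IP address and a browser agent string, and the class and method names are hypothetical.

```python
import hashlib
import secrets


class DailyAnonymizer:
    """Derives visitor identifiers from a fingerprint and a rotating salt."""

    def __init__(self):
        # The salt lives only in memory and is never written to storage.
        self._salt = secrets.token_bytes(32)

    def rotate_salt(self):
        """Discard the current salt and generate a new one (run every 24 hours)."""
        self._salt = secrets.token_bytes(32)

    def visitor_id(self, ip_address, browser_agent):
        """Hash the salted fingerprint; the fingerprint itself is not kept."""
        fingerprint = f"{ip_address}|{browser_agent}".encode("utf-8")
        return hashlib.sha256(self._salt + fingerprint).hexdigest()


anonymizer = DailyAnonymizer()
first = anonymizer.visitor_id("192.0.2.1", "ExampleBrowser/1.0")
same_day = anonymizer.visitor_id("192.0.2.1", "ExampleBrowser/1.0")
anonymizer.rotate_salt()
next_day = anonymizer.visitor_id("192.0.2.1", "ExampleBrowser/1.0")
```

Within one salt period, the same fingerprint yields the same identifier, so unique visitors can be counted. After rotation the old salt is gone, so identifiers from different periods cannot be linked to each other or traced back to the fingerprint.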

Can I opt out of the usage statistics tracking?

No, it is not possible to opt out. The usage statistics tracking is cookieless, fully anonymized, and done on the server side.

Who do you share usage statistics with?

Generated reports based on aggregated event data are publicly available without restriction, and do not include any sensitive data or Personally Identifiable Information (PII).

What do you share with third parties?

We share only the publicly available data included in generated reports, which is available to anyone. No special data is ever shared with any third parties. We never share the raw event data, even though such data is fully anonymized.

Do you support usage statistics for a community?

No.

Do you track my search queries?

No.

Do you do any manual or automatic profiling of users?

No.