-
Notifications
You must be signed in to change notification settings - Fork 1.2k
1 General parameters for the Javascript tracker v2.2
🚧 The documentation for the latest version can be found on the Snowplow documentation site.
This page refers to version 2.2.x of the Snowplow JavaScript Tracker
- 2.1 Loading Snowplow.js
- 2.2 Initialising a tracker
- 2.2.1 Setting the application ID
- 2.2.2 Setting the platform
- 2.2.3 Configuring the cookie domain
- 2.2.4 Configuring the cookie name
- 2.2.5 Configuring base 64 encoding
- 2.2.6 Respecting Do Not Track
- 2.2.7 User fingerprinting
- 2.2.8 Setting the user fingerprint seed
- 2.2.9 Setting the page unload pause
- 2.2.10 Setting the event request protocol
- 2.2.11 Altering cookies
- 2.2.12 Configuring localStorage
- 2.3 Other parameters
- 2.3.1 Setting the user id
- 2.3.1.1
setUserId
- 2.3.1.2
setUserIdFromLocation
- 2.3.1.3
setUserIdFromReferrer
- 2.3.1.4
setUserIdFromCookie
- 2.3.1.1
- 2.3.2 Setting a custom URL with
setCustomUrl
- 2.3.3 Setting the pause time before leaving a page with
setLinkTrackingTimer
- 2.3.4 Configuring cookie timeouts using
setSessionCookieTimeout
- 2.3.1 Setting the user id
- 2.4 Setting onload callbacks
- 2.5 Managing multiple trackers
- 2.6 How the Tracker uses cookies
- 2.7 Getting the user ID from the first-party cookie
- 2.8 How the Tracker uses localStorage
Use the following tag to your page to load Snowplow.js:
<script type="text/javascript" async=1>
;(function(p,l,o,w,i,n,g){if(!p[i]){p.GlobalSnowplowNamespace=p.GlobalSnowplowNamespace||[];
p.GlobalSnowplowNamespace.push(i);p[i]=function(){(p[i].q=p[i].q||[]).push(arguments)
};p[i].q=p[i].q||[];n=l.createElement(o);g=l.getElementsByTagName(o)[0];n.async=1;
n.src=w;g.parentNode.insertBefore(n,g)}}(window,document,"script","//cdn.jsdelivr.net/gh/snowplow/[email protected]/sp.js","snowplow_name_here"));
</script>
Important note regarding testing: "//cdn.jsdelivr.net/gh/snowplow/[email protected]/sp.js"
is the protocol-relative URL used to fetch sp.js. It will work if the your web page is using the "http" or "https" protocol. But if you are testing locally and loading your page from your filesystem using the "file" protocol (so its URI looks something like "file:///home/joe/snowplow_test.html"), the protocol-relative URL will also use that protocol, preventing the script from loading. To avoid this, change the URL to "http://cdn.jsdelivr.net/gh/snowplow/[email protected]/sp.js"
when testing locally.
As well as loading Snowplow, this tag creates a global function called "snowplow_name_here" which you use to access the Tracker. You can replace the string "snowplow_name_here" with the function name of your choice. This is encouraged: if there are two Snowplow users on the same page, there won't be any conflict between them as long as they have chosen different function names. The rest of the documentation will assume that the function is called "snowplow_name_here".
Once the function snowplow_name_here
is created, the syntax for using Snowplow methods is as follows:
snowplow_name_here({{"methodName"}}, {{first method argument}}, {{second method argument}}, ...);
For example, the method trackStructEvent
has this signature:
function trackStructEvent(category, action, label, property, value, context)
where only the first two arguments are required. You would use it like this:
snowplow_name_here('trackStructEvent', 'Mixes', 'Play', '', '', 20);
Empty strings are provided for the label and value arguments to pad them out. (Null
would also work.) They won't be added to the event. Neither will the context argument, which isn't provided at all.
Back to top Back to JavaScript technical documentation contents
Tracker initialization is indicated with the "newTracker"
string and takes three arguments:
- The tracker namespace
- The collector endpoint
- An optional argmap containing other settings
Here is a simple example of how to initialise a tracker:
snowplow_name_here("newTracker", "cf", "my-collector.cloudfront.net", {
appId: "cfe23a",
platform: "mob"
});
The tracker will be named "cf" and will send events to my-collector.cloudfront.net, the cloudfront collector specified. The final argument is called the argmap. Here it is just used to set the app ID and platform for the tracker. Each event the tracker sends will have an app ID field set to "cfe23a" and a platform field set to "mob".
Here is a longer example in which every tracker configuration parameter is set:
snowplow_name_here("newTracker", "cf", "my-collector.cloudfront.net", {
appId: "cfe23a",
platform: "mob"
cookieDomain: null,
cookieName: "_sp534_",
encodeBase64: false,
respectDoNotTrack: false,
userFingerprint: true,
userFingerprintSeed: 6385926734,
pageUnloadTimer: 0,
forceSecureTracker: true,
writeCookies: true
});
We will now go through the various argmap parameters. Note that these are all optional. In fact, you aren't required to provide any argmap at all.
Set the application ID using the appId
field of the argmap. This will be attached to every event the tracker fires. You can set different application IDs on different parts of your site. You can then distinguish events that occur on different applications by grouping results based on application_id
.
Set the application platform using the platform
field of the argmap. This will be attached to every event the tracker fires. Its default value is "web". For a list of supported platforms, please see the Snowplow Tracker Protocol.
If your website spans multiple subdomains e.g.
- www.mysite.com
- blog.mysite.com
- application.mysite.com
You will want to track user behaviour across all those subdomains, rather than within each individually. As a result, it is important that the domain for your first party cookies is set to '.mysite.com' rather than 'www.mysite.com'. By doing so, any values that are stored on the cookie on one of subdomain will be accessible on all the others.
Set the cookie domain for the tracker instance using the cookieDomain
field of the argmap. If this field is not set, the cookies will not be given a domain.
Set the cookie name for the tracker instance using the cookieName
field of the argmap. The default is "sp". Snowplow uses two cookies, a domain cookie and a session cookie. In the default case, their names are "_sp_id" and "_sp_ses" respectively. If you are upgrading from an earlier version of Snowplow, you should use the default cookie name so that the cookies set by the earlier version are still remembered. Otherwise you should provide a new name to prevent clashes with other Snowplow users on the same page.
Most browsers have a Do Not Track option which allows users to express a preference not to be tracked. You can respect that preference by setting the respectDoNotTrack
field of the argmap to true
. This prevents cookies from being sent and events from being fired.
By default, the tracker generates a user fingerprint based on various browser features. This fingerprint is likely to be unique and so can be used to track anonymous users. You can turn user fingerprinting off by setting the userFingerprint
field of the argmap to false
.
The userFingerprintSeed
field of the the argmap lets you choose the hash seed used to generate the user fingerprint. If this is not specified, the default is 123412414.
Whenever the Snowplow Javascript Tracker fires an event, it automatically starts a 500 millisecond timer running. If the user clicks on a link or refreshes the page during this period (or, more likely, if the event was triggered by the user clicking a link), the page will wait until either the event is sent or the timer is finished before unloading. 500 milliseconds is usually enough to ensure the event has time to be sent.
You can change the pause length (in milliseconds) using the pageUnloadTimer
of the argmap. The above example completely eliminates the pause. This does make it unlikely that events triggered by link clicks will be sent.
See also How the Tracker uses localStorage
for an explanation of how the tracker can later recover and send unsent events.
Normally the protocol (http or https) used by the Tracker to send events to a collector is the same as the protocol of the current page. You can force it to use https by setting the forceSecureTracker
field of the argmap to true
.
The writeCookies
argument is a boolean value which determines whether the tracker instance will be able to alter cookies or add new ones. It does not affect whether the tracker instance will read cookies, so if it is turned off but Snowplow cookies with the tracker's configured cookie name already exist for the page, the tracker will continue to report those cookies' values. If you do use this argument, be careful - if two trackers on the same page are both initialised with the same cookie name and with writeCookies
turned on, inaccurate data will result from them both trying to alter the same cookies. Note that you will always be fine if the writeCookies
argument is not set - because the default behaviour avoids these problems.
By default the Tracker will store events in localStorage
before sending them so that they can be recovered if the user leaves the page before they are sent. You can disable this feature by setting a useLocalStorage: false
field in the argmap.
Back to top Back to JavaScript technical documentation contents
The JavaScript Tracker automatically sets a domain_userid
based on a first party cookie.
There are many situations, however, when you will want to identify a specific user using an ID generated by one of your business systems. To do this, you use one of the methods described in this section: setUserId
, setUserIdFromLocation
, setUserIdFromReferrer
, and setUserIdFromCookie
.
Typically, companies do this at points in the customer journey when the user identifies him / herself e.g. if he / she logs in.
Note: this will only set the user ID on further events fired while the user is on this page; if you want events on another page to record this user ID too, you must call setUserId
on the other page as well.
setUserId
is the simplest of the four methods. It sets the business user ID to a string of your choice:
snowplow_name_here('setUserId', '[email protected]');
setUserIdFromLocation
lets you set the user ID based on a querystring field of your choice. For example, if the URL is http://www.mysite.com/home?id=user345
, then the following code would set the user ID to "user345":
snowplow_name_here('setUserIdFromLocation', 'id');
setUserIdFromReferrer functions in the same way as
setUserIdFromLocation`, except that it uses the referrer querystring rather than the querystring of the current page.
snowplow_name_here('setUserIdFromReferrer', 'id');
Use setUserIdFromCookie
to set the value of a cookie as the user ID. For example, if you have a cookie called "cookieid" whose value is "user123", the following code would set the user ID to "user123":
snowplow_name_here('setUserIdFromCookie', 'cookieid');
Back to top Back to JavaScript technical documentation contents
The Snowplow JavaScript Tracker automatically tracks the page URL on any event tracked. However, in certain situations, you may want to override the actual URL with a custom value. (For example, this might be desirable if your CMS spits out particularly ugly URLs that are hard to unpick at analysis time.) In that case, you can override the default value using the setCustomUrl
function.
To set a custom URL, use the setCustomUrl
method:
snowplow_name_here('setCustomUrl', 'http://mysite.com/checkout-page');
The JavaScript Tracker sets two cookies: a visitor cookie and a session cookie. The visitor cookie contains all persistent information about the user, including a visit count (the number of times the user has visited the site). It lasts for two years. The session cookie is specific to an individual session. By default, it expires after 30 minutes pass with no event fired. Whenever a Snowplow event is fired, if no session cookie is found, the Tracker takes this to mean that a new session has started. It therefore increments the visitor cookie's visit count. If the user leaves the site and returns before the 30 minutes is up, the visit count is not incremented.
The visit count is added to each event querystring as "vid". "vid=3" would mean an event was fired during the user's third session.
You can change the default from 30 minutes by using setSessionCookieTimeout
. You should give the expiration time of the session cookie in seconds:
snowplow_name_here('setSessionCookieTimeout', 3600);
The above code would cause the session cookie to last for one hour.
Back to top Back to JavaScript technical documentation contents
If you call snowplow_name_here
with a function as the argument, the function will be executed when sp.js loads:
snowplow_name_here(function () {
console.log("sp.js has loaded");
});
Or equivalently:
snowplow_name_here(function (x) {
console.log(x);
}, "sp.js has loaded");
The callback function should not be a method:
// TypeError: Illegal invocation
snowplow_name_here(console.log, "sp.js has loaded");
will not work, because the value of this
in the console.log
function will be window
rather than console
.
You can get around this problem using Function.prototoype.bind
as follows:
snowplow_name_here(console.log.bind(console), "sp.js has loaded");
You have more than one tracker instance running on the same page at once. This may be useful if you want to log events to different collectors. By default, any Snowplow method you call will be executed by every tracker you have created so far:
snowplow_name_here("newTracker", "cf1", "my-collector-1.cloudfront.net", {
appId: "cfe23a",
platform: "mob"
});
snowplow_name_here("newTracker", "cf2", "my-collector-2.cloudfront.net", {
appId: "cfe23a",
platform: "mob"
});
// Both trackers will use this custom title
snowplow_name_here('setCustomUrl', 'http://mysite.com/checkout-page');
// Both trackers will fire a structured event
snowplow_name_here('trackStructEvent', 'Mixes', 'Play', 'MrC/fabric-0503-mix', '', '0.0');
You can override this behaviour and specify which trackers will execute a Snowplow method. To do this, change the method name by adding a colon followed by a list of tracker names separated by semicolons:
// Only the first tracker will fire this structured event
snowplow_name_here('trackStructEvent:cf1', 'Mixes', 'Play', 'MrC/fabric-0503-mix', '', '0.0');
// Only the second tracker will fire this unstructured event
snowplow_name_here('trackUnstructEvent:cf2', 'com.acme_company' 'Viewed Product',
{
product_id: 'ASO01043',
category: 'Dresses',
brand: 'ACME',
returning: true,
price: 49.95,
sizes: ['xs', 's', 'l', 'xl', 'xxl'],
available_since$dt: new Date(2013,3,7)
}
);
// Both trackers will fire a page view event
snowplow_name_here('trackPageView:cf1;cf2');
Unless you have enabled respectDoNotTrack
in the configuration argmap, the tracker will use cookies to persist information. There are two first party cookies: the session cookie and the ID cookie. By default their names are prefixed with "sp", but you can change this using the "cookieName" field in the argmap. Their names are suffixed with a hash of the current domain, so the full cookie names might look something like _sp_ses.4209 and _sp_id.4209.
Called _sp_ses.{{DOMAIN HASH}} by default, the only purpose of this cookie is to differentiate between different visits. Whenever an event is fired, the session cookie is set to expire in 30 minutes. (This value can be altered using setSessionCookieTimeout
.)
If no session cookie is already present when an event fires, the tracker treats this as an indication that long enough has passed since the user last visited that this session should be treated as a new session rather than a continuation of the previous session. The visitCount
(how many times the user has visited) is increased by one and the lastVisitTs
(the timestamp for the last session) is updated.
This cookie is called _sp_id.{{DOMAIN HASH}} by default. It is used to persist information about a user's activity on the domain between sessions. It contains the following information:
- An ID for the user based on a hash of various browser attributes
- How many times the user has visited the domain
- The timestamp of the user's first visit
- The timestamp of the current visit
- The timestamp of the last visit
It expires after 2 years.
There is a third sort of Snowplow-related cookie: the cookie set by the Clojure Collector, independently of the JavaScript Tracker. If you are using another type of collector, this cookie will not be set. The Clojure Collector cookie is called "sp". It is a third-party cookie used to track users over multiple domains. It expires after one year.
You can use the following function to extract the user ID from the ID cookie:
/*
* Function to extract the Snowplow user ID from the first-party cookie set by the Snowplow JavaScript Tracker
*
* @param string cookieName (optional) The value used for "cookieName" in the tracker constructor argmap
* (leave blank if you did not set a custom cookie name)
*
* @return string or bool The ID string if the cookie exists or false if the cookie has not been set yet
*/
function getSnowplowDuid(cookieName) {
cookieName = cookieName || '_sp_';
var matcher = new RegExp(cookieName 'id\\.[a-f0-9] =([^;] );');
var match = document.cookie.match(matcher);
if (match && match[1]) {
return match[1].split('.')[0];
} else {
return false;
}
}
If you set a custom cookieName
field in the argmap, pass that name into the function; otherwise call the function without arguments. Note that if the function is called before the cookie exists (i.e. when the user is visiting the page for the first time and sp.js has not yet loaded) if will return false
.
The Snowplow JavaScript Tracker uses window.localStorage
to store events in case the user goes offline. Whenever the Tracker tries to fire an event, it first appends it to the queue in localStorage
, and then sends events from the front of the queue until the queue is empty or an event fails to send.
localStorage
is only shared between pages with the exact same domain. So if a user clicks on an internal link to another page in the same domain but the link click event fails to send before the page unloads, the event will be available in localStorage
to the destination page, and if sp.js is also loaded on that page, it will send the request. Note that the tracker on the second page must have the same Snowplow function name (e.g. "snowplow_name_here") and the same tracker namespace (e.g. "cf") as the tracker on the first page for this to work.
Home | About | Project | Setup Guide | Technical Docs | Copyright © 2012-2021 Snowplow Analytics Ltd. Documentation terms of use.
HOME » TECHNICAL DOCUMENTATION » Trackers » Javascript tracker
1. Trackers Overview Javascript Tracker
2. Collectors Overview Cloudfront Collector Clojure Collector (Elastic Beanstalk) Scala Stream Collector
3. ETL Overview EmrEtlRunner
C. Canonical Snowplow event model
4. Storage Overview S3 / Hive Amazon Redshift Infobright
D. Snowplow storage formats (to write)
5. Analytics Analytics-documentation
Common Artifact-repositories