arxiv: cs/0511106 · v1 · submitted 2005-11-30 · 💻 cs.DB

Benefits of InterSite Pre-Processing and Clustering Methods in E-Commerce Domain

classification 💻 cs.DB
keywords: data, dataset, intersite, analysis, benefits, clickstream, clustering, domain

This paper presents our preprocessing and clustering analysis of the clickstream dataset proposed for the ECML/PKDD 2005 Discovery Challenge. The main contributions of this article are twofold. First, after presenting the clickstream dataset, we show how we build a rich data warehouse based on advanced preprocessing. We take into account the intersite aspects of the given e-commerce domain, which yields an interesting structuring of the data. A preliminary statistical analysis based on time-period clickstreams is given, emphasizing the importance of intersite user visits in such a context. Second, we describe our crossed-clustering method, which is applied to data generated from our data warehouse. Our preliminary results are promising and illustrate the benefits of our Web Usage Mining (WUM) methods, although further investigation on the same dataset is needed.
