MOBILE INTERNET USAGE ANALYSIS
Data profiling is the process of examining the data available from an existing information source (e.g. a database or a file) and collecting statistics or informative summaries about the data. The purpose of these statistics were to find out whether existing data can be easily used for other purposes, improve the ability to search data by tagging it with keywords, descriptions, or assigning it to a category, and/or assess data quality, including whether the data conforms to particular standards or patterns.
> Objective
Paques Data Lake advised a mobile network operators to observe where the error occurred by combining data from various machine sources.
> Data Source
Paques Data Lake used data from cell-monitoring, weekly report, and router data that contained regions, locations, clients, connection status in CSV format.
> Solution
Paques Data Lake processed the data from several machines by correlating them based on defining the ‘key’ from each file into a single data set. And to find out where the error occurred, Paques Data Lake made an individual text data manually in defining error pattern based on data classification.
> Workflow
> Result
By using Paques, users can perform correlation of data from various machine sources into an easy to read data based on defined patterns and allows users to easily monitor profiling data. From the generated data, the users can immediately find out where the error occurred based on existing knowledge and may work on it right away.