[Python] finding the correct data mining approach

eherrtelle59 · Jul 20, 2012

I'm having trouble finding the correct approach to my (fairly simple) example.

Let's say I have months of log-in data for a website. The data has been selected and cleaned so that I have a list of Date_Time values, one per log-in.

Now, suppose I wanted to predict the log-ins for the next two weeks by day and hour, based on these past trends.

I imagine I would group the data by day of the week (assuming beforehand that Mondays and Fridays, say, follow different trends) and run a regression to predict, for example, the next two Mondays.

Similarly, I could group by hour and run a regression to extrapolate the trend in log-ins.

Does anyone know of a resource that explains how to do this in Python? I want to keep this example fairly straightforward, but I'm open to other ideas on how to model this behavior more efficiently.
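The grouping-plus-regression idea described in this post can be sketched with the standard library alone. This is only one possible reading of it, under stated assumptions: the log-ins are `datetime` objects, the data covers at least one full week, and each (weekday, hour) slot gets its own straight-line trend over the week number. The names `hourly_counts`, `fit_line`, and `forecast` are made up for illustration.

```python
from collections import Counter, defaultdict
from datetime import timedelta


def hourly_counts(logins):
    """Count log-ins per (date, hour) bucket."""
    return Counter((t.date(), t.hour) for t in logins)


def fit_line(xs, ys):
    """Least-squares fit y = a + b*x; returns (a, b).

    Falls back to a flat line when there is no spread in x.
    """
    n = len(xs)
    if n == 0:
        return 0.0, 0.0
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        return mean_y, 0.0
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    return mean_y - b * mean_x, b


def forecast(logins, start, end, weeks_ahead=2):
    """Predict log-ins per (date, hour) for the days following `end`.

    `start` and `end` are the first and last dates (inclusive) covered
    by the data; hours with no log-ins count as zero.
    """
    counts = hourly_counts(logins)

    # One series per (weekday, hour) slot: week number -> observed count.
    series = defaultdict(list)
    day = start
    while day <= end:
        week = (day - start).days // 7
        for hour in range(24):
            slot = (day.weekday(), hour)
            series[slot].append((week, counts.get((day, hour), 0)))
        day += timedelta(days=1)

    # Fit each slot once, then extrapolate to the future weeks.
    lines = {
        slot: fit_line([w for w, _ in pts], [c for _, c in pts])
        for slot, pts in series.items()
    }
    predictions = {}
    for offset in range(1, 7 * weeks_ahead + 1):
        day = end + timedelta(days=offset)
        week = (day - start).days // 7
        for hour in range(24):
            a, b = lines.get((day.weekday(), hour), (0.0, 0.0))
            predictions[(day, hour)] = max(0.0, a + b * week)  # no negative counts
    return predictions
```

Calling `forecast(logins, first_date, last_date)` returns a dictionary keyed by `(date, hour)` for the following 14 days. A straight line per slot is a deliberately crude model; with only a few weeks of data, the slot average may be just as good.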

chiro · Jul 21, 2012

Hey eherrtelle59.

You should probably take a look at this:

http://mlpy.sourceforge.net/

gsal · Jul 21, 2012

There is also LOWESS (locally weighted scatterplot smoothing).
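To give a feel for what LOWESS does, here is a simplified sketch in plain Python: at each point it fits a weighted straight line to the nearest neighbours, using tricube weights. It omits the robustness iterations of the full method, and the function name and `frac` parameter are chosen here for illustration. For real work, a library implementation such as `lowess` in `statsmodels.nonparametric.smoothers_lowess` is probably the better choice.

```python
def lowess(xs, ys, frac=0.5):
    """Simplified LOWESS: local weighted linear fits with tricube weights.

    `frac` is the fraction of points used for each local fit.
    No robustness iterations are performed.
    """
    n = len(xs)
    k = min(n, max(2, int(round(frac * n))))  # neighbours per local fit
    smoothed = []
    for x0 in xs:
        # Bandwidth: distance from x0 to its k-th nearest point.
        dists = sorted(abs(x - x0) for x in xs)
        h = dists[k - 1] or 1.0  # avoid dividing by zero on duplicate x values
        # Tricube weights: 1 at x0, falling to 0 at distance h and beyond.
        weights = [(1 - min(abs(x - x0) / h, 1.0) ** 3) ** 3 for x in xs]

        sw = sum(weights)
        mx = sum(w * x for w, x in zip(weights, xs)) / sw
        my = sum(w * y for w, y in zip(weights, ys)) / sw
        sxx = sum(w * (x - mx) ** 2 for w, x in zip(weights, xs))
        if sxx == 0:
            smoothed.append(my)  # all weight on one x value: use weighted mean
        else:
            sxy = sum(w * (x - mx) * (y - my)
                      for w, x, y in zip(weights, xs, ys))
            smoothed.append(my + (sxy / sxx) * (x0 - mx))
    return smoothed
```

Applied to daily or hourly log-in counts (x as the time index, y as the count), this gives a smooth trend curve rather than a forecast; a larger `frac` smooths more heavily.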
