- #1
Frank Einstein
- 170
- 1
- TL;DR Summary
- I need to know how to cluster data measured at different time instants.
Hello everyone. I have a machine with a series of sensors. All sensors send a signal each minute. I want to know if any of those sensors are redundant. The data is available as an Excel file, where the columns are the variables and the rows are the measurements. I have 1000 rows.
To do this, I have used DBSCAN in Python as
However, I think that there has to be a better way to find relationships between variables (each sensor or columns of the data file).
Could someone please point me towards a methodology more suitable for my goals?
Any answer is appreciated.
Tanks for reading.
Best regards.
Frank.
To do this, I have used DBSCAN in Python as
Data clusterization:
scaler = StandardScaler()
data_normalized = scaler.fit_transform(data)
data_normalized = data_normalized.T
dbscan = DBSCAN(eps=15, min_samples=2)
clusters = dbscan.fit_predict(data_normalized)
However, I think that there has to be a better way to find relationships between variables (each sensor or columns of the data file).
Could someone please point me towards a methodology more suitable for my goals?
Any answer is appreciated.
Tanks for reading.
Best regards.
Frank.