Feature engineering duty cycles

Complicated time series data; where to begin?
Author

Sjoerd de Haan

Published

January 30, 2025

Industrial time series are hard. Not knowing the system is even harder.

Before modeling, I want to have an understanding of the data. Knowing how your data is generated, leads to better modeling decisions.

Let me talk you through feature engineering duty cycles; or how to understand the system you are modeling based on data.

Diving into the data

Generally we know the system that we are modeling, but what if we don’t?

Figure Figure 1 shows an impression of how the torque of a clutch may vary over over time. Besides this, we have no other information about the system.

Can we learn something about the system, purely based on the data?

Figure 1: Clutch torque, hand-drawn impresison

The clutch torque is rapidly oscillating between 0 and a maximum, and we can make some more observations:

  • Spikes are irregular
  • Spikes frequency increases over time
  • Spikes occur in bursts
  • Bursts are not periodic
  • “Burst packages” follow a pattern that looks like a wave
  • There is a “near zero” period

Basically there are 2 points:

  1. The system is changing over time, in the sense that it is non-stationary.
    We can capture that with some features:
  • Sliding windows of variance, max, min, mean
  • Zero crossing rates; how many times signal crosses mean in a given time window
  1. Engagement / disengagement matters for clutches.
    We can capture that with features like:
  • Max of the peak
  • Min of low
  • Difference between max and min
  • Time to peak
  • Peak width
  • Area under peak
  • Time since last peak
  • Time since last mean crossing
  • Time since last low
  • Indicator engaging / the the disengaging
  • Rate of change (1th, 2th order difference)

Time to peak means how long it reach a peak from the last minimum

For those features we need to do peak identification first.

From the engagement / disengagement features we can derive moving statistics to capture changes; e.g. running average of time to peak, running average of peak height, running average of peak width.

Wrapping up

The process was

  1. Make some observations
  2. Determine what is of interest in the data & systems; in this case
  1. Distribution changes over time (non-stationary)
  2. Engagements / disengagements
  1. Engineer features that expose those characteristics further