doenut.data.modifiers.column_selector

Module Contents

Classes

ColumnSelector

DataSet Modifier to remove columns from the dataset

class doenut.data.modifiers.column_selector.ColumnSelector(inputs: pandas.DataFrame, responses: pandas.DataFrame, input_selector: List[str | int] = None, response_selector: List[str | int] = None)[source]

Bases: doenut.data.modifiers.data_set_modifier.DataSetModifier

DataSet Modifier to remove columns from the dataset

Parameters:
  • inputs (pd.DataFrame) – The dataset’s inputs

  • responses (pd.DataFrame) – The dataset’s responses

  • input_selector (List["str | int"], optional) – A list to filter the inputs by

  • response_selector (List["str | int"], optional) – A list to filter the responses by

Warning

At least one of input_selector and response_selector must be specified.

classmethod _parse_selector(data: pandas.DataFrame, selector: List[str | int]) Tuple[List[str], List[int]][source]

Internal helper function to take either a list of column names or column indices and convert it to the other.

Parameters:
  • data (pd.DataFrame) – The data set the list applies to

  • selector (List["str | int"]) – The known selector list

Returns:

  • List[str] – The list of column names selected

  • List[int] – The list of column indices selected

apply_to_inputs(data: pandas.DataFrame) pandas.DataFrame[source]

Applies the modifier to the inputs of the dataset.

Parameters:

data (pd.DataFrame) – The input data

Returns:

The modified input data

Return type:

pd.DataFrame

apply_to_responses(data: pandas.DataFrame) pandas.DataFrame[source]

Applies the modifier to the responses of the dataset.

Parameters:

data (pd.DataFrame) – The response data

Returns:

The modified response data

Return type:

pd.DataFrame