SubsetDataByContinuousCol.RdExample. If you want to only keep the bottom 25% and top 75% of data, then the lower.bound.percent will be 0.25 and the upper.bound.percent will be 0.75.
SubsetDataByContinuousCol( inputted.data, col.name.to.subset.on, lower.bound.percent, upper.bound.percent )
| inputted.data | A dataframe. |
|---|---|
| col.name.to.subset.on | Name of column with continuous data for subsetting. |
| lower.bound.percent | A numeric value from 0 to 1 that specifies the first bound. |
| upper.bound.percent | A numeric value from 0 to 1 that specifies the second bound. |
A subsetted dataframe.
Other Preprocessing functions:
AddColBinnedToBinary(),
AddColBinnedToQuartiles(),
AddPCsToEnd(),
ConvertDataToPercentiles(),
CorAssoTestMultipleWithErrorHandling(),
DownSampleDataframe(),
GenerateElbowPlotPCA(),
GeneratePC1andPC2PlotsWithAndWithoutOutliers(),
Log2TargetDensityPlotComparison(),
LookAtPCFeatureLoadings(),
MultipleColumnsNormalCheckThenBoxCox(),
NormalCheckThenBoxCoxTransform(),
RanomlySelectOneRowForEach(),
RecodeIdentifier(),
RemoveColWithAllZeros(),
RemoveRowsBasedOnCol(),
RemoveSamplesWithInstability(),
SplitIntoTrainTest(),
StabilityTestingAcrossVisits(),
TwoSampleTTest(),
ZScoreChallengeOutliers(),
captureSessionInfo(),
correlation.association.test(),
describeNumericalColumnsWithLevels(),
describeNumericalColumns(),
generate.descriptive.plots.save.pdf(),
generate.descriptive.plots()