pyspark.pandas.Series.str.slice_replace¶
-
str.
slice_replace
(start: Optional[int] = None, stop: Optional[int] = None, repl: Optional[str] = None) → ps.Series¶ Slice substrings from each element in the Series.
- Parameters
- startint, optional
Start position for slice operation. If not specified (None), the slice is unbounded on the left, i.e. slice from the start of the string.
- stopint, optional
Stop position for slice operation. If not specified (None), the slice is unbounded on the right, i.e. slice until the end of the string.
- replstr, optional
String for replacement. If not specified (None), the sliced region is replaced with an empty string.
- Returns
- Series of object
Series from sliced substrings from original string objects.
Examples
>>> s = ps.Series(['a', 'ab', 'abc', 'abdc', 'abcde']) >>> s 0 a 1 ab 2 abc 3 abdc 4 abcde dtype: object
Specify just start, meaning replace start until the end of the string with repl.
>>> s.str.slice_replace(1, repl='X') 0 aX 1 aX 2 aX 3 aX 4 aX dtype: object
Specify just stop, meaning the start of the string to stop is replaced with repl, and the rest of the string is included.
>>> s.str.slice_replace(stop=2, repl='X') 0 X 1 X 2 Xc 3 Xdc 4 Xcde dtype: object
Specify start and stop, meaning the slice from start to stop is replaced with repl. Everything before or after start and stop is included as is.
>>> s.str.slice_replace(start=1, stop=3, repl='X') 0 aX 1 aX 2 aX 3 aXc 4 aXde dtype: object