cudf.core.column.string.StringMethods.character_tokenize

StringMethods.character_tokenize() → SeriesOrIndex

Split each string into its individual characters. The returned sequence contains every character, including spaces and punctuation, as a separate single-character string.

Returns

Series or Index of object.

Examples

>>> import cudf
>>> data = ["hello world", None, "goodbye, thank you."]
>>> ser = cudf.Series(data)
>>> ser.str.character_tokenize()
0     h
1     e
2     l
3     l
4     o
5
6     w
7     o
8     r
9     l
10    d
11    g
12    o
13    o
14    d
15    b
16    y
17    e
18    ,
19
20    t
21    h
22    a
23    n
24    k
25
26    y
27    o
28    u
29    .
dtype: object
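
The same splitting can be approximated without a GPU. The following is a minimal pure-Python sketch, not cuDF's implementation: `character_tokenize_reference` is a hypothetical helper name, and it assumes that a character corresponds to one Unicode code point and that null entries contribute no output, as the example above suggests.

```python
from typing import Iterable, List, Optional


def character_tokenize_reference(values: Iterable[Optional[str]]) -> List[str]:
    """Hypothetical pure-Python reference; not part of cudf."""
    tokens: List[str] = []
    for value in values:
        if value is None:
            # Assumption taken from the example above: null rows yield no tokens.
            continue
        # Iterating a Python str yields one Unicode code point at a time.
        tokens.extend(value)
    return tokens


data = ["hello world", None, "goodbye, thank you."]
tokens = character_tokenize_reference(data)
print(len(tokens))   # 30
print(tokens[:6])    # ['h', 'e', 'l', 'l', 'o', ' ']
```

The flat list mirrors the flattened output shown above: positions 5, 19, and 25 hold the space characters, which is why those rows appear blank in the printed Series.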