pyspark.sql.functions.
array_repeat
Collection function: creates an array containing a column repeated count times.
New in version 2.4.0.
Changed in version 3.4.0: Supports Spark Connect.
Column
column name or column that contains the element to be repeated
column name, column, or int containing the number of times to repeat the first argument
an array of repeated elements.
Examples
>>> df = spark.createDataFrame([('ab',)], ['data']) >>> df.select(array_repeat(df.data, 3).alias('r')).collect() [Row(r=['ab', 'ab', 'ab'])]