How To Use Pandas UDF Functionality In PySpark
I have a Spark DataFrame with two columns, which looks like:

+-------------------------------------------------------------+------------------------------------+
|docId
Solution 1:
A simple pandas UDF can achieve this:
import base64

from pyspark.sql.functions import pandas_udf
from pyspark.sql.types import StringType

@pandas_udf(returnType=StringType())
def convert_id(id):
    # Strip the dashes from each UUID string, hex-decode it,
    # then Ascii85-encode the raw bytes and decode to str
    converted = id.map(
        lambda x: base64.a85encode(bytes.fromhex(str(x).replace("-", ""))).decode("ascii")
    )
    return converted
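Because a pandas UDF receives each batch of column values as a pandas Series, the core transformation can be checked locally without a Spark session. A minimal sketch, assuming the column holds UUID strings (the sample UUID below is made up for illustration):

```python
import base64
import pandas as pd

def convert_series(ids: pd.Series) -> pd.Series:
    # Same logic as the UDF body: strip dashes, hex-decode, Ascii85-encode
    return ids.map(
        lambda x: base64.a85encode(bytes.fromhex(str(x).replace("-", ""))).decode("ascii")
    )

ids = pd.Series(["12345678-1234-5678-1234-567812345678"])
encoded = convert_series(ids)[0]
print(encoded)  # 20-char Ascii85 form of the 16-byte UUID
```

Ascii85 encodes every 4 bytes as 5 characters, so a 16-byte UUID becomes a 20-character string; `base64.a85decode` on the result recovers the original bytes.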