python - pyspark 'DataFrame' object has no attribute '_get ...
https://stackoverflow.com/questions/57363618I am trying to run some code, but getting error: 'DataFrame' object has no attribute '_get_object_id' The code: items = [(1,12),(1,float('Nan')),(1,14),(1,10),(2,22 ...
python - pyspark 'DataFrame' object has no attribute '_get ...
stackoverflow.com › questions › 57363618I am trying to run some code, but getting error: 'DataFrame' object has no attribute '_get_object_id' The code: items = [(1,12),(1,float('Nan')),(1,14),(1,10),(2,22 ...
[pyspark] AttributeError: ‘DataFrame’ object has no attribute ...
cumsum.wordpress.com › 2020/10/10 › pysparkOct 10, 2020 · Unfortunately this throws a big error: AttributeError: ‘DataFrame’ object has no attribute ‘_get_object_id’. The reason being that isinexpects actual local values or collections but df2.select('id')returns a data frame. Solution: The solution to this problem is to use JOIN, or inner joinin this case: df.join( df2.select('id').drop_duplicates(), # df2 with id column on=['id'], # join on id how='inner' # inner join to keep only common ids).show()+---+---+---+| ...
AWS Developer Forums: AttributeError: 'DataFrame'
forums.aws.amazon.com › messageAttributeError: 'DataFrame' object has no attribute '_get_object_id' when I run the script. I'm pretty confident the error is occurring during this line: datasink = glueContext.write_dynamic_frame.from_catalog(frame = source_dynamic_frame, database = target_database, table_name = target_table_name, transformation_ctx = "datasink")