python (12.9k questions)
javascript (9.2k questions)
reactjs (4.7k questions)
java (4.2k questions)
java (4.2k questions)
c# (3.5k questions)
c# (3.5k questions)
html (3.3k questions)
Is it possible to define recursive DataType in PySpark Dataframe?
I want to create a schema like this example:
friendSchema = StructType([ \
StructField("firstname",StringType(),True), \
StructField("middlename",StringType(),True), \
...
Omid
Votes: 0
Answers: 4
GroupBy based on condition Pyspark
I have a peculiar problem to which I don't see an easy solution (and efficient one). I have a list of addresses (more than 10M) with different user_time_active. I also have count per user_time_active ...
Martin Walczyński
Votes: 0
Answers: 1
How to use spark.DataFrameReader from Foundry Transforms
I have a file with non-classic formatting so I need to use the spark.DataFrameReader (spark.read.csv) on the raw file directly so that I can set the appropriate parsing configurations.
How can I do th...
Dovid Braverman
Votes: 0
Answers: 1
Prevent commas in open text columns from messing up spark read csv
I have a directory of CSVs that has some text columns that DO NOT have quotes around them (I cannot control this as these are from an external source). These open text columns have commas in them and...
ben890
Votes: 0
Answers: 1