Skip to content

.NET for Apache Spark v2.1.2 in .NET7 ? #1139

Open
@GeorgeS2019

Description

@GeorgeS2019

Update 28thFeb2023

Thanks to @dragorosson
Following the advice by @dragorosson

It is possible in Windows to prepare openjdk-8-jdk, mvm, spark-3.2.3-bin-hadoop3.2 to create

version = 2.1.1
microsoft-spark-3-2/target/microsoft-spark-3-2-3_2.12-[2.1.1].jar

By compiling Microsoft.Spark.Worker in .NET7, instead of .NET6.0, this ensure consistency of addressing the BinaryFormatter compiling error. UDF now works. Delta works.

Next open issue:

  • Key issue left is support for microsoft-spark-3-3.jar
Previous discussions #### Update 23rdFeb2023

Please share your feedback to this observation provided here, but please do this within this issue for tracking purposes.

For Spark, this project is key for .NET developers to stay within .NET when dealing with big data analytics. It is unclear WHY there are questionable and sporadic commitments shown here. If this effort here fails OR with further delay, it could have a ripple effect on the ENTIRE machine learning and deep learning .NET efforts.

The triangular THREE PRONGED .NET efforts to KEEP big data analytics within .NET could be questionable.

  • Machine Learning (ML.NET)
  • PolyGlot
  • .NET for Spark

Update 20thFeb2023

  • Test the WIP .NET6 version
  • Check the WIP .NET6 with microsoft-spark-3-2.jar
    • Check the WIP .NET6 with microsoft-spark-3-2_2.12-2.1.1.jar

      So far, I could only get the WIP merged .NET6 to work with microsoft-spark-3-1_2.12-2.1.1.jar

Update 15thFeb2023

It seems the Azure Synapse team has officially deleted ALL C# .NET for Spark samples for Synapse Jan 2023. Sad!

image

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions