Friday, July 11, 2014

Programming Languages for Big Data, Part 2

I mentioned the recent study on the relative speeds of programming languages to Tommy Jones, a specialist in natural language processing and a fellow member of Data Community DC. Being more industrious than I, he dove into the code used by the paper's authors. In their R code, he found gems such as a triple-nested "for" loop inside a "while" loop, where vectorized operations or the "apply" functions would have been the idiomatic and usually much faster choice. That makes the comparisons pretty useless, at least in the case of R. See Tommy's blog, Biased Estimates, for more details.
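To give a feel for the kind of rewrite involved, here is a minimal sketch in Python rather than R. It is not the study's code: the function names and the sample matrix are made up for illustration. It computes row sums twice, once with hand-written nested loops and once by applying the built-in sum to each row, which is roughly what apply(m, 1, sum) does in R.

```python
def row_sums_loop(matrix):
    """Row sums via explicit nested loops with manual indexing."""
    totals = []
    i = 0
    while i < len(matrix):
        total = 0
        for j in range(len(matrix[i])):
            total += matrix[i][j]
        totals.append(total)
        i += 1
    return totals


def row_sums_apply(matrix):
    """Row sums by applying the built-in sum to each row."""
    return list(map(sum, matrix))


if __name__ == "__main__":
    # Hypothetical sample data, chosen only for illustration.
    sample = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    print(row_sums_loop(sample))
    print(row_sums_apply(sample))
```

Both functions return the same result. How much faster the second form runs depends on the language and the data, which is why a fair benchmark should time idiomatic code in each language.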

Nonetheless, it's a pretty interesting question, and I'd love to see someone who's proficient in all of the languages involved run this test again with better code. I'm still intrigued by the very high speed of MATLAB/Octave, something that leads Andrew Ng to recommend those languages over R for prototyping. Tommy pointed out to me, though, that R is closer to being a full-featured language and so is more flexible than either of them.

9 comments:

  1. Nice overview of big data programming languages 👍
    Informative post for learners exploring big data technologies. Thanks for sharing!
  2. Great overview of programming language options for big data: very informative and easy to understand.
    I especially appreciate how you break down strengths and use-cases for each language.
    Thanks for sharing this useful resource.
  3. Interesting discussion about programming languages used in big data development. The insights help readers understand which technologies are commonly used in the data ecosystem. Very informative article.
  4. Informative post for learners exploring big data technologies.
  5. Great article! The discussion about programming languages for big data is still highly relevant as organizations continue to process and analyze massive datasets. Choosing the right language depends on factors such as scalability, ecosystem support, performance, and integration capabilities. It's interesting to see how big data technologies have evolved alongside machine learning and AI. I recently explored Generative AI Training in Hyderabad and found it useful for understanding how modern AI systems leverage large-scale data processing, analytics, and distributed computing frameworks. Thanks for sharing these valuable insights on big data technologies.

  6. Great post! Understanding the strengths of different programming languages is essential for anyone working with Big Data, analytics, and AI. The information shared here provides useful guidance for developers and data professionals. Brolly Academy's Generative AI Training in Hyderabad also helps learners gain practical experience with AI, data engineering, and modern technology stacks. Thanks for the informative article!

  7. Excellent article! The comparison of programming languages used in Big Data is very informative. Understanding the strengths of languages like Java, Python, R, and Scala is essential for building effective data solutions.