<div dir="ltr">P.S. I should say that I don't have a horse in this race, I am not selling anything, I am not pushing anything. I am just asking :-)<br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Wed, Oct 14, 2020 at 9:02 AM Oddo Da <<a href="mailto:oddodaoddo@gmail.com">oddodaoddo@gmail.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div dir="ltr">On Wed, Oct 14, 2020 at 8:42 AM Michael Di Domenico <<a href="mailto:mdidomenico4@gmail.com" target="_blank">mdidomenico4@gmail.com</a>> wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On Wed, Oct 14, 2020 at 8:02 AM Oddo Da <<a href="mailto:oddodaoddo@gmail.com" target="_blank">oddodaoddo@gmail.com</a>> wrote: </blockquote><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">i think you're implying (perhaps not consciously or i'm reading more<br>
into your statements) that MPI/PVM are the only frame works for<br>
message passing out there. this isn't true, charm, upc, shmem, etc<br>
have all been developed to do basically what MPI does. MPI (i<br>
believe) is just the only one that's ratified into a standard. and<br>
thus it provides the code stability interconnect vendors need to write<br>
the shim code between the hardware and the language<br></blockquote><div><br></div><div>Perhaps but I am seeing these at the same level of abstraction - the low level where you have to spell everything out. <br></div><div><br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
i think your "why" spark evolved separately is flawed. spark/hadoop<br>
didn't evolve out of a company's lack of ability to pay for a cluster<br>
or a filesystem. It was designed to solve a very specific problem.<br>
bigdata processing. you could certainly write an mpi program to do<br>
the same thing, but the design of spark/hadoop does it more<br>
efficiently and at a lower cost. but the point is each language does<br>
something specific really well, spark does data processing, mpi can do<br>
complex math (yes no need to circle the flaming wagons, both can do<br>
both, but each does one better than the other)<br></blockquote><div><br></div><div>I see what you wrote above as contradicting yourself. First you say that Spark/hadoop did not evolve from economic motivations but then you say that it did. You say that Spark and MPI can be used interchangeably. OK if that's the case, why does the data science/ML industry just not use MPI everywhere. Why did they not start with MPI as the underlying paradigm and just build all the tooling on top of it? I wish they did... ;-)</div><div> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
i believe your "lack of progress" statement is really just a<br>
misunderstanding of what mpi really represents. to me MPI is like the<br>
flathead screw, they've been around a long time and there are<br>
certainly a ton of alternate head designs on the market. however,<br>
wooden boat builders still use them because they just work, they're<br>
easy to make, and they're easy to fix when it comes time to repair<br></blockquote><div><br></div><div>I see MPI as a low level solution for a problem, at the abstraction level where you need to spell everything out. It is like the comparison between C and languages like Scala or Haskell or Julia. I am asking why there is no progress on the latter in this scenario - we have the message passing interface level of abstraction, why are we not interested in using this to build tooling that is at the higher level, where we can hide the how and focus on the what.</div><div> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
you're equating a 20yr old book with lack of progress and frankly i<br>
think that's a flawed statement.<br></blockquote></div><div class="gmail_quote"><br></div><div class="gmail_quote">20 years ago, when I was an undergrad, I took a 200-level course in data structures and algorithms and it was taught in a programming language called Eiffel. The professor started with saying - "Eiffel is gaining in popularity, there are many new books being written about Eiffel but none about C. Do you know why?". I raised my hand and said "because C is an old language and nothing has changed and everything that was to be said about it, was already said about it in many previous books". At least in that domain we could discuss these things - new languages and paradigms and tooling abound. In the world of HPC, not so much.<br></div><div class="gmail_quote"><br></div><div class="gmail_quote"><br></div></div>
</blockquote></div>