* Posts by Jon Reade

7 publicly visible posts • joined 20 Apr 2014

You need to assemble a crack AI team: Where do you even start?

Jon Reade

De-Hype

The problem I have is not the article, but the generic use of the term AI across the media. Use of it immediately discredits virtually any article, as we're not even close to AI yet. Like endless power from nuclear fusion, it's this year's hype phrase, banded about by journalists and CEOs, without the faintest clue as to what it means or how far we are from it. Just like Big Data, Hadoop, Prince, etc etc before it. It's even on Gartner's 2017 hype curve, far on the left hand side under General Purpose Machine Intelligence.

Machine learning, data science and predictive analytics to usefully employ as a tool alongside people to make money/cut costs/improve productivity? Yes. Although all of those are still in their infancy in terms of being incorporated into deliverable systems. But AI? No, it's still in the pre-Asimov era, we're not even close, in all but a handful of specialist, well defined skills.

Just on a technical inaccuracy: You don't need C/C++ coders to take advantage of GPU cards for machine learning. Sure, you can use it, just as you can use assembler to write your own word processor, if you have a few decades and a bottomless pit of cash. The better alternative is to use Google's Tensorflow, and Theano, both high-level, optimised, broad functionality machine learning libraries with a variety of models for different tasks. These libraries will talk to GPU cards via their CUDA drivers, removing the need for C++ people to unproductively write tedious less time productive code. The hard work has already been done. Python and Tensorflow (with maybe Keras sat between as a high level framework) is mainly where it's at both commercially and academically, with R and Theano also up there. But not C++, unless you want to hand craft or optimise the low level code of your own ML libraries and credibly believe your developers are better at both C++ and machine learning than Google's/MILA's engineers).

Inside the Hekaton: SQL Server 2014's database engine deconstructed

Jon Reade

"SQL Server revenue market share is growing. It is also fastest large database (check TPC benchmarks)."

Hi Slawek,

SQL Server's market share is I believe now at over 50% of corporate database platforms purchases, though I can't recall if this is by volume or dollar sales.

With regards to the TPC benchmarks, I think these are a good indicator of performance, though it's essential to look at those particular tests that match the workload that you intend to present to the platform. Ultimately, all of the major platforms are good, relatively stable pieces of software that have exceptional performance. However, a purchasing decision, even on a greenfield project, will be influenced by factors such as available skill set, legacy investment in other supporting technologies and the requirement for supporting specific applications or development requirements.

Although I pinned my colours to SQL Server's mast many moons ago, I think it's important to recognise it may not be the database of choice, even if it is the fastest, for all shops, no more than Oracle or DB2 may be. But from a production and operational cost point of view, I'd consider any one of these platforms in preference to a less mature platform - the licensing costs are often dwarfed by the installation, ongoing maintenance and overtime costs incurred in a less reliable product, a criticism I would have equally applied to SQL Server back in 1996.

In my opinion, where Microsoft definitely score bonus points is in the security of their product (not a praise it could once be considered worthy of), the tightly integrated development tools, and of course the ETL, OLAP and reporting products that ship with it out of the box, which can often be a costly additional purchase with other platforms. But equally I do think Microsoft have to watch their licensing policy as it's causing much disconcert and consolidation out here in the real world, and it's causing it right now.

Jon.

Jon Reade

Re: A pinch of salt

Yes, it's worth reading about, the implementation is that rare mix of intelligent simplicity. Again, I'd point interested readers to attend Tony Rogerson's presentation on Hekaton if you're lookig for a more in-depth understanding of how things are implemented, it's fascinating and answers a lot of questions everyone has when they first encounter this technology, including the authors.

Jon Reade

Hi Ian,

Point conceded, I personally deal with mostly Enterprise Edition installations, and we should have pointed that out. Thank you for your correction, noted.

Jon Reade.

Jon Reade

Re: Greed is good. At making you stupid.

DJ - agreed. I've seen clients suffer sharp price increases in their SQL Server estate and will do so over the next 12-24 months as they upgrade. It's still cheaper than the competition by a long way, but I'm very concerned that Microsoft will kill off the goose that laid the golden egg. I'm equally as critical as yourself of their pricing policy, and of the feature set that is being gradually eroded or stagnating in SQL Server Standard Edition.

Jon Reade

Re: Data Warehouse customers

Hi Richard,

The Parallel Data Warehouse applicance is a data warehousing applicance based on SQL Server. There's a short, high level overview article on TechNet at:

http://blogs.technet.com/b/dataplatforminsider/archive/2012/11/09/seamless-insights-on-structured-and-unstructured-data-with-sql-server-2012-parallel-data-warehouse.aspx