This blog features classification in Mahout and the underlying concepts. I will explain the basic classification process, training a Logistic Regression model with Stochastic Gradient Descent and a give walkthrough of classifying the Iris flower dataset with Mahout.
NLUUG DevOps Conference 2013 – Reliability, clouds and the UNIX way
Last Thursday I attended the NLUUG DevOps conference in Bunnik, near Utrecht. The NLUUG is the Dutch UNIX user group. In this blog I will summarize the talks I attended, some fun things I learned and I will discuss my own talk about continuous integration at a large organization.
Bash – A few commands to use again and again
Introduction These days I spend a lot of time in the bash shell. I use it for ad-hoc scripting or driving several Linux boxes. In my current project we set up a continuous delivery environment and migrate code onto it. I lift code from CVS to SVN, mavenize Ant builds and funnel artifacts into Nexus. […]
QCon London 2013 – Simplicity, complexity and doodles
Westminster Abbey – View from the Queen Elizabeth II conference center …and now back home On my desk lies a stack of notepads from the QCon sponsors. I pick up one of them and turn few pages trying to decipher my own handwriting. As I read my notes I reflect back on the conference. QCon […]
Build massively scalable soft real-time systems with Erlang
Today I just wanted to take the opportunity to introduce you to Erlang Solutions (a part of the Trifork group). From later this week, Erlang Solutions starts a new series of webinars aiming to showcase practical use cases of Erlang. As an open source language designed for programming concurrent, real-time, distributed fault-tolerant systems, Erlang found […]
Migrating Apache Solr to Elasticsearch
Elasticsearch is the innovative and advanced open source distributed search engine, based on Apache Lucene. Over the past several years, at Trifork we have been doing a lot of search implementations. Driven by the fact that every other customer wanted the ‘Google-experience’ (just a text box, type some text and get relevant results) as part […]
There’s More Lucene in Solr than You Think!
We’ve been providing Lucene & Solr consultancy and training services for quite a few years now and it’s always interesting to see how these two technologies are perceived by different companies and their technical people. More precisely, I find it interesting how little Solr users know about Lucene and more so, how unaware they are […]
Berlin Buzzwords 2012
Yes, Berlin Buzzwords is back on the 4th & 5th June 2012! This really is only conference for developers and users of open source software projects, focusing on the issues of scalable search, data-analysis in the cloud and NoSQL-databases. All the talks and presentations are specific to three tags; “search”, “store” and “scale”.. Looking back […]
Hot off the press, as of today “JTeam” will be known as “Dutchworks”!
Many of you probably know that JTeam has gone through quite the evolutionary process over the last two years; the combination of our two acquisitions (Func & Net Effect), and our strong autonomous growth has resulted in over 250% increase in revenue and number of Dutchworkers. Further, it has allowed us to develop a services […]
Indexing your Samba/Windows network shares using Solr
Many of JTeam’s clients want to search the content of their existing network shares as part of their Enterprise Search infrastructure. Over the last couple of years, more and more people are switching to Apache Lucene / Solr as their preferred, open source search solution. However, many still have the misconception that it is not […]