Listing of all 55 Speakers


Aaron Beppu

Aaron Beppu is a software engineer with the Search and Data team at Etsy.com. His primary focus is improving search through large-scale analysis and mining of clickstream data. Prior to coming to Etsy, he worked for Amazon on the Search Analytics team at A9.com.

Alan Gates

Alan is a co-founder of Hortonworks, where he leads development on Pig, HCatalog, and Hive. He is an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan also designed HCatalog and guided its adoption as an Apache Incubator project. Alan has a BS in Mathematics from Oregon State University and an MA in Theology from Fuller Theological Seminary. He is also the author of Programming Pig, a book from O’Reilly Press.

Alex Baranau

Alex Baranau is a Software Engineer at Sematext, a New York-based products and services company with extensive expertise in search and big data. For the last few years Alex has been working on complex data analytics-focused projects that utilize Hadoop, HBase, and Flume. During that time Alex contributed to HBase and Flume, and has created several open-sourced projects. He writes articles covering Hadoop, HBase, and related technologies.

Alex Lloyd

Alex Lloyd is a Senior Staff Software Engineer in the storage infrastructure group at Google. In this role, he led the replication implementations for Megastore and Spanner, global storage systems that underpin a wide array of Google services. He is currently working on distributing multiversion concurrency control algorithms. Prior to his current role, Alex worked with the Blogger team. Before joining Google in 2004, Alex built mobile sync middleware at BEA Systems. As an undergraduate, his dorm-room startup led him to his first job at Object Design, where he worked on visual templating tools. Alex has a bachelor’s degree in computer science from Harvard University.

Alex Pinkin

Alex Pinkin is a contributor to a few open source projects including JBoss Application Server. He currently works as a director of engineering at Bazaarvoice. His teams focus on big data problems, analytics, and search. Alex lives in Austin, TX where he enjoys sunny weather and outdoor life when he is not glued to a computer.

Andrzej Bialecki

Andrzej Bialecki, Apache Lucene PMC Member, is a committer in the Lucene-java, Nutch and Hadoop projects. He has broad expertise across domains as diverse as information retrieval, systems architecture, embedded systems, networking and business process/e-commerce modeling. He's also the author of the popular Luke index inspection utility. Andrzej holds a master's degree in Electronics from Warsaw Technical University.

Bertrand Delacretaz

Bertrand Delacretaz works as a Senior Developer in the CQ5 R&D team of Adobe Systems Inc (www.day.com), using open source tools to create world-class content management systems and frameworks. Bertrand is an active member and current director of the Apache Software Foundation, involved in a number of Apache projects as a committer, PMC member and incubation mentor.

Chris Harris

Chris Harris is a European Solution Architect at 10gen. Prior to 10gen, Chris was EMEA Architect at SpringSource responsible for evangelising vFabric products and defining architectural solutions for customers across EMEA. Chris specializes in addressing and simplifying complex middleware architectures within development and operational environments. With the acquisition of SpringSource by VMware, Chris focused on how virtualization and cloud computing can be used to address the complexity within the Enterprise. Before joining SpringSource, Chris spent his time at RedHat/JBoss providing consultancy to major clients across EMEA.

Christian Gügi

Christian is a Software Architect at sentric.ch where he is working extensively with open source software on big data and search problems. He has over ten years of experience in software system architecture, design and development, recently focusing on processing large data sets using the Hadoop ecosystem. He is also a co-founder and organizer of the Swiss Hadoop User Group.

Christian Moen

Christian founded Atilika in 2009 to help Japanese companies apply advanced search technologies to their businesses.

Atilika is an innovation partner and proficient in the fields of search, recommendations, computational linguistics and data mining, and has experience applying these technologies to leading businesses within news media, publishing, advertisement and e-commerce in Japan.

Prior to starting Atilika, Christian spent 10 years at FAST (now Microsoft). In FAST R&D headquarters in Oslo, Christian led product development teams responsible for developing the FAST ESP (Enterprise Search Platform) product infrastructure. In 2005, he relocated to the Tokyo branch office and served in management and architecture roles.

Christian is a new Lucene/Solr committer and he's mostly working on Japanese support.

Christian holds an M.S. in computer science from the University of Oslo, Norway.

Christian Richter

Christian Richter is the founder of altusInsight, a big data startup that makes it easy for its customers to employ big data technologies to gain a competitive advantage in their market segment. Prior to this he built the backend infrastructure of MOG Inc., an on-demand audio streaming service, and implemented a recommendation engine as a distributed system. He was awarded a patent as a co-inventor on identification of audio data during his work at Fraunhofer Gesellschaft.

Christoph Goller

Christoph Goller has more than 10 years of experience in the search industry. He received a Ph.D. in computer science from the Technical University of Munich, where he worked on several research projects in artificial intelligence, machine learning and neural networks. Christoph started his career at Lernout & Hauspie. Since 2002 he has been Director of Research at IntraFind Software AG (www.intrafind.de), a German company specializing in full-text search and text mining based on Lucene and Solr. Christoph has been a Lucene committer since 2004. He has accompanied dozens of commercial projects using Lucene and Solr. Christoph is the author of more than 15 scientific papers, frequently gives presentations on search-related topics (e.g. at Lucene Revolution 2011) and is responsible for partner training at IntraFind.

Doug Judd

Doug is the CEO of Hypertable Inc., the company that develops and provides professional services for Hypertable, a high performance, open source, scalable NoSQL database. Doug was the original creator of the Hypertable project and has led the Hypertable development team from its inception. He first gained big data experience when he joined Inktomi's Web Search division in 1997. There he held both management and engineering positions and was a driving force behind the development of the company's distributed crawling and indexing infrastructure. With twenty years in the technology industry, Doug has a breadth of experience working for a broad range of companies including Booz Allen Hamilton, Cambrian Ventures, Inktomi, Kosmix (@walmartlabs), Verity, Unisys, and Zvents. Doug earned a B.S. in Computer Science from U.C. Santa Barbara in 1992 and holds four patents in search technology.

Dragan Milosevic

04/2011-present: Lead Architect Distributed Computing, Zanox.de AG, Berlin

01/2008-03/2011: Senior Architect, Zanox.de AG, Berlin

06/2006-12/2007: Senior Research Scientist, DAI-Lab, Technical University Berlin

01/2002-05/2006: Postgraduate Researcher, DAI-Lab, Technical University Berlin

04/2000-10/2001: Teaching Assistant, University of Nis, Serbia

11/2001-12/2001 and 10/2000-12/2000: Visiting scientist and DAAD fellow, Computing Science, Dortmund

07/2000-06/2001: Part-time worker, RistanCase, Zürich

...

Research: Hadoop and Lucene related technologies, several machine learning techniques, clustering techniques, information retrieval, logical programming with Mathematica

Development: Complex data analyses with Hadoop, Distributed reporting systems with Lucene, Product categorisation and indexing, Resource-aware coordination techniques

Teaching: Smart information filtering and filtering services, Programming techniques and database systems, Database techniques

...

06/2006: Obtained Dr.-Ing. degree (very good), TU Berlin

09/2001: Obtained M.Sc. degree (10.00/10.00), University of Nis

09/1999: Obtained Dipl.-Ing. degree (10.00/10.00), University of Nis

...

Author of "Beyond Centralised Search Engines: An Agent-Based Filtering Framework", Vdm Verlag Dr. Müller

Eric Evans

Eric has more than a decade of experience in large-scale distributed systems, having held roles in both operations and engineering. An early employee of Rackspace, he implemented a global DNS infrastructure utilizing IP anycast (possibly the first), and a novel data-center-wide IDS for which a patent was awarded.

An avid open source hacker, Eric is a developer with the Debian Project and a member of the Apache Cassandra PMC. He resides in Texas and works on distributed systems for Acunu.

Frank Scholten

I am a software developer at SearchWorkings and contributor to the Apache Mahout & Apache Whirr projects. One of my initial goals with my involvement in these projects was to help make Mahout more accessible for both developers and system integrators. As part of this work I’ve developed the Mahout Whirr service which dramatically simplifies the deployment of Mahout on large clusters and cloud environments. I’ve also started an initiative for wrapping the different Mahout jobs with more developer friendly APIs. More recently I’ve been focusing my work on building tighter integration between Lucene and Mahout.

Friso van Vollenhoven

Friso is a developer who has lately been setting up and using Hadoop a lot for a living. Also, he is a trainer teaching the Cloudera Hadoop developer classes and (co-)organizer of the Dutch Hadoop community meetup (NL-HUG) and the Dutch NoSQL NL meetup.

Grant Ingersoll

Grant Ingersoll is a co-founder of Lucid Imagination as well as an active member of the Lucene community – a Lucene and Solr committer, co-founder of the Apache Mahout machine learning project and a long standing member of the Apache Software Foundation. Grant’s prior experience includes work at the Center for Natural Language Processing at Syracuse University in natural language processing and information retrieval. Grant earned his B.S. from Amherst College in Math and Computer Science and his M.S. in Computer Science from Syracuse University. Grant is also the co-author of the upcoming "Taming Text" from Manning Publications.

Ian Plosker

Ian Plosker is a Developer Advocate at Basho Technologies, the makers of Riak, an open source distributed NoSQL database. Ian has years of experience designing real world applications on datastores of all shapes and sizes. He uses those experiences every day to help users of Riak successfully exploit its capabilities of fault-tolerance, availability, and low latency.

Ioan Eugen Stan

Long time free software/technology enthusiast, turned firefighter by twist of fate, turned software developer by own will, passionate about distributed computing, information retrieval and big data. Currently: Apache James committer, employed by 1and1 Internet Romania (division of United Internet), working with Mahout, HBase and Lucene to provide search and clustering for email data.

Plans for James: working on making Apache James work in clustering mode to provide the best distributed email server. I will tell you how at the conference.

Jakob Homan

Senior software engineer at LinkedIn working on the Hadoop ecosystem. Apache Hadoop and Giraph committer and PMC member, as well as contributor to Apache Hive, Avro, Pig and Kafka.

Jodok Batlogg

Determined, straight, innovative and energetic Entrepreneur. Loves to move things, motivate people and work on hard problems. Agile Leader and pragmatic Web Enthusiast.

Jodok Batlogg is CEO at Lovely Systems and works with a highly motivated team in the Austrian Alps. He was CTO at VZnet Netzwerke Ltd. and is a well-known speaker at various conferences (Nokia World, Plone/Zope conferences, and many more). He was a member of the board of directors of the Zope and Plone Foundation. Jodok is experienced in agile processes and needs no bugtracking system :)

Joel Westberg

I'm a consultant in enterprise search at Findwise, a leading vendor-independent consultancy firm headquartered in Sweden, with offices in Sweden, Norway, Denmark, Poland and UAE. At Findwise, I’ve been part of search projects using Solr at half a dozen customers, both as a developer and as technical lead and architect.

Aside from these projects, I’m also the lead developer and architect of the Hydra document processing framework.

Kai Voigt

Kai is a training instructor at Cloudera. Prior to Cloudera, he spent 6 years as a principal instructor, teaching all MySQL courses delivered from Oracle, and did this globally. Kai currently resides in Kiel, Germany, and holds a Master's degree in computer science.

Karel Minarik

I am a web designer and developer living in Prague, Czech Republic. I work primarily with Ruby, Rails and NoSQL databases.

In the last couple of years, I've been focused on creative uses of Lucene-based search engines for querying, aggregating and data-mining applications.

Find out more about me in my online resume at https://www.linkedin.com/in/karelminarik.

Lars George

Lars has been building scalable web applications since 2000, and got involved in Hadoop and HBase at the end of 2007. With his previous company WorldLingo, he had one of the first HBase clusters in production. He is now a Solutions Architect with Cloudera and helps customers come to terms with Big Data challenges. He is also the O'Reilly author of "HBase - The Definitive Guide", as well as a regular speaker at Hadoop related events around the world. He is a committer for both Apache HBase and Apache Whirr.

Leslie Hawthorn

An internationally known community manager, speaker and author, Leslie Hawthorn has over 10 years of experience in high tech project management, marketing and public relations. In March 2012 she joined Red Hat, Inc., where she is responsible for Community Action & Impact on the company's Open Source and Standards team. Prior to Red Hat, she served as Outreach Manager at Oregon State University’s Open Source Lab and as a Program Manager for Google’s Open Source Team, where she managed the Google Summer of Code Program, created the contest now known as Google Code In and launched the company’s Open Source Developer Blog.

Lukas Kahwe Smith

Lukas has been working on web applications since 1999, mostly leveraging PHP, authoring various popular PHP components, and serving as co-release manager for PHP 5.3. He has always been passionate about database technologies, though: initially mostly RDBMS, then increasingly Lucene-based full-text search solutions, and these days PHPCR, a port of the Java Content Repository specification to PHP. He currently works in Switzerland for Liip AG. When not coding he is busy beating up his body playing ultimate frisbee.

Markus Weimer

Dr. Markus Weimer is a Principal Scientist working in the Cloud Information Services Laboratory (CISL), Server and Tools Business (STB) at Microsoft.

His research area is Machine Learning with an emphasis on large scale algorithms, applications and systems. In the past, Markus has worked on collaborative filtering and ranking, abuse prevention and detection models at Yahoo! Labs and applications of Machine Learning to the educational domain. Markus received his PhD (summa cum laude) from the Technische Universität Darmstadt, Germany in 2009 under the joint supervision of Alex Smola (Australian National University) and Max Mühlhäuser and joined Yahoo! Labs in the same year.

Martijn van Groningen

Martijn van Groningen is a programmer and search consultant at SearchWorkings. Martijn is also a Lucene committer and PMC member. He has made a significant contribution to the community through his work on result grouping and joining in Lucene and Solr. Working extensively with the technologies for the past 4 years, he has helped many companies resolve their Solr & Lucene challenges.

Matt Patterson

I've been building for the web for over 10 years, doing everything from web design and front-end development all the way through to back-end development.

I've worked with JavaScript, Perl, PHP, Python, Ruby, and various XML technologies. In my day-to-day work I mainly work with Ruby, Rails, HTML/CSS, and JavaScript. I've been exploring various kinds of data storage techniques for years, trying everything from RDF stores, through conventional SQL RDBMS, KV stores and Document databases, and I was one of the organisers of the 2010 NoSQL Europe conference. Last year I organised the first History Hack Day.

Along the way I've co-written a book on CSS (Friends of Ed's Cascading Style Sheets: Separating Content from Presentation), which is currently in its second edition and has been translated into Italian and Spanish, led a software team at the BBC, and spent five of the last ten years running design- and development-focussed consultancies. Recently, I was part of the small development team which built the Alpha prototype of the UK Government's landmark https://www.gov.uk/ project.

Michael Brückner

Between 2005 and 2011, Michael Brückner co-developed the content-based email spam filter of STRATO AG, which processes several hundred million emails daily. In December 2011, he joined SoundCloud as Anti-Spam Engineer.

Michael is currently completing his PhD in machine learning at the University of Potsdam. His research focuses on supervised learning in adversarial environments where the construction of a prediction model establishes a game between a model builder and an adversarial data generator. He has published and presented his findings at all major conferences on machine learning, including the International Conference on Machine Learning (ICML), the international conference on Knowledge Discovery and Data Mining (KDD), and the international conference on Neural Information Processing Systems (NIPS). He is a reviewer for several conferences and journals on machine learning and computer security, such as ICML, KDD, NIPS, the Data Mining and Knowledge Discovery Journal, the Transactions on Data Privacy Journal, and the Conference on Collaboration, Electronic Messaging, Anti-Abuse and Spam.

Michael Busch

Michael Busch is the technical lead of Twitter's search infrastructure team. He designed and implemented Twitter's current real-time search index, which is based on Apache Lucene and serves more than 2 billion queries per day, while indexing more than 300 million documents per day.

Prior to Twitter, Michael worked at IBM's Silicon Valley Lab on Enterprise Search and eDiscovery software. There he worked closely with IBM's research group on novel indexing technologies.

Michael has contributed to Apache Lucene since 2005 and has been a committer on the project since 2006. During those years he added several features like posting payloads, multi-level posting list skipping and various indexing and search performance improvements. Most recently he helped implement a new indexer threading model in preparation for Twitter's real-time indexing features.

Michael studied Computer Science at the University of Paderborn in Germany.

Michael Hunger

Michael Hunger has been passionate about software development for a long time. He is particularly interested in the people who develop software, software craftsmanship, programming languages, and improving code. For the last two years he has been working with Neo Technology on the Neo4j graph database. As the project lead of Spring Data Neo4j he helped develop the idea into a convenient and complete solution for object graph mapping. He is also taking care of Neo4j cloud hosting efforts. Good relationships are everywhere in Michael’s life. His “real” life concerns his family and children, running his coffee shop and co-working-space “die-buchbar”, living in the depths of a text-based multi-user dungeon, tinkering with and without Lego® and much more. As a developer he loves to work with many aspects of programming languages, learning new things every day, participating in exciting and ambitious open source projects and contributing to different programming related books. Michael is also an active editor and interviewer at InfoQ.

Mikio Braun

Machine learner and data scientist, Ph.D. from the University of Bonn in 2005, now working as a PostDoc at TU Berlin and co-founder of TWIMPACT, a startup focussing on real-time social media analysis.

Mircea Markus

Mircea Markus joined JBoss's clustering team in 2007 as a core engineer. He has been working on various clustering components, such as JBossCache, PojoCache and JGroups. He is one of the founders of the Infinispan project, on which he has been concentrating his efforts for the last three years. He is also the founder of the Radargun project, a benchmarking framework for data grids.

Nico Kruber

[2009-now] Research assistant at ZIB

Nicolas Spiegelberg

Nicolas Spiegelberg is a storage engineer in the Facebook HBase Engineering team. For over two years, he has helped take HBase from concept to production on multiple critical & large-scale systems within Facebook. Additionally, Nicolas is an HBase committer and PMC member who has contributed many critical features such as HDFS data reliability, Bloom Filters, and compaction optimizations.

Otis Gospodnetić

Otis Gospodnetić is a coauthor of Lucene in Action (1st and 2nd edition). He has been involved with Lucene since 2000 and Solr since 2006. He is also a member of the Nutch and Mahout development teams, as well as the Lucene Project Management Committee. Otis is an Apache Software Foundation member and the founder of Sematext, a products and services company focused on Search & Analytics using Solr, ElasticSearch, Lucene, Hadoop, HBase, Flume, Mahout, and other open-source technologies to serve customers world-wide.

Owen O'Malley

Owen contributed patches to Hadoop before it became an independent Apache project. He was the first committer added and still remains one of the most active contributors to Apache Hadoop. He was also the founding chair of the Apache Hadoop Project Management Committee. Prior to co-founding Hortonworks, Owen worked on Yahoo! Search’s WebMap project, which built and performed heuristic analyses over a graph of the known web. Once ported to Apache Hadoop, it became the single largest known Hadoop application. He has a PhD in Software Engineering from the University of California, Irvine. Owen may be followed on Twitter: @owen_omalley.

Paolo Fragomeni

Co-Founder and CTO of Nodejitsu.

Pavlo Baron

Pavlo Baron is lead architect with codecentric AG. His passion is distributed systems and large data sets – the infrastructure behind what they call Big Data. Pavlo is a frequent conference speaker and has written three German books: “Erlang/OTP”, “Pragmatic IT Architecture” and “Fragile Agile”.

Peter Voss

Peter Voss is CTO at Datameer with extensive experience in software engineering and architecture of large-scale data processing. His focus has been largely on UNIX based enterprise systems with extensive background in Java, Spring, Hadoop, Lucene and Eclipse plug-in development.

Prior to Datameer, Peter consulted on a number of big data business intelligence projects with companies such as EMI Music and Krugle. Earlier, he was architect and developer for Deutsche Post and their ePost project, a distributed production system that processed more than 1 billion letters per year. Peter studied biology and has a Diploma (i.e., a Masters) in biochemistry and bioinformatics from the University of Köln.

Rafał Kuć

Author of Solr 3.1 Cookbook and co-founder of the https://solr.pl blog, where he tries to share his knowledge. Currently working as a search consultant and software engineer at Sematext. Mainly focused on Lucene, Solr, Elasticsearch, Hadoop and Mahout.

Ralf Neeb

CTO at GameDuell, biography will be added later

Robert Muir

Robert Muir is a Lucene/Solr PMC Member and Committer. He earned his BS in Computer Science from Radford University and an MS in Computer Science from Johns Hopkins University. Prior to Lucid Imagination, Robert worked for Ntrepid Corporation.

Sean Cribbs

Sean joined Basho Technologies in March 2010 after three years as a freelance web developer, in the course of which he worked on numerous startups and open-source projects, including the popular Radiant content management system, which is built with Ruby on Rails. Prior to that, he studied Music Theory and Composition at the graduate level and directed a chorus after receiving bachelor's degrees in Computer Science and Music from the University of Tulsa.

Sean brings his practical experience in building scalable web applications, passion for educating others, and broad theoretical knowledge of technical topics to the role of Software Engineer. He can often be found speaking at conferences and other events on Riak, and enjoys playing the piano in his free time.

Shay Banon

Founder, ElasticSearch.

Stefan Pohl

Stefan Pohl is a senior research engineer, currently working on Local Search and Discovery as part of the Nokia Maps offer. In the past, he has been working on several search-related projects in industry and academia, and his interests include analytics & data mining on large data, machine learning and information retrieval. He authored several publications and completed a dissertation on biomedical text search at the University of Melbourne, Australia.

Steffen Bickel

Steffen Bickel is principal engineer for local search and discovery at Nokia Berlin. His work centers around search relevance optimization, recommender systems, machine learning, and data analytics on large scale data. In a previous life, Steffen finished a PhD in machine learning and statistical prediction models.

Steve Loughran

Steve Loughran is a committer on Apache Hadoop

@steveloughran

(a more detailed Bio will be provided in April, once I have changed employers)

Ted Dunning

Ted Dunning has been involved with a number of startups with the latest being MapR Technologies where he is Chief Application Architect working on advanced Hadoop-related technologies. He is also a PMC member for the Apache Zookeeper and Mahout projects. Opinionated about software and data-mining and passionate about open source, he is an active participant of Hadoop and related communities and loves helping projects get going with new technologies.

You can find out more about his recent work at https://mapr.com

Torsten Curdt

Torsten Curdt is a senior software developer. Just out of university he successfully co-founded a company, which led him to contribute to open source. He was one of the core contributors of Apache Cocoon and also works on projects like Apache Commons. Today he is an elected member of the Apache Software Foundation, part of several PMCs and an active citizen of the general open source world. His professional development led him to work with large systems and big data as well as projects in the mobile space. Torsten is a published author and speaker at conferences.

Uwe Schindler

Uwe is a committer and PMC member of Apache Lucene and Solr. His main focus is on development of Lucene Java. He implemented fast numerical search and is maintaining the new attribute-based text analysis API. He studied Physics at the University of Erlangen-Nuremberg and works as managing director for SD DataSolutions GmbH in Bremen, Germany, a company that provides consulting and support for Apache Lucene and Solr. A primary customer of his company is “PANGAEA – Publishing Network for Geoscientific & Environmental Data” where he implemented the portal's geo-spatial retrieval functions with Lucene Java. Uwe has given talks about Lucene at various international conferences like the previous Berlin Buzzwords, Lucene Revolution, ApacheCon EU/US, Lucene Eurocon, and various local meetups.