Welcome to Rilhia.com!

Welcome to Rilhia.com! This website focuses on Data Integration and Application Integration using Talend technologies. Living in a new "age of context" it is imperative that all businesses make the most out of all of their data. Big businesses have been making data work for them for some time using expensive tools and consultants to maximise profits, save time, reveal new revenue streams and free up man hours for creativity. Now with the emergence of Open Source tools in this domain like Talend's applications, it is the time for small businesses and individuals to make their data work for them.

The purpose of Rilhia.com is as a resource for Talend solutions/tutorials/discussions that are not your usual, often somewhat dry, business oriented examples. I believe that the best way to learn a new technology is to find a personal use for it. ETL and ESB don't immediately appear to be areas in which there are many personal uses. However, in our data, Internet Of Things (IOT), social media driven world, it is clear to me that ETL and ESB can potentially be as useful at home as they are at work. Forget your "Hello World" examples/tutorials and think more along the lines of linking Twitter to your Sonos speakers. Learn by creating something useful at home and take the lessons you have learnt back to the office.

If you have any questions, any recommendations, corrections to content, would like your site linked to from here, or any other reason to get in touch, please use the contact form.

 

Google Home + API.AI + Talend = Awesome!!

I had a bit of free time over the Christmas holidays and realised that I have left this site a little idle for a while. I have been very fortunate in that I have had a lot of Talend work over the last year, but with that comes less time to update this site. However I decided that I would use Christmas to build a new tutorial based on something new, interesting and fun.....but what? Then my sister bought me a Google Home device for Christmas!!

Load Data into a Dynamic Number of Files

A question that I have seen multiple times on forums and have been asked several times while I have been on site somewhere, is "How can I load data to a dynamic number of files where I don't know the total number?". When I first heard this I have to admit I thought that the person was just trying to pick holes in what Talend can do. Why would you want to load to files that you have no idea about? But then it occurred to me, with Big Data we are seeing a resurgence of flat file data storage/usage. Maybe there is a requirement to split files into smaller chunks by a key within those files?

Load XML files in batches of records

There are many things that Talend is great at doing with XML....there are also many things that it is not so great at. One such example of something that Talend is not so great at is loading multiple XML files with batches of data. Say for example you have 100 records and you want to load multiple records into single XML files, but have a limit on the number of records per file? How would you do that with the tXMLMap component.

Quick Tip: Row Multiplication

Sometimes we need to multiply rows based on a column in that row, multiple columns in that row, or by some other factor dependent on the row. In Talend there is a really nice mechanism that allows this to be done quite simply....but it is not very well known about. This QuickTip shows how to use this functionality with a very basic example.

Lets say we have some data which has a numeric column indicating how many times that row should be multiplied.  For example....

Using Neural Networks with Talend DI and ESB

Many times during Data Integration projects we have situations where we have to analyse the data in order to come up with acceptance criteria for it. In a lot of cases, this is pretty straight forward and can be easily written into simple rule based logic. But in some situations, it is not so cut and dry. In these situations a lot of people will generate rule of thumb logic which will isolate certain rows to be double checked by a human. This works. It is time consuming and requires human intervention, but it works.

Using a third party Java library to scrape the content of a table on a web page

Recently I was contacted by a visitor to this site who asked me to put together a tutorial on using Talend for web data crawling purposes. This interested me as I have myself come across situations where I have used other software to scrape websites for data (links, pictures, emails addresses). While it is not difficult to find software to do this, it usually comes with a cost or is very limited in what you can do. After a few minutes of Googling, I came across several Java libraries which offered this functionality.

Dynamic column order

This tutorial was inspired by questions I get asked a lot when out on sites, I have also seen it asked on forums. The question is "Is there a way to deal with files that have the same columns, but in different orders?" or "Can I idnetify the column order from the header row?". There are several ways in which this can be achieved and this is just one of them. Some ways may required much more complicated logic and maybe a bit of Java. This way makes use of the tMap component and the ordered processing of variables in that component.

Using UPnP enabled devices with Talend - Control Sonos Speakers

In the last tutorial published on here, we discussed using Talend with a UPnP device by Belkin. The tutorial looked at how to discover UPnP devices and how to use the device's UPnP description xml to work out how to use the actions available. The tutorial can be found here and will be useful to keep in mind before looking at this approach.

Using UPnP enabled devices with Talend - A Belkin WEMO Switch

Universal plug and play (UPnP) devices are ubiquitous these days. More and more homes are filling up with devices that make use of UPnP functionality and this opens lots of doors for Talend users to derive more functionality from connecting these devices. In my home I have the following devices which make use of UPnP protocols....

Samsung Smart TV
Belkin WEMO Switch
Belkin WEMO Sensors
Philips HUE lighting
SONOS Speakers
BT Home Hub Router