Tag Archives: ETL

Linked Data Manufacturing Tools


Grafter

"For the hard graft of linked data processing."

Grafter is a library, DSL and suite of tools for flexible, efficient ETL, data transformation and processing. Its primary use is converting tabular data formats into RDF Linked Data, but it is equally adept at conversions between tabular formats.

See the official Grafter website at grafter.org for more details.

For the Grafter rationale see our blog post: The hard graft of Linked Data ETL.

What plans are there for Grafter?

Grafter is currently in the early stages of development; however, we at Swirrl have been using it to transform significant amounts of data for our clients within government.

Grafter is currently an API and a small DSL for converting tabular data into Linked Data. However, we have ambitious plans to develop a suite of tools on top of it. These tools are planned to include:

  1. Command line tools for data processing.
  2. Import services to load pipelines and execute predefined data transformations.
  3. A Graphical ETL Tool to assist non-programmers in creating data transformation pipelines.

Development

Grafter is published to Clojars, the standard Clojure community repository.

To use the Grafter API, add the following to the dependencies in your Clojure project's project.clj file. For more details on how to do this, see the Leiningen build tool:

 [grafter/grafter "0.2-SNAPSHOT"]
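
As a rough illustration of where that dependency vector goes, here is a minimal project.clj sketch. The project name, version, description and the Clojure version shown are assumptions made for this example, not something Grafter prescribes; only the grafter/grafter coordinate comes from the text above.

```clojure
;; project.clj -- a minimal sketch; everything except the Grafter
;; coordinate is a hypothetical placeholder.
(defproject my-etl-project "0.1.0-SNAPSHOT"   ; hypothetical name and version
  :description "Example project that depends on Grafter"
  :dependencies [[org.clojure/clojure "1.6.0"]        ; assumed Clojure version
                 [grafter/grafter "0.2-SNAPSHOT"]])   ; coordinate from this post
```

Leiningen searches Clojars by default, so no extra repository configuration should be needed for the dependency to resolve.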

Release candidates are published as SNAPSHOT builds. Our first official release will be 0.2, which we hope to release in the coming weeks.

NOTE: We are currently following a MAJOR.MINOR.PATCH versioning scheme, but are anticipating significant breaking API changes between minor versions at least until we reach 1.0.0.

PATCH versions should be close to being backward compatible with previous MINOR versions.

Releases will be tagged with an appropriate tag indicating their MAJOR.MINOR.PATCH version.

License

Copyright © 2014 Swirrl IT Ltd.

Distributed under the Eclipse Public License version 1.0, the same as Clojure.

Informatica Developer at Pleasanton, CA – 1+ year Contract


Role: Informatica Developer
Location: Pleasanton, CA
Duration: 1+ year Contract

Note: Send your resume to kdinesh@prokarma.com or reach me at (402) 905 9212. Please share or like this post.

• Participate in managing code and configurations for multiple environments, the release management process, environment configurations and controls, and code integrity, working closely with the platform team
• Coordinate across teams, working closely with peers to ensure the appropriate focus and sense of urgency are applied to all production issues
• Work with third party suppliers and vendors for support, upgrades and implementations
• Serve as a mentor to junior level associates who will provide backup in your absence
• Provide assistance in root cause analysis for service interruptions
• Interface with and provide technical leadership to others in the IT division and business to address ongoing business needs
• Document the application infrastructure and teach/share with others as necessary
• Participate in projects: provide work estimates and partner with others to design and execute projects that enhance system availability or application capabilities

Informatica Architect at Pleasanton, CA – 1+ year Contract


Role: Informatica Architect
Location: Pleasanton, CA
Duration: 1+ year Contract

Note: Send your resume to kdinesh@prokarma.com or reach me at (402) 905 9212. Please share or like this post.

  • Architecture and technology as a core competency
  • Experience in Architecture Assessment, Capacity Planning, System Architecture Design for High Availability and Load Balancing/Grid Configuration of Informatica
  • Experience in implementing large Informatica solutions
  • Participation in projects that created Enterprise Data Integration Hubs using Informatica (across departments and subject areas)
  • Experience in architecting and implementing real time data integration solutions
  • Experience in parsing and integrating semi-structured/unstructured data sources
  • Excellent Documentation skills
  • Excellent Presentation skills
  • Excellent Client Facing skills

SQL Server DBA


Role: SQL Server DBA
Duration: 2 month contract with possible extension
Location: Houston, TX

Note: Interested candidates can send an updated resume to kdinesh@prokarma.com or reach me at (402) 905 9212. Please share or like this post.

Must Have

  • SQL Server 2005/2008/2008R2
  • T-SQL, stored procedures
  • Query Performance Optimization
  • DBA
  • Database backup/recovery
  • SSRS administration
  • SSIS administration
  • Clustering, SAN, Replication
  • SSIS ETL development
  • SSRS report development
  • Communication Skills

Big Data Integration for Amazon Web Services: Syncsort Takes Aim at Legacy ETL Market with Low Cost ETL Engine Running on Amazon EC2 Available on AWS Marketplace

