Skillman, NJ
609
CON_HIRE_IND CON_HIRE_W2
Open
Contract to Hire
11694
aegisoft
Web Crawler C#/C++ (11694)
C#, XML, HTML, C++
11-25-2009
Web Crawler C#/C++ - Skillman, NJ

Our client is the leading global provider of data, news and analytics. Their products and services provide real-time and archived financial and market data, pricing, trading, news and communications tools in a single, integrated package to corporations, news organizations, financial and legal professionals and individuals around the world.

The project is to build a .NET application that serves as the trainer component of a web crawler. The application will be used to identify data fields of interest on a web page and then to build extraction rules, written as XSLT stylesheets, that our back-end crawler can execute.
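
As a rough illustration of what "building extraction rules" could look like, the sketch below turns a list of selected fields into an XSLT 1.0 stylesheet. It is written in C++ with the standard library only and is not the client's actual design: the `FieldRule` struct, the function names, and the `<record>`/`<field>` output layout are all hypothetical. It also assumes the page has already been cleaned into well-formed XML (for example by Tidy) and that the XPath expressions match that cleaned markup, including any namespace it carries.

```cpp
#include <string>
#include <vector>

// Hypothetical description of one field selected in the trainer.
struct FieldRule {
    std::string name;   // label for the extracted value, e.g. "price"
    std::string xpath;  // XPath locating the value in the cleaned page
};

// Escape text so it is safe inside a double-quoted XML attribute value.
std::string escapeXmlAttribute(const std::string& text) {
    std::string out;
    out.reserve(text.size());
    for (char c : text) {
        switch (c) {
            case '&': out += "&amp;";  break;
            case '<': out += "&lt;";   break;
            case '>': out += "&gt;";   break;
            case '"': out += "&quot;"; break;
            default:  out += c;        break;
        }
    }
    return out;
}

// Build an XSLT 1.0 stylesheet that emits one <field> element per rule.
// Assumption: field names are plain identifiers without '{' or '}',
// which XSLT would otherwise treat as attribute value templates.
std::string buildExtractionStylesheet(const std::vector<FieldRule>& rules) {
    std::string xslt =
        "<?xml version=\"1.0\"?>\n"
        "<xsl:stylesheet version=\"1.0\"\n"
        "    xmlns:xsl=\"http://www.w3.org/1999/XSL/Transform\">\n"
        "  <xsl:output method=\"xml\" indent=\"yes\"/>\n"
        "  <xsl:template match=\"/\">\n"
        "    <record>\n";
    for (const FieldRule& rule : rules) {
        xslt += "      <field name=\"" + escapeXmlAttribute(rule.name) + "\">";
        xslt += "<xsl:value-of select=\"" + escapeXmlAttribute(rule.xpath) + "\"/>";
        xslt += "</field>\n";
    }
    xslt +=
        "    </record>\n"
        "  </xsl:template>\n"
        "</xsl:stylesheet>\n";
    return xslt;
}
```

Calling `buildExtractionStylesheet` with a rule such as `{"price", "//span[@id='price']"}` yields a stylesheet whose root template copies that node's text into a `<field name="price">` element. Any XSLT 1.0 processor could then run the stylesheet against the cleaned page.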

Required Skills:
The trainer uses .NET 2008, Infragistics, C#, the WebBrowser control, XML/XSLT libraries and the Tidy "C" library. Ideally, the candidate will have worked on a similar project before. At a minimum, the candidate must be very familiar with web technologies, processes, and features, such as:
C#
C/C++ a plus
HTML and the parsing of HTML (such as is done by browsers)
XML/XSD (validation)
XSLT (should know how to use variables, loops, multiple templates)
JavaScript
Extensive knowledge of design patterns, especially the ones related to GUIs
Must be able to write and maintain a test suite using .NET 2008 or NUnit
Experience with IE (COM), Firefox (XPCOM/XULRunner) or WebKit is a plus
Knowledge of C# managed/unmanaged memory is a plus
Windows networking skills, especially related to building/maintaining a proxy, are a plus
Knowledge of browser/web server interaction is a plus (cookies, POST requests, forms)
Knowledge of Infragistics forms is a plus
Aegistech, LLC.
14 Penn Plaza
Suite 806
New York, NY 10122