
Closed
Paid on delivery
I’m streamlining a recurring data-driven workflow that lives entirely on a Unix box. The goal is to replace a patchwork of manual steps with a single Python-based utility that can:
• Pick up incoming files from predefined directories, validate them, and move them through an archive/processing path.
• Trigger the correct Hive queries to load, transform, or refresh tables, then write completion status back to a control table.
• Interact with other database systems when needed for look-ups or logging.
• Produce clear, rotating log files and raise Unix-level exit codes so the whole thing can be scheduled in cron or any enterprise scheduler.

Core requirements
– Written in Python 3, callable from the Unix shell.
– Uses native Hive connections (beeline/pyhive or similar) for all Hive operations.
– All paths, connection strings, and query text must be externalised in a simple config file so I can retarget environments without code edits.
– Idempotent: if the same run fires twice, it should recognise completed steps and skip or safely overwrite.
– Detailed inline comments plus a README that covers setup, dependency installation, and a sample cron entry.

Deliverables
1. Fully-tested source code and shell wrapper(s).
2. Config / template files and a quickstart README.
3. One brief hand-off call or document that walks through deploying it on my server.

I have direct access to the Unix environment and can supply sample files, current Hive DDL, and test tables as soon as we start. Let me know any libraries you expect to add so I can confirm they’re allowed on the host. Looking forward to seeing how you’d approach this.
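To make the config, logging, and exit-code requirements concrete, here is a minimal sketch using only the Python standard library. The section names (`paths`, `hive`, `logging`), the option names, and the exit-code values are assumptions chosen for illustration; the posting does not specify them.

```python
import configparser
import logging
import logging.handlers

# Hypothetical exit codes; the posting only asks for "Unix-level exit codes".
EXIT_OK = 0
EXIT_STEP_FAILED = 1
EXIT_CONFIG_ERROR = 2

# Assumed section names for the externalised config file.
REQUIRED_SECTIONS = ("paths", "hive", "logging")


def load_config(text):
    """Parse INI text and check that the assumed sections are present."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    missing = [s for s in REQUIRED_SECTIONS if not parser.has_section(s)]
    if missing:
        raise ValueError("missing config sections: " + ", ".join(missing))
    return parser


def build_logger(cfg, name="workflow"):
    """Attach a size-based rotating file handler configured from [logging]."""
    log_cfg = cfg["logging"]
    handler = logging.handlers.RotatingFileHandler(
        log_cfg["file"],
        maxBytes=log_cfg.getint("max_bytes", fallback=1_048_576),
        backupCount=log_cfg.getint("backup_count", fallback=5),
    )
    handler.setFormatter(
        logging.Formatter("%(asctime)s %(levelname)s %(message)s")
    )
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    # Drop handlers left over from an earlier call so lines are not duplicated.
    for old in list(logger.handlers):
        logger.removeHandler(old)
        old.close()
    logger.addHandler(handler)
    return logger


def run(config_text, steps):
    """Run each step in order and return an exit code for the scheduler."""
    try:
        cfg = load_config(config_text)
    except (ValueError, configparser.Error):
        return EXIT_CONFIG_ERROR
    logger = build_logger(cfg)
    for step in steps:
        try:
            step(cfg)
            logger.info("step %s completed", step.__name__)
        except Exception:
            logger.exception("step %s failed", step.__name__)
            return EXIT_STEP_FAILED
    return EXIT_OK
```

A shell wrapper or cron entry could then call a small entry point that passes the result of `run(...)` to `sys.exit`, so the scheduler sees a non-zero status whenever a step fails.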
Project ID: 40191586
10 proposals
Remote project
Active 2 mos ago
10 freelancers are bidding on average ₹12,450 INR for this job

Hi there, I’ve carefully reviewed your Unix Python Hive workflow automation project and am confident in delivering a seamless, efficient solution.
- Develop a robust Python 3 utility with shell-callable scripts for Unix
- Implement native Hive connections using pyhive or beeline, ensuring reliable query execution
- Externalize all configurations for paths, connections, and queries for easy environment targeting
- Build idempotent processing to handle duplicate runs gracefully
- Provide thorough inline comments, a detailed README, and rotation-capable logging with exit codes for cron integration

**Skills:**
✅ Python 3 development for Unix shell utilities
✅ Hive query execution with pyhive/beeline
✅ Linux environment automation and cron scheduling
✅ Config-driven architecture enabling flexible deployment
✅ Clear documentation and logging best practices

**Certificates:**
✅ Microsoft® Certified: MCSA | MCSE | MCT
✅ cPanel® & WHM Certified CWSA-2

I’m ready to start immediately and can ensure quick delivery of a fully-tested, well-documented utility plus a smooth hand-off session. Which libraries are you currently allowed to run on your Unix host, and do you have any preferred logging frameworks? Best regards,
₹12,500 INR in 7 days
6.7

This is a great fit for my experience building automation utilities on Unix with Python and data platforms like Hive. I’ll develop a Python 3 command-line tool that handles file intake, validation, archiving, and processing flow, then securely executes Hive queries using Beeline/PyHive. The script will also support lookups or logging to other databases as required, with all activity written to structured, rotating log files and proper Unix exit codes for scheduler integration. All paths, queries, and connection details will live in an external config file so you can switch environments without touching the code. The workflow will be idempotent, meaning repeat runs won’t duplicate work and will safely detect completed steps. You’ll receive clean, well-commented source code, shell wrappers, config templates, a clear README with setup steps and a sample cron entry, plus a short handoff walkthrough. I’m ready to start as soon as you share sample files and Hive details. Best regards,
₹18,000 INR in 2 days
5.7

Hello pavank152, I checked your project, and it looks interesting. This is something we already work on, so the requirements are clear from the start. We mainly work on Python, Linux, Data Processing, Software Architecture, Hive, Data Integration, and Automation. We focus on making things simple, reliable, and actually useful in real life, not overcomplicated. Let’s connect in chat and see if we’re a good fit for this. Best Regards, Ali nawaz
₹25,000 INR in 8 days
3.9

This is really about turning a fragile, manual Unix + Hive workflow into a single repeatable, scheduler-safe pipeline with full traceability. I’d deliver a Python CLI that manages file intake, validation, archiving, and idempotent processing. Hive loads and transforms would run via native connections with completion tracked in a control table. Config-driven environments, rotating logs, strict exit codes, and edge-case safety are built in from day one. I can begin by locking down the execution states and failure boundaries so work starts cleanly.
₹9,000 INR in 4 days
2.8

I can build this as a clean, production-ready Python 3 utility designed specifically for Unix scheduling and repeatable data workflows. I’ll consolidate the full flow (file pickup, validation, archiving, Hive execution, status tracking, and cross-DB lookups) into a single, idempotent command-line tool that behaves safely even if triggered twice. All environment-specific details (paths, Hive connections, queries, control tables) will be externalised in a simple config file so you can retarget environments without touching code. I’ll use native Hive access (beeline / PyHive as approved), implement structured, rotating logs, and return clear Unix exit codes for cron or enterprise schedulers. The delivery will include fully tested source code, shell wrappers, config templates, and a concise README covering setup, dependencies, and a sample cron entry. I’m comfortable working directly on Unix systems, coordinating quickly, and can walk you through deployment in a short hand-off so you’re confident running and maintaining it long-term.
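As one possible shape for the beeline route, the sketch below builds a beeline command line from config values and hands the process return code back to the caller. The helper names `build_beeline_command` and `run_hive_script` are hypothetical, and the JDBC URL and script path in the usage note are placeholders. It assumes `beeline` is on the `PATH` and that it exits non-zero when a script fails, which is worth confirming on the target host.

```python
import subprocess


def build_beeline_command(jdbc_url, script_path, user=None, hivevars=None):
    """Assemble a beeline invocation as an argument list (no shell quoting)."""
    cmd = ["beeline", "-u", jdbc_url]
    if user:
        cmd += ["-n", user]
    # --hivevar lets the externalised query text reference ${name} variables.
    for name, value in sorted((hivevars or {}).items()):
        cmd += ["--hivevar", f"{name}={value}"]
    cmd += ["-f", script_path]
    return cmd


def run_hive_script(cmd, runner=subprocess.run):
    """Run the command and return its exit status.

    `runner` is injectable so the function can be exercised without a
    Hive installation.
    """
    result = runner(cmd, capture_output=True, text=True)
    return result.returncode
```

For example, `build_beeline_command("jdbc:hive2://hive-host:10000/default", "load.hql", hivevars={"run_date": "2026-01-30"})` yields an argument list that `run_hive_script` can execute; passing a list rather than a shell string avoids quoting problems with values taken from the config file.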
₹14,000 INR in 7 days
0.0

Hi, This is a workflow I’m very comfortable with. I’ve built Unix-native, cron-driven Python utilities that orchestrate file intake, validation, Hive loads, control tables, and logging exactly in the way you’re describing.

How I’d approach this:
- A Python 3 CLI utility callable directly from the Unix shell (argparse-based).
- File pickup → validation → processing → archive flow with clear state transitions.
- Hive operations executed via PyHive or Beeline (depending on what’s allowed on your host).
- A control table in Hive to track execution status, enabling idempotent re-runs (safe to execute twice).
- Externalized configuration (paths, queries, connections) via a simple config file, so no code edits are needed between environments.
- Structured, rotating logs + meaningful Unix exit codes so it integrates cleanly with cron or enterprise schedulers.
- Optional integration with other databases for lookups or logging if required.

Deliverables:
- Fully tested Python source + optional shell wrapper.
- Config templates and a clear Quickstart README (dependencies, setup, cron example).
- Short handoff document or call to walk through deployment on your server.

I’m happy to review your existing Hive DDL and sample files upfront and will confirm all required libraries before starting. Let me know if you’d like me to propose the initial config structure or execution flow diagram. Best regards, Felipe
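The pickup → validation → processing → archive flow with an idempotency check could look roughly like the sketch below. To keep it runnable without a cluster, an in-memory SQLite table stands in for the Hive control table; that substitution, the `run_control` table layout, the "empty file is invalid" rule, and the `load_file` callback are all assumptions made for illustration.

```python
import hashlib
import sqlite3
from pathlib import Path


def open_control(db_path=":memory:"):
    """Open the stand-in control store (SQLite here, Hive in the real job)."""
    conn = sqlite3.connect(db_path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS run_control ("
        "file_name TEXT, checksum TEXT, status TEXT, "
        "PRIMARY KEY (file_name, checksum))"
    )
    return conn


def checksum(path):
    """Content hash, so a re-delivered identical file is recognised."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def already_done(conn, name, digest):
    row = conn.execute(
        "SELECT 1 FROM run_control "
        "WHERE file_name = ? AND checksum = ? AND status = 'COMPLETED'",
        (name, digest),
    ).fetchone()
    return row is not None


def process_incoming(conn, incoming_dir, archive_dir, load_file):
    """Process each file once; skip completed ones; leave invalid ones."""
    archive = Path(archive_dir)
    archive.mkdir(parents=True, exist_ok=True)
    summary = {"loaded": [], "skipped": [], "rejected": []}
    for path in sorted(Path(incoming_dir).iterdir()):
        if not path.is_file():
            continue
        digest = checksum(path)
        if already_done(conn, path.name, digest):
            summary["skipped"].append(path.name)
        elif path.stat().st_size == 0:
            # Hypothetical validation rule: empty files stay in place.
            summary["rejected"].append(path.name)
            continue
        else:
            load_file(path)  # e.g. trigger the Hive load for this file
            conn.execute(
                "INSERT OR REPLACE INTO run_control VALUES (?, ?, 'COMPLETED')",
                (path.name, digest),
            )
            conn.commit()
            summary["loaded"].append(path.name)
        # Overwrites an existing archived copy; assumes the same filesystem.
        path.replace(archive / path.name)
    return summary
```

Recording the status only after `load_file` returns means a crash mid-load leaves no `COMPLETED` row, so the next run retries that file instead of skipping it.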
₹10,000 INR in 7 days
0.0

I do this work fast and professionally. The work will be done within the time limit, and very clearly.
₹7,000 INR in 7 days
0.0

As a passionate and experienced Data Scientist, I bring a diverse skill set that is well-suited to your Unix Python Hive Workflow Automation project. I have honed my proficiency in Python, SQL, and Unix, which aligns with your project's core requirements. My familiarity with Unix shell scripting supports the seamless integration of the Python utility into your existing workflow. My past experience also includes writing highly efficient, idempotent code that optimizes operations and enhances overall productivity. Beyond programming languages, I have worked with databases such as PostgreSQL and Oracle, which translates well to a project that involves Hive and interaction with other database systems.
₹7,000 INR in 7 days
0.0

Mumbai, India
Member since Jan 30, 2026