
Đã hoàn thành
Đã đăng vào
Thanh toán khi bàn giao
Project Overview: I am looking for an experienced data scraper/engineer to extract 10 years of DAILY historical power grid data for South Africa (1 January 2014 - 31 December 2024). Specifically, I need the Unplanned Capability Loss Factor (UCLF) and Planned Capability Loss Factor (PCLF) data from Eskom (the national utility). CRITICAL NOTE: Do not use the EskomSePush (ESP) API. ESP only provides consumer-facing loadshedding schedules. I need the actual macro-generation MW (Megawatt) breakdown of power plant failures. Data Sources to use: The official Eskom Data Portal ([login to view URL] - under "Outages" and "Supply Side"). Alternative public archives: Because the official portal sometimes restricts downloads to a 5-year rolling window, you may need to pull from open-source community SQLite databases (e.g., [login to view URL] or GitHub repositories tracking Eskom data) or CSIR energy publications to get the full 2014-2024 timeline. Specific Data Points Required (Hourly extraction, aggregated to Daily): Unplanned Outages (MW) Planned Outages (MW) Total Installed Capacity (MW) / RSA Contracted Demand Calculations & Output Formatting: The raw data is usually reported in hourly Megawatts (MW). I need you to calculate the percentages and provide a clean, daily CSV file with the following columns: Date (Format: YYYY-MM-DD) Daily_Avg_UCLF_Percentage: (Average Hourly Unplanned MW / Total Installed MW) * 100 Daily_Max_UCLF_Percentage: The highest UCLF percentage recorded that day. UCLF_at_1700_SAST: The specific UCLF percentage at exactly 17:00 South African Standard Time (Market Close). Daily_Avg_PCLF_Percentage: (Average Hourly Planned MW / Total Installed MW) * 100 Deliverables: A single, clean CSV or Excel file containing the 2014-2024 DAILY data. A brief text file or README explaining exactly where the data was sourced from and how missing values (if any) were handled. The Python/scraper script used to generate the data (for my own reproducibility records)
Mã dự án: 40301054
27 đề xuất
Dự án từ xa
Hoạt động 1 tháng trước
Thiết lập ngân sách và thời gian
Nhận thanh toán cho công việc
Phác thảo đề xuất của bạn
Miễn phí đăng ký và cháo giá cho công việc

With my comprehensive skills in data analysis, extraction, and mining, particularly in Python, I am confident that I can deliver precisely what you're looking for. My ability to masterfully scrape data from multiple sources, including deep web digging, will ensure that we have a complete dataset spanning the 10 year timeframe that you need. Additionally, my thorough understanding of Eskom's official data portal and alternative archives like [login to view URL] and CSIR energy publication guarantees that the extraction process will be as efficient as possible. I'll provide a single clean CSV or Excel file with all the specified data points in the given format along with a README document explaining how I handled any missing values. As someone who believes in reproducibility and transparency, I will also furnish you with the Python/scraper script used to generate the data. More than just delivering on the task at hand, I am dedicated to ensuring that my clients have all the necessary tools for future use. My overall objective is to provide you with a premium client experience and accurate data that meets your needs. Let me handle this project for you and prove why I am ranked among the top 1% of freelancers.
$140 USD trong 7 ngày
6,4
6,4
27 freelancer chào giá trung bình $128 USD cho công việc này

⭐⭐⭐⭐⭐ Extract Historical Power Grid Data for South Africa Efficiently ❇️ Hi My Friend, I hope you are doing well. I've reviewed your project requirements and see you are looking for a data scraper to extract 10 years of historical power grid data for South Africa. Look no further; Zohaib is here to help you! My team has successfully completed over 50 similar projects in data scraping. I will use the official Eskom Data Portal and alternative sources to gather the required data efficiently. ➡️ Why Me? I can easily extract the UCLF and PCLF data as I have 5 years of experience in data scraping, data analysis, and Python programming. My expertise includes web scraping, data manipulation, and creating clean datasets. Additionally, I have a strong grip on handling different data sources and ensuring accuracy in my outputs. ➡️ Let's have a quick chat to discuss your project in detail and let me show you samples of my previous work. I look forward to discussing this with you in our chat. ➡️ Skills & Experience: ✅ Data Scraping ✅ Python Programming ✅ Data Analysis ✅ CSV File Creation ✅ Web Data Extraction ✅ Data Cleaning ✅ API Integration ✅ SQL Databases ✅ Data Visualization ✅ Data Processing ✅ Error Handling ✅ Documentation Writing Waiting for your response! Best Regards, Zohaib
$150 USD trong 2 ngày
8,0
8,0

Hello Sir, How would you like to gain access to 10 years of critical historical power grid data without any upfront commitment? I specialize in extracting and processing large datasets efficiently, ensuring you receive accurate and well-structured information tailored to your needs. Let's connect to discuss how I can help you retrieve and format this essential data seamlessly. Best, Smith
$140 USD trong 7 ngày
7,1
7,1

Hi There, Ready right now I'm ready to Historical Power Grid Data Extraction. I will show you sample for your satisfaction and project accuracy then we will go to start, so please contact me and share more details thanks. Check My Profile: https://www.freelancer.pk/u/WelcomeClient I would like to work on this project and can complete with 100% accuracy within the time frame. https://www.freelancer.pk/projects/excel/business-profit-loss-reporting-excel/reviews https://www.freelancer.pk/projects/data-entry/copy-listings-from-website-another/reviews Thanks, Umer
$40 USD trong 1 ngày
6,8
6,8

Hi! I specialize in data scraping and energy dataset engineering with 9+ years of experience extracting and transforming large historical datasets. I can collect Eskom power grid outage data and produce a clean daily dataset for 2014–2024 with the exact UCLF/PCLF calculations you need. Here's how I can help: * Extract hourly outage and capacity data from the Eskom Data Portal and archival sources * Aggregate hourly MW data into daily metrics including Avg/Max UCLF and 17:00 SAST values * Clean, validate, and structure the dataset into a well-formatted CSV/Excel file * Provide the Python scraping/processing script plus a README documenting sources and handling of missing data Do you prefer the final dataset strictly in CSV, or both CSV and Excel for easier analysis?
$200 USD trong 7 ngày
6,6
6,6

As an experienced full-stack developer with a strong background in database technologies, I believe I am the right fit for your South African power grid data extraction project. My expertise in Python, SQL, and statistical techniques will enable me to effectively extract and manipulate the data points you require. In fact, I have previously undertaken similar projects involving large dataset parsing from diverse sources. Additionally, my proficiency in data visualization using tools like SPSS, Tableau, and Excel will be an invaluable asset to present the extracted data in a clear and understandable manner. This is crucial given the particular formatting you require, especially calculating percentages and aggregating hourly data into daily records.
$140 USD trong 7 ngày
5,6
5,6

Greetings, I see that you’re looking for someone to extract 10 years of daily historical power grid data from Eskom, specifically the Unplanned and Planned Capability Loss Factor. This involves not just scraping the data but also calculating the percentages and formatting it into a clean CSV file. My approach would be to utilize the official Eskom Data Portal as the primary source while ensuring I gather the complete data range using alternative public archives when necessary. With my experience in Python and web scraping, I can efficiently handle the data extraction and processing to meet your specific needs. I’ll also provide a detailed README to clarify the data sources and any missing values. Looking forward to helping you with this project. Best regards, Saba Ehsan
$80 USD trong 4 ngày
5,4
5,4

Hi, Lets get connect over a chat. I have more than 9 years of experience in building custom platforms in python. I will walk through to my work samples as well. I am online right now. Thanks Ali
$140 USD trong 2 ngày
5,4
5,4

Hello, I can deliver what you need. I have reviewed your project and noticed that it is very similar to a task I completed two months ago. I am an experienced and specialized freelancer with 6+ years of practical experience in Python, Web Scraping and I’m able to complete and deliver this project promptly. You can visit my profile to check my latest work and recent reviews. Connect in chat to discuss details and next steps. Regards.
$250 USD trong 7 ngày
5,1
5,1

Dedicated Freelancer Ready to Elevate Your Project for Historical Power Grid Data Extraction: South Africa. I have a solid background in Data Visualization, Data Processing, Data Management, Data Scraping, Data Analysis, Software Architecture, Python, Web Scraping, Data Extraction and Data Mining, I bring valuable expertise to your project. I have successfully completed many projects with 100% client satisfaction. Clear and timely communication is my priority. I believe in keeping you informed throughout the project lifecycle. I am available for a discussion at your earliest convenience. Please feel free to contact me to further discuss your project details. Thank you for considering my bid. I am excited about the opportunity to contribute to the success of your project. Please visit my portfolio to check my previous work samples, here - https://www.freelancer.com/u/GraphicsHub2k24?page=portfolio&w=f&ngsw-bypass= Best regards, Muhammad Asim Khan
$30 USD trong 1 ngày
4,4
4,4

********** For sample of the output CSV inbox me ************** I can deliver the full 2014-2024 South Africa grid history in a reproducible way, with both the cleaned daily dataset and the extraction script. I would build a Python pipeline that first pulls from Eskom’s official Data Portal endpoints/pages for Outages and Supply Side, then backfills older gaps from archived public datasets/community mirrors where the portal’s rolling window blocks access. After collection, I will normalize timestamps to SAST, align hourly Unplanned MW, Planned MW, and Installed Capacity, then compute Daily_Avg_UCLF %, Daily_Max_UCLF %, exact UCLF at 17:00 SAST, and Daily_Avg_PCLF %. My approach is not just scraping tables - I will add schema validation, duplicate-hour checks, source-priority rules, and gap detection so the final CSV is audit-ready. I will also provide a README documenting every source used, assumptions, backfill logic, and how missing hours/days were treated. The script will be clean Python with pandas plus archival/source adapters so you can rerun it later without manual work. Deliverables: • Daily CSV/XLSX for 2014-01-01 to 2024-12-31 • Python scraper/processing script • README with methodology and data lineage Timeline: 4 days Budget: $80 I have strong experience with messy historical data extraction, time-series normalization, and reproducible ETL workflows, so I can make this both accurate and easy to verify.
$65 USD trong 4 ngày
4,4
4,4

Hello client, I can extract and process the 10 years of Eskom historical power grid data (2014–2024), calculate the required UCLF and PCLF metrics, and deliver a clean daily CSV/Excel dataset. I will collect the hourly MW data from the Eskom Data Portal and reliable public archives, then compute daily averages, maximum values, and the 17:00 SAST metric with a reproducible Python scraping and processing script. You will receive the final dataset, the Python script, and a README explaining the data sources and handling of any missing values. Looking forward your response. Thank you.
$140 USD trong 7 ngày
4,0
4,0

Hello, I am an experienced data engineer specializing in extracting and analyzing complex energy datasets. I will precisely scrape and compile the 10-year daily power grid data for South Africa, focusing on UCLF and PCLF from Eskom, ensuring accuracy and compliance with your requirements. Could you clarify if there are any specific open-source repositories or archives you'd prefer I prioritize, or should I explore all available sources for completeness? Thanks, Juan Aponte
$180 USD trong 5 ngày
2,5
2,5

Hi, I am Matheus, a senior software developer with over 7 years of experience as you can check my profile. I am a senior engineer with over 7 year of experience on Python, Data Processing, Web Scraping, Software Architecture, Data Mining, Data Scraping, Data Extraction, Data Visualization, Data Analysis, Data Management. Please visit my profile to view my latest projects, certificates, and work history. Let's connect in chat to discuss more. Thank you, Matheus
$30 USD trong 7 ngày
2,2
2,2

Hello, With over 9 years of experience in Python, Software Architecture, and Data Management, I am well-equipped to handle your project extracting historical power grid data for South Africa. I understand your requirement for extracting 10 years of DAILY historical power grid data from Eskom, specifically focusing on Unplanned Capability Loss Factor (UCLF) and Planned Capability Loss Factor (PCLF) data. I will utilize the official Eskom Data Portal and alternative public archives to extract the necessary data points on a daily basis, aggregating them into a clean CSV file with the required columns. Throughout the project, I will ensure effective communication to provide you with a professional solution according to your project description. I am excited about the opportunity to work on this project and look forward to hearing from you. Thanks.
$250 USD trong 7 ngày
0,0
0,0

County of Sussex, South Africa
Phương thức thanh toán đã xác thực
Thành viên từ thg 1 5, 2019
$10-30 USD
$10-30 USD
$10-30 USD
$30-250 USD
$30-250 USD
₹1500-12500 INR
₹1500-12500 INR
₹750-1250 INR/ giờ
$15-25 USD/ giờ
$1500-3000 USD
$250-750 USD
$8-20 USD/ giờ
€30-250 EUR
$10-30 USD
$750-1500 USD
€12-18 EUR/ giờ
₹1500-12500 INR
₹1500-12500 INR
₹600-1500 INR
₹800-1000 INR
$15-25 USD/ giờ
€12-18 EUR/ giờ
€8-30 EUR
$250-750 USD
$1500-3000 USD