pvmehta.com


Read CSV File using Python

Posted on 30-Sep-2023 By Admin
# Databricks notebook source
import pandas as pd

# prepare dataframe (the first row of the file is used as the header by default)
df_h=pd.read_csv("https://people.sc.fsu.edu/~jburkardt/data/csv/zillow.csv")

#Print dataframe
df_h

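#define custom columns
#header=0 replaces the file's own header row with these names;
#header=None would load that header row as the first data row instead
columns = ["Srno", "SQFT", "BR", "Baths", "Zipcode", "YearBuilt", "ListPrice"]
df_h=pd.read_csv("https://people.sc.fsu.edu/~jburkardt/data/csv/zillow.csv",
                 header=0,
                 names=columns)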

#Print dataframe
df_h

# show top 10 rows
df_h.head(10)

# show last 10 rows
df_h.tail(10)

#Print dataframe
df_h

#Drop unneeded columns; column names are case sensitive
df_d=df_h.drop(["SQFT", "Srno"], axis=1)

#Print dataframe
df_d

# Select specific columns
df_s = df_d[["BR", "ListPrice"]]

#Print dataframe
df_s
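
The steps above can also be tried without network access. The following is a minimal sketch, assuming a small inline sample that mimics the layout of the downloaded file (a header row followed by numeric rows). The sample rows and the names SAMPLE_CSV and df_raw are made up for illustration and are not part of the original notebook. It shows how header=0 and header=None differ when custom column names are supplied, and then repeats the drop and select steps.

```python
import io

import pandas as pd

# Hypothetical sample data with a header row, standing in for the downloaded file.
SAMPLE_CSV = """Index,Living Space (sq ft),Beds,Baths,Zip,Year,List Price ($)
1,2222,3,3.5,32312,1981,250000
2,1628,3,2,32308,2009,185000
3,3824,5,4,32312,1954,399000
"""

columns = ["Srno", "SQFT", "BR", "Baths", "Zipcode", "YearBuilt", "ListPrice"]

# header=0: the file's own header row is discarded and the custom names are applied.
df_h = pd.read_csv(io.StringIO(SAMPLE_CSV), header=0, names=columns)

# header=None: the header row is kept as the first data row, so every column
# ends up holding strings instead of numbers.
df_raw = pd.read_csv(io.StringIO(SAMPLE_CSV), header=None, names=columns)

# Drop unneeded columns, then keep only the columns of interest.
df_d = df_h.drop(columns=["SQFT", "Srno"])
df_s = df_d[["BR", "ListPrice"]]

print(len(df_h), len(df_raw))
print(df_s)
```

Passing columns=[...] to drop is equivalent to passing the list with axis=1. Comparing the row counts of df_h and df_raw is a quick way to check whether a header row slipped into the data.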


Python/PySpark

Copyright © 2026 pvmehta.com.