Pandas Data Manipulation

Data cleaning and transformation with Pandas.

Filtering Data

Filter by condition
df[df["age"] > 25]

Multiple conditions
df[(df["age"] > 25) & (df["city"] == "NYC")]

Filter by list
df[df["name"].isin(["John", "Jane"])]

Missing Data

Check for nulls
df.isnull()
df.isnull().sum()

Drop missing values
df.dropna()
df.dropna(subset=["column"])

Fill missing values
df.fillna(0)
df["age"].fillna(df["age"].mean())

Grouping & Aggregation

Group by
df.groupby("city").mean()
df.groupby("city")["age"].sum()

Multiple aggregations
df.groupby("city").agg({
  "age": ["mean", "max"],
  "salary": "sum"
})

Merging & Joining

Merge DataFrames
pd.merge(df1, df2, on="id")

Different join types
pd.merge(df1, df2, how="left")
pd.merge(df1, df2, how="outer")

Concatenate
pd.concat([df1, df2])
pd.concat([df1, df2], axis=1)