lazybear

API Reference

LazyBearFrame

The main object representing a lazy SQL query. It is immutable; every transformation returns a new LazyBearFrame.

Properties

columns

engine

Transformations

select(*items)

filter(predicate)

with_columns(*exprs, **named)

sort(by, *more_by, descending=False)

order_by(*keys)

limit(n)

join(other, on=None, *, left_on=None, right_on=None, how='inner', suffixes=('_x', '_y'))

group_by(*keys)

GroupedLazyBearFrame

A frame representing grouped data, returned by LazyBearFrame.group_by.

agg(**aggregations)

lf.group_by('department').agg(
    total_salary=('salary', 'sum'),
    avg_age=(col('age'), 'mean'),
    employee_count=('id', 'count')
)

Execution & Materialization

collect(limit=None, infer_schema_length=200)

to_arrow(limit=None)

collect_batches(chunk_size=10_000)

iter_rows(named=False, chunk_size=10_000)

explain()

I/O Helpers

write_parquet(file, chunk_size=None, start_index=0, **kwargs)

write_csv(file, chunk_size=None, **kwargs)

Advanced

to_select()

Scanning Functions: Create Temporary Tables on the Server

This has limited testing.

scan_table(table_name, engine, schema=None, lowercase=True)

scan_sql_query(query, engine, columns=None)

scan_df(df, engine, table_name=None)