ZenTables
transforms your pandas
DataFrames into beautiful, publishable tables in one line of code, which you can then transfer into Google Docs and other word processors with one click. Supercharge your workflow when you are writing papers and reports.
import zentables as zen
df.zen.pretty()
- Beautiful tables in one line
- Google Docs/Word ready in one click
- Descriptive statistics at varying levels of aggregation
- Control table aesthetics globally
- and many more to come....
Via pip
from PyPI:
pip install zentables
Via pip
from GitHub directly
pip install -U git+https://github.com/thepolicylab/ZenTables
First, import the package alongside pandas
:
import pandas as pd
import zentables as zen
Then, to format any DataFrame
, simply use:
df.zen.pretty()
And this is the result:
Click on the Copy Table
button to transfer the table to Google Docs and Word. Formatting will be preserved.
Results in Google Docs (Tested on Chrome, Firefox, and Safari):
Results in Microsoft Word:
ZenTables
provides two ways to control the aesthetics of the tables. You can use global settings to control the font and font size of the tables via:
zen.set_options(font_family="Times New Roman, serif", font_size=12)
Note: When font_size
is specified as an int
, it will be interpreted as points (pt
). All other CSS units are accepted as a str
.
Or you can override any global options by specifying font_family
and font_size
in zen.pretty()
method:
df.zen.pretty(font_family="Times New Roman, serif", font_size="12pt")
Both will result in a table that looks like this
We are working on adding more customization options in a future release.
Use df.zen.freq_tables()
to create simple frequency tables:
freq_table = df.zen.freq_table(
index=["Segment", "Region"],
columns=["Category"],
values="Order ID",
props="index",
totals=True,
subtotals=True,
totals_names="Total"
subtotals_names="Subtotal",
)
freq_table.zen.pretty() # You can also chain the methods
Use props
to control whether to add percentages of counts. When props
is not set (the default), no percentages will be added. You can also specify props
to calculate percentages over "index"
(rows), "columns"
, or "all"
(over the totals of the immediate top level).
Use totals
and subtotals
parameters to specify whether totals and subtotals will be added. Note that when props
is not None
, both totals
and subtotals
will be True
, and when subtotals
is set to True
, this will also override totals
settings to True
.
Additionally, you can control the names of the total and subtotal categories using totals_names
and subtotals_names
parameters.
Use df.zen.mean_sd_table()
to create descriptives with n, mean, and standard deviation:
mean_sd_table = df.zen.mean_sd_table(
index=["Segment", "Region"],
columns=["Category"],
values="Sales",
margins=True,
margins_name="All",
submargins=True,
submargins_name="All Regions",
)
mean_sd_table.zen.pretty() # You can also chain the methods
Similar to freq_tables
, you can use margins
and submargins
parameters to specify whether aggregations at the top and intermediate levels will be added. Additionally, you can control the names of the total and subtotal categories using margins_names
and submargins_names
parameters.
For all other types of tables, ZenTables
provides its own df.zen.pivot_table()
method:
mean_median_table = df.zen.pivot_table(
index=["Segment", "Region"],
columns=["Category"],
values="Sales",
aggfunc=["count", "mean", "median"],
margins=True,
margins_name="All",
submargins=True,
submargins_name="All Regions",
).rename( # rename columns
columns={
"count": "n",
"mean": "Mean",
"median": "Median",
}
)
mean_median_table.zen.pretty().format(precision=1) # Specify level of precision
There are two differences between this pivot_table()
method and the pandas
pivot_table
method. First, like mean_sd_table()
, it provides submargins
and submargins_names
for creating intermediate-level aggregations. Second, results are grouped by values
, columns
, and aggfuncs
, instead of aggfuncs
, values
, and columns
. This provides more readability than what the pandas
version provides.
-
df.zen.pretty()
returns a subclass ofpandas
Styler
, which means you can chain all other methods afterdf.style
.format()
in the previous section is an example. For more formatting options, please see this page inpandas
documentation -
All other methods in
ZenTables
returns a regularDataFrame
that can be modified further. -
The names of the index and columns are by default hidden. You can get them back by doing this:
df.zen.pretty().show_index_names().show_column_names()
- You can also disable the
Copy Table
button like this:
df.zen.pretty().hide_copy_button()
- More tests on compatibility with
Styler
inpandas
- More options for customization
- A theming system
- More to come...
Contributions are welcome, and they are greatly appreciated! If you have a new idea for a simple table that we should add, please submit an issue.
Principally written by Paul Xu at The Policy Lab. Other contributors:
- Kevin H. Wilson
- Edward Huh
- All the members of The Policy Lab at Brown University for their feedback
- The
sidetable
package for ideas and inspiration.