smunger

munger for GWAS summary statistics

Features

data munging

EA ≠ NEA
if EAF presents, MAF = min(EAF, 1-EAF)
convert OR/ORSE to BETA/SE, if BETA, SE are absent and OR, ORSE are present
remove duplicate SNPs with same chr-bp-sorted(EA,NEA), keep the one with lowest P
output: \t separated, bgzip compress, tabix index.
optional output: significant SNPs, munge report

	CHR	BP	rsID	EA	NEA	EAF	MAF	BETA	SE	P	OR	OR_SE	Z
type	int	int	str	str	str	float	float	float	float	float	float	float	float
allow null	False	False	True	False	False	False	False	True	False	True	True	False	True
null value								0		0.999	1		0
range	[1，23]	(0,inf)		only contains ‘ACGT’	only contains ‘ACGT’	[0,1]	[0,0.5]	(-inf,inf)	(0, inf)	(0,1)	(0, inf)	(0, inf)	(-inf,inf)

This package was created with Cookiecutter and the waynerv/cookiecutter-pypackage project template.

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
.conda		.conda
.github		.github
docs		docs
smunger		smunger
tests		tests
.bumpversion.cfg		.bumpversion.cfg
.editorconfig		.editorconfig
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
build_dbsnp_tabix.py		build_dbsnp_tabix.py
environment.yml		environment.yml
makefile		makefile
mkdocs.yml		mkdocs.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg