lector

Charactar normalisation service that renders unicode confusables and send back the string via ocr and makes a judgement about profanity.

There is here a sample Postman collection.

To test it on your local machine just forward the service to your localhost and try the examples.

Current integration environment access is needed:

kubectl -n lector port-forward service/lector 8000:8000

Sample payload:

{"toCheck": "ꜰᴜᴄᴋ ᴍᴇ"}

Sample Response

{
    "ocr": {
        "string": "FUCK ME",
        "profan": true
    },
    "raw": {
        "string": "ꜰᴜᴄᴋ ᴍᴇ",
        "profan": false
    },
    "transcribed": {
        "string": "ꜰucĸ ʍᴇ",
        "profan": false
    }
}

Response struct:

type Response struct {
	Ocr struct {
		String string `json:"string"`
		Profan bool   `json:"profan"`
	} `json:"ocr"`
	Raw struct {
		String string `json:"string"`
		Profan bool   `json:"profan"`
	} `json:"raw"`
	Transcribed struct {
		String string `json:"string"`
		Profan bool   `json:"profan"`
	} `json:"transcribed"`
}

Confusbales in unicode are characters that look a like another one.

http://www.unicode.org/Public/security/latest/confusables.txt

If you like to try more sophisticated strings you can create one on your own here

One possible answer would be this lector service.

Credits to:

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.config		.config
charts/lector		charts/lector
controllers		controllers
img		img
routes		routes
.gitignore		.gitignore
Dockerfile		Dockerfile
Lector.postman_collection.json		Lector.postman_collection.json
README.md		README.md
go.mod		go.mod
go.sum		go.sum
main.go		main.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

lector

About

Releases

Packages

Contributors 2

Languages

hingstarne/lector

Folders and files

Latest commit

History

Repository files navigation

lector

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages