-
-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New practice exercise - Perceptron #678
Closed
Closed
Changes from all commits
Commits
Show all changes
30 commits
Select commit
Hold shift + click to select a range
3c62326
New practice exercise - Perceptron
depial 7b291e5
Merge remote-tracking branch 'origin/main' into depial-main
cmcaine 053cb63
Update runtests.jl
depial c821340
Update runtests.jl
depial a4bc759
Update runtests.jl
depial 185e88c
Delete exercises/practice/perceptron/testtools.jl
depial 475efe3
Update config.json
depial 4a24b2c
Update tests.toml
depial e4760e2
Update instructions.md
depial 39ba0fa
Update instructions.md
depial 735743f
Update instructions.md
depial 121e5fa
Update exercises/practice/perceptron/perceptron.jl
depial dd96c6c
Update exercises/practice/perceptron/.meta/example.jl
depial f7d054f
Merge branch 'exercism:main' into main
depial 7bcaf89
Update example.jl
depial 4ac9805
Update config.json
depial c8d226b
Update example.jl
depial fe7eb26
Update runtests.jl
depial bed28be
Update tests.toml
depial 9d9ce4b
Update config.json
depial 76f21f2
Create introduction.md
depial ff9ce68
Update instructions.md
depial 9c8193d
Update runtests.jl
depial 5ac2eec
Update runtests.jl
depial 7ec87e8
Update runtests.jl
depial 0af5de7
Merge branch 'exercism:main' into main
depial 233f618
Adding new practice exercise Binary Search Tree
depial f444359
config changes
depial d963209
Merge branch 'main' of https://github.com/depial/julia
depial c947630
Update config.json
depial File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
47 changes: 47 additions & 0 deletions
47
exercises/practice/binary-search-tree/.docs/instructions.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,47 @@ | ||
# Instructions | ||
|
||
Insert and search for numbers in a binary tree. | ||
|
||
When we need to represent sorted data, an array does not make a good data structure. | ||
|
||
Say we have the array `[1, 3, 4, 5]`, and we add 2 to it so it becomes `[1, 3, 4, 5, 2]`. | ||
Now we must sort the entire array again! | ||
We can improve on this by realizing that we only need to make space for the new item `[1, nil, 3, 4, 5]`, and then adding the item in the space we added. | ||
But this still requires us to shift many elements down by one. | ||
|
||
Binary Search Trees, however, can operate on sorted data much more efficiently. | ||
|
||
A binary search tree consists of a series of connected nodes. | ||
Each node contains a piece of data (e.g. the number 3), a variable named `left`, and a variable named `right`. | ||
The `left` and `right` variables point at `nil`, or other nodes. | ||
Since these other nodes in turn have other nodes beneath them, we say that the left and right variables are pointing at subtrees. | ||
All data in the left subtree is less than or equal to the current node's data, and all data in the right subtree is greater than the current node's data. | ||
|
||
For example, if we had a node containing the data 4, and we added the data 2, our tree would look like this: | ||
|
||
4 | ||
/ | ||
2 | ||
|
||
If we then added 6, it would look like this: | ||
|
||
4 | ||
/ \ | ||
2 6 | ||
|
||
If we then added 3, it would look like this | ||
|
||
4 | ||
/ \ | ||
2 6 | ||
\ | ||
3 | ||
|
||
And if we then added 1, 5, and 7, it would look like this | ||
|
||
4 | ||
/ \ | ||
/ \ | ||
2 6 | ||
/ \ / \ | ||
1 3 5 7 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,16 @@ | ||
{ | ||
"authors": [], | ||
"files": { | ||
"solution": [ | ||
"binary-search-tree.jl" | ||
], | ||
"test": [ | ||
"runtests.jl" | ||
], | ||
"example": [ | ||
".meta/example.jl" | ||
] | ||
}, | ||
"blurb": "Insert and search for numbers in a binary tree.", | ||
"source": "Josh Cheek" | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,38 @@ | ||
mutable struct BinarySearchTree | ||
data | ||
left | ||
right | ||
BinarySearchTree(node::T) where T<:Real = new(node, nothing, nothing) | ||
end | ||
|
||
function BinarySearchTree(vec::Vector{T}) where T<:Real | ||
tree = BinarySearchTree(popfirst!(vec)) | ||
foreach(node -> push!(tree, node), vec) | ||
tree | ||
end | ||
|
||
function Base.in(node, tree::BinarySearchTree) | ||
tree.data == node && return true | ||
if node ≤ tree.data | ||
isnothing(tree.left) ? false : in(node, tree.left) | ||
else | ||
isnothing(tree.right) ? false : in(node, tree.right) | ||
end | ||
end | ||
|
||
function Base.push!(tree::BinarySearchTree, node) | ||
if node ≤ tree.data | ||
isnothing(tree.left) ? (tree.left = BinarySearchTree(node)) : push!(tree.left, node) | ||
else | ||
isnothing(tree.right) ? (tree.right = BinarySearchTree(node)) : push!(tree.right, node) | ||
end | ||
tree | ||
end | ||
|
||
function traverse(tree::BinarySearchTree, channel::Channel) | ||
!isnothing(tree.left) && traverse(tree.left, channel) | ||
put!(channel, tree.data) | ||
!isnothing(tree.right) && traverse(tree.right, channel) | ||
end | ||
|
||
Base.sort(tree::BinarySearchTree) = collect(Channel(channel -> traverse(tree, channel))) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,40 @@ | ||
# This is an auto-generated file. | ||
# | ||
# Regenerating this file via `configlet sync` will: | ||
# - Recreate every `description` key/value pair | ||
# - Recreate every `reimplements` key/value pair, where they exist in problem-specifications | ||
# - Remove any `include = true` key/value pair (an omitted `include` key implies inclusion) | ||
# - Preserve any other key/value pair | ||
# | ||
# As user-added comments (using the # character) will be removed when this file | ||
# is regenerated, comments can be added via a `comment` key. | ||
|
||
[e9c93a78-c536-4750-a336-94583d23fafa] | ||
description = "data is retained" | ||
|
||
[7a95c9e8-69f6-476a-b0c4-4170cb3f7c91] | ||
description = "insert data at proper node -> smaller number at left node" | ||
|
||
[22b89499-9805-4703-a159-1a6e434c1585] | ||
description = "insert data at proper node -> same number at left node" | ||
|
||
[2e85fdde-77b1-41ed-b6ac-26ce6b663e34] | ||
description = "insert data at proper node -> greater number at right node" | ||
|
||
[dd898658-40ab-41d0-965e-7f145bf66e0b] | ||
description = "can create complex tree" | ||
|
||
[9e0c06ef-aeca-4202-b8e4-97f1ed057d56] | ||
description = "can sort data -> can sort single number" | ||
|
||
[425e6d07-fceb-4681-a4f4-e46920e380bb] | ||
description = "can sort data -> can sort if second number is smaller than first" | ||
|
||
[bd7532cc-6988-4259-bac8-1d50140079ab] | ||
description = "can sort data -> can sort if second number is same as first" | ||
|
||
[b6d1b3a5-9d79-44fd-9013-c83ca92ddd36] | ||
description = "can sort data -> can sort if second number is greater than first" | ||
|
||
[d00ec9bd-1288-4171-b968-d44d0808c1c8] | ||
description = "can sort data -> can sort complex tree" |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,2 @@ | ||
# Create a (Mutable) Struct BinarySearchTree, which has fields: data, left, right | ||
# Also write a sort method, which returns a sorted array of the elements in a BinarySearchTree | ||
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,85 @@ | ||
using Test | ||
include("binary-search-tree.jl") | ||
|
||
@testset "data is retained" begin | ||
tree = BinarySearchTree([4]) | ||
@test tree.data == 4 | ||
@test isnothing(tree.left) | ||
@test isnothing(tree.right) | ||
end | ||
|
||
@testset "insert data at proper node" begin | ||
@testset "smaller number at left node" begin | ||
tree = BinarySearchTree([4, 2]) | ||
@test tree.data == 4 | ||
@test tree.left.data == 2 | ||
@test isnothing(tree.left.left) | ||
@test isnothing(tree.left.right) | ||
@test isnothing(tree.right) | ||
end | ||
|
||
@testset "same number at left node" begin | ||
tree = BinarySearchTree([4, 4]) | ||
@test tree.data == 4 | ||
@test tree.left.data == 4 | ||
@test isnothing(tree.left.left) | ||
@test isnothing(tree.left.right) | ||
@test isnothing(tree.right) | ||
end | ||
|
||
@testset "greater number at right node" begin | ||
tree = BinarySearchTree([4, 5]) | ||
@test tree.data == 4 | ||
@test isnothing(tree.left) | ||
@test tree.right.data == 5 | ||
@test isnothing(tree.right.left) | ||
@test isnothing(tree.right.right) | ||
end | ||
end | ||
|
||
@testset "can create complex tree" begin | ||
tree = BinarySearchTree([4, 2, 6, 1, 3, 5, 7]) | ||
@test tree.data == 4 | ||
@test tree.left.data == 2 | ||
@test tree.left.left.data == 1 | ||
@test isnothing(tree.left.left.left) | ||
@test isnothing(tree.left.left.right) | ||
@test tree.left.right.data == 3 | ||
@test isnothing(tree.left.right.left) | ||
@test isnothing(tree.left.right.right) | ||
@test tree.right.data == 6 | ||
@test tree.right.left.data == 5 | ||
@test isnothing(tree.right.left.left) | ||
@test isnothing(tree.right.left.right) | ||
@test tree.right.right.data == 7 | ||
@test isnothing(tree.right.right.left) | ||
@test isnothing(tree.right.right.right) | ||
|
||
end | ||
|
||
@testset "can sort data" begin | ||
@testset "can sort single number" begin | ||
tree = BinarySearchTree([2]) | ||
@test sort(tree) == [2] | ||
end | ||
|
||
@testset "can sort if second number is smaller than first" begin | ||
tree = BinarySearchTree([2, 1]) | ||
@test sort(tree) == [1, 2] | ||
end | ||
|
||
@testset "can sort if second number is same as first" begin | ||
tree = BinarySearchTree([2, 2]) | ||
@test sort(tree) == [2, 2] | ||
end | ||
|
||
@testset "can sort if second number is greater than first" begin | ||
tree = BinarySearchTree([2, 3]) | ||
@test sort(tree) == [2, 3] | ||
end | ||
|
||
@testset "can sort complex tree" begin | ||
tree = BinarySearchTree([4, 2, 6, 1, 3, 5, 7]) | ||
@test sort(tree) == [1, 2, 3, 4, 5, 6, 7] | ||
end | ||
end |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,27 @@ | ||
# Instructions | ||
|
||
#### Updating | ||
Checking if an object is on one side of a hyperplane or another can be done by checking the normal vector which points to the object. The value will be positive, negative or zero, so all of the objects from a class should have normal vectors with the same sign. A zero value means the object is on the hyperplane, which we don't want to allow since its ambiguous. Checking the sign of a normal to a hyperplane might sound like it could be complicated, but it's actually quite easy. Simply plug in the coordinates for the object into the equation for the hyperplane and check the sign of the result. For example, we can look at two objects `v₁, v₂` in relation to the hyperplane `[w₀, w₁, w₂] = [1, 1, 1]`: | ||
|
||
`v₁ = [x₁, y₁] = [2, 2]` | ||
|
||
`w₀ + w₁⋅x₁ + w₂⋅y₁ = 1 + 1⋅2 + 1⋅2 = 5 > 0` | ||
|
||
|
||
`v₂ = [x₂, y₂] = [-2, -2]` | ||
|
||
`w₀ + w₁⋅x₂ + w₂⋅y₂ = 1 + 1⋅(-2) + 1⋅(-2) = -3 < 0` | ||
|
||
If `v₁` and `v₂` have the labels `1` and `-1` (like we will be using), then the hyperplane `[1, 1, 1]` is a valid decision boundary for them since the signs match. | ||
|
||
Now that we know how to tell which side of the hyperplane an object lies on, we can look at how perceptron updates a hyperplane. If an object is on the correct side of the hyperplane, no update is performed on the weights. However, if we find an object on the wrong side, the update rule for the weights is: | ||
|
||
`[w₀', w₁', w₂'] = [w₀ + 1ₗ, w₁ + x⋅1ₗ, w₂ + y⋅1ₗ]` | ||
|
||
Where `1ₗ ∈ {1, -1}`, according to the class of the object (i.e. its label), `x, y` are the coordinates of the object, the `wᵢ` are the weights of the hyperplane and the `wᵢ'` are the weights of the updated hyperplane. | ||
|
||
This update is repeated for each object in turn, and then the whole process repeated until there are no updates made to the hyperplane. All objects passing without an update means they have been successfully separated and you can return your decision boundary! | ||
|
||
Notes: | ||
- Although the perceptron algorithm is deterministic, a decision boundary depends on initialization and is not unique in general, so the tests accept any hyperplane which fully separates the objects. | ||
- The tests here will only include linearly separable classes, so a decision boundary will always be possible (i.e. no need to worry about non-separable classes). |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
# Introduction | ||
|
||
### Perceptron | ||
[Perceptron](https://en.wikipedia.org/wiki/Perceptron) is one of the oldest and bestestly named machine learning algorithms out there. Since it is also quite simple to implement, it's a favorite place to start a machine learning journey. Perceptron is what is known as a linear classifier, which means that, if we have two labled classes of objects, for example in 2D space, it will search for a line that can be drawn to separate them. If such a line exists, Perceptron is guaranteed to find one. See Perceptron in action separating black and white dots below! | ||
|
||
<p align="center"> | ||
<a title="Miquel Perelló Nieto, CC BY 4.0 <https://creativecommons.org/licenses/by/4.0>, via Wikimedia Commons" href="https://commons.wikimedia.org/wiki/File:Perceptron_training_without_bias.gif"><img width="512" alt="Perceptron training without bias" src="https://upload.wikimedia.org/wikipedia/commons/a/aa/Perceptron_training_without_bias.gif"></a> | ||
</p> | ||
|
||
### Details | ||
The basic idea is fairly straightforward. As illustrated above, we cycle through the objects and check if they are on the correct side of our guess at a line. If one is not, we make a correction and continue checking the objects against the corrected line. Eventually the line is adjusted to correctly separate all the objects and we have what is called a decision boundary! | ||
|
||
Why is this of any use? The decision boundary found can then help us in predicting what a new, unlabeled, object would likely be classified as by seeing which side of the boundary it is on. | ||
|
||
#### A Brief Word on Hyperplanes | ||
What we have been calling a line in 2D can be generalized to something called a [hyperplane](https://en.wikipedia.org/wiki/Hyperplane), which is a convenient representation, and, if you follow the classic Perceptron algorithm, you will have to pick an initial hyperplane to start with. How do you pick your starting hyperplane? It's up to you! Be creative! Or not... Actually perceptron's convergence times are sensitive to conditions such as the initial hyperplane and even the order the objects are looped through, so you might not want to go too wild. | ||
|
||
We will be playing in a two dimensional space, so our separating hyperplane will simply be a 1D line. You might remember the standard equation for a line as `y = ax+b`, where `a,b ∈ ℝ`, however, in order to help generalize the idea to higher dimensions, it's convenient to reformulate this equation as `w₀ + w₁x + w₂y = 0`. This is the form of the hyperplane we will be using, so your output should be `[w₀, w₁, w₂]`. In machine learning, the `{w₀, w₁, w₂}` are usually referred to as weights. | ||
|
||
Scaling a hyperplane by a positive value gives an equivalent hyperplane (e.g. `[w₀, w₁, w₂] ≈ [c⋅w₀, c⋅w₁, c⋅w₂]` with `c > 0`), since an infinite line scaled by a value is just the same infinite line. However, it should be noted that there is a difference between the normal vectors (the green arrow in illustration above) associated with hyperplanes scaled by a negative value (e.g. `[w₀, w₁, w₂]` vs `[-w₀, -w₁, -w₂]`) in that the normal to the negative hyperplane points in opposite direction of that of the positive one. By convention, the perceptron normal points towards the class defined as positive. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
{ | ||
"authors": [], | ||
"contributors": [ | ||
"cmcaine" | ||
], | ||
"files": { | ||
"solution": [ | ||
"perceptron.jl" | ||
], | ||
"test": [ | ||
"runtests.jl" | ||
], | ||
"example": [ | ||
".meta/example.jl" | ||
] | ||
}, | ||
"blurb": "Write your own machine learning algorithm" | ||
} |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
function classify(point, hyperplane) | ||
# Takes a single point and a hyperplane | ||
# Classifies which nomrmal of hyperplane is associated with the point | ||
# Returns 1 for positive normal, -1 for negative normal and 0 for a point on the hyperplane | ||
sign(hyperplane' * point) | ||
end | ||
|
||
function update(point, label, hyperplane) | ||
# Takes one point, its label and a hyperplane | ||
# Updates the hyperplane conditional on classification not matching the label | ||
# Returns a vector, the Perceptron updated hyperplane | ||
hyperplane + (classify(point, hyperplane) != label) * label * point | ||
end | ||
|
||
function step(points, labels, hyperplane) | ||
# Takes a vector of points, a vector of their associated labels and a hyperplane | ||
# Iteratively updates the hyperplane for each point/label pair | ||
# Returns a tuple: (true/false, decision boundary/hyperplane) for valid/invalid decision boundary, respectively | ||
decisionboundary = hyperplane | ||
foreach(i -> hyperplane = update(points[i], labels[i], hyperplane), eachindex(points)) | ||
decisionboundary == hyperplane, hyperplane | ||
end | ||
|
||
function perceptron(points, labels) | ||
# Takes a vector of linearly separable points and a vector of their associated labels | ||
# Performs steps of the Perceptron algorithm until a valid decision boundary is found | ||
# Returns a vector, a valid decision boundary for the provided population of labeled points | ||
hyperplane, points = [0, 0, 0], vcat.(1, points) | ||
while true | ||
isdecisionboundary, hyperplane = step(points, labels, hyperplane) | ||
isdecisionboundary && return hyperplane | ||
end | ||
end |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
[b8feac03-a063-44c9-8867-330cce110e6f] | ||
description = "Boundary is a vector of three weights" | ||
|
||
[7220e861-e8f6-4d5f-b45a-8750e5146010] | ||
description = "Weights are Real numbers" | ||
|
||
[728853d3-24de-4855-a452-6520b67dec23] | ||
description = "Initial population" | ||
|
||
[ed5bf871-3923-47ca-8346-5d640f9069a0] | ||
description = "Initial population w/ opposite labels" | ||
|
||
[15a9860e-f9be-46b1-86b2-989bd878c8a5] | ||
description = "Decision boundary cannot pass through origin" | ||
|
||
[52ba77fc-8983-4429-91dc-e64b2f625484] | ||
description = "Decision boundary nearly parallel with y-axis" | ||
|
||
[3e758bbd-5f72-447d-999f-cfa60b27bc26] | ||
description = "Increasing Populations" |
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It seems that this should be sentences in a comment, but surely the EOL is important as well for the last line of this stub.