Decide if we will ever return null from a function. #10

lbergelson · 2018-07-06T17:27:26Z

How do we feel about returning nulls? It's gross, but I'm not totally convinced that optionals are actually less gross in java. Optional has the benefit of documenting the fact that something may not be present where null can be a surprise. The have the downside that it's wrapping the element in a new object which is potentially expensive for critical things.

I think we have 3 options:

Allow methods to return null. This is error prone in some ways, but it's familiar, it's not very verbose, and null checks are very inexpensive.
Don't allow methods to return null, instead throw and have hasX methods alongside getX methods. This has the advantage of avoiding NPE but is similarly error prone if people forget to check the hasX. It could also be complicated to efficiently implement hasX if getX is something other than returning a field.
Use Optional as a return value instead of nulls. This is nicer in a lot of ways, but it's a big change, the syntax in java is only marginally less verbose than null pointer checks, and it introduces additional boxing overhead for all calls to the method, which is likely much more expensive than null checks. If java supported value types this would be much more appealing since the overhead could be avoided.

The text was updated successfully, but these errors were encountered:

magicDGS · 2018-07-07T10:32:47Z

I prefer to never return nulls, so either 2 or 3 fits that criteria. But in my opinion, Optional is the best option at least for some cases:

For getters that can return missing values, such as variant attributes (FORMAT/INFO), it is kind of interesting to distiguish between them. Optional is great for it: a missing value is returned as the string representation (.) or special value (-1?), and completely undefined as an Optional.empty()
Methods returning a boxed primitive (e.g., int) that can be null can return an Optional instead, because there are implementations storing the primitive (e.g. OptionalInt) that would have teh same performance as boxing (e.g., Integer).

I suggest never return null, but have a mixed approach depending on the case: for example, the previous cases are better represented by returning an Optional, but cases where it would become a performance issue can use the hasX/getX implementation. On the other hand, there are values where missing can be represented in a different way (e.g., getContig() can return an empty string if there is no contig or the special * to represent it; better than a hasX method, it can be directly compared with == by forcing the static object to be returned without copying).

Does this mixed strategy works for you?

tfenne · 2018-07-10T12:43:13Z

I'm nearly in agreement with @magicDGS, but not quite. My view is that null is parallel to exception. What I mean is that exceptions should be used for exceptional conditions and not expected results. Similarly for null, I think if the value being accessed is optional and can reasonably be expected to be missing, then Optional is preferred to null. But in cases where something really shouldn't be null in 99.99% of cases, but occasionally can in weird situations, I think it's preferable to return null since using Optional gives the wrong impression.

magicDGS · 2018-07-13T10:41:27Z

So from the comments here, what I imagine now as a safe design:

Avoid null in most of the cases.
Use a constant in case there is a defined value the SAM/VCF/whatever specs (but not magic numbers/values). E.g., missing reference can be * (emty String is also a possibility, as usually empty is not a valid name).
Optional for cases where we can expect that something is not present: attributes in VCF or SAM records
null when none of this cases fits

Some return methods can distinguish in the API between missing in the record and missing value defined (e.g., VCF . missing value). In that case, this could be defined as 3+2 (return optional; not present was missing in the record; present but with a special value is explicitly missing in the record) or 3+4 (return empty optional if explicitly missing; null in case of not present at all).

I found this Wiki page from guava quite good for this discussion too: https://github.com/google/guava/wiki/UsingAndAvoidingNullExplained

magicDGS · 2018-07-27T09:44:23Z

More info about null to impove discussion: https://www.yegor256.com/2014/05/13/why-null-is-bad.html

magicDGS · 2018-09-10T20:44:08Z

What's about Bean Validation for specifying this and other constraints?

magicDGS added Priority:Normal Type:Maintenance Module:Core Type:API and removed Type:Maintenance labels Jul 13, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decide if we will ever return null from a function. #10

Decide if we will ever return null from a function. #10

lbergelson commented Jul 6, 2018

magicDGS commented Jul 7, 2018

tfenne commented Jul 10, 2018

magicDGS commented Jul 13, 2018

magicDGS commented Jul 27, 2018

magicDGS commented Sep 10, 2018

Decide if we will ever return null from a function. #10

Decide if we will ever return null from a function. #10

Comments

lbergelson commented Jul 6, 2018

magicDGS commented Jul 7, 2018

tfenne commented Jul 10, 2018

magicDGS commented Jul 13, 2018

magicDGS commented Jul 27, 2018

magicDGS commented Sep 10, 2018