public void testGcd() {
try {
MathUtils.gcd(Integer.MIN_VALUE, 0);
fail("expecting ArithmeticException");
} catch (ArithmeticException expected) { /* expected */ }
}
↓
public void testGcd(int i, int j) {
try {
boolean complement = !( (i==Integer.MIN_VALUE && j==0) || (i==0 && j==Integer.MIN_VALUE) );
final long actual = MathUtils.gcd(i, j);
preserveIf(complement, () −> new Long[] { actual });
} catch (ArithmeticException expected) {
preserveIf(!complement, () −> new String[] { e.toString() });
} catch (Exception e) {
failToPreserve();
}
}
preserveIf
and other APIs for preservation conditions.Question | Bug ID | Pattern |
---|---|---|
Q1 | Math73 | CC (Complementary Case) |
Q2 | Math105 | EGA (Existing General Assertion) |
Q3 | Math28 | UE (Unexpected Exception) |
Q4 | Lang58 | RI (Reference Implementation) |
Manual Group: Find a correct patch without using Poracle.
Semi-Automated Group: Find a correct patch using Poracle.
# Food for Thought - Ideally, incorrect patches are supposed to be identified by the test suite. data:image/s3,"s3://crabby-images/1d3cc/1d3cccb76e436212d7bdebc743427c4a6346d7e0" alt="width:1000px" ---
# Things to Think About data:image/s3,"s3://crabby-images/139bc/139bc509e402971250d91ad4dbd210dac249112d" alt="bg left:33% fit" - The large gap between the plausible patch space and the correct patch space suggests that the quality of the test suite is not good enough for patch validation. ---
# Difficulty of Test Generalization $\forall \vec{v}: T(\vec{v}) = \psi(\vec{v})$ - $\vec{v}$: inputs - $T(\vec{v})$: output of test $T$ when $\vec{v}$ is given - $\psi$: the oracle function ---
# Preservation Condition Example - Math105 bug: assertion --reused-> preservation condition ```java public void testSSENonNegative(double d1, ..., double d6) { try { double[]y={d1,d2,d3}; double[]x={d4,d5,d6}; SimpleRegression reg = new SimpleRegression(); for(inti=0;i<x.length;i++) { reg.addData(x[i], y[i]); } double ret = reg.getSumSquaredErrors(); // Original: assertTrue(ret >= 0.0); preserveIf(ret >= 0.0, () −> new Double[] { ret }); } catch (Exception e) { failToPreserve(); } } ``` # Preservation Condition Example - Lang58 bug: exploits a reference implementation ```java public void testLang300(int n, int m) { // NumberUtils.createNumber("1l"); // Original body // Test with a generalized input String s = "" + ((char) n) + ((char) m) + "l"; String actOut = ""; try { actOut = "" + NumberUtils.createNumber(s).longValue(); } catch (Exception e) { actOut = "Exception"; } // Use Long.valueOf as a reference String refOut = ""; try { refOut = "" + Long.valueOf(s); } catch (Exception e) { refOut = "Exception"; } preserveIf(actOut.equals(refOut), () −> new String[] { actOut }); } ```
# Patch Validation with Preservation Condition data:image/s3,"s3://crabby-images/b0197/b019765b0f2550663c6a188b21b5492f67a7c6c8" alt="width:1500px" ---
- Patch reviewing cost reduction - The number of patches to be reviewed after filtering
# Patch Reviewing Cost Reduction data:image/s3,"s3://crabby-images/4a30f/4a30f6aebda79dbf014f452bee16e6b95bc498a3" alt="bg left:33% fit" - JAID returns a ranked list of plausible patches. - We applied Poracle to the obtained ranked list of plausible patches and compared the number of patches to be reviewed before and after filtering. --- # Patch Reviewing Cost Reduction data:image/s3,"s3://crabby-images/b5983/b59831d53d98f5e30673b9c49c0959dc60969b12" alt="width:1200px" ---
- For each question, participants were divided into two groups.
--- # Ablation Study data:image/s3,"s3://crabby-images/c94a5/c94a527e69d5cff3b71f0fc84be42514abc8a99a" alt="width:700px" # Example - Failing test for Math95 of Defects4J ```java public void testSmallDegreesOfFreedom() { FDistributionImpl fd = new FDistributionImpl(1.0, 1.0); double p = fd.cumulativeProbability(0.975); double x = fd.inverseCumulativeProbability(p); assertEquals(/* expected output */ 0.975, x, /* delta */ 1e-5); } ``` --- # Example - Generalizing the failing test ```java public void testSmallDegreesOfFreedom() { FDistributionImpl fd = new FDistributionImpl(1.0, 1.0); double p = fd.cumulativeProbability(0.975); double x = fd.inverseCumulativeProbability(p); assertEquals(/* expected output */ 0.975, x, /* delta */ 1e-5); } ``` ↓ ```java public void testSmallDegreesOfFreedom(double d1, double d2, double d3) { FDistributionImpl fd = new FDistributionImpl(d1, d2); double p = fd.cumulativeProbability(d3); double x = fd.inverseCumulativeProbability(p); assertEquals(/* expected output */ ________, x, /* delta */ 1e-5); } ``` --- # Preservation Condition Example - Math95 bug: An unexpected exception occurs. ```java public void testSmallDegreesOfFreedom(double d1, double d2, double d3) { try { FDistributionImpl fd = new FDistributionImpl(d1, d2); double p = fd.cumulativeProbability(d3); double x = fd.inverseCumulativeProbability(p); preserveIf(/* preservation condition */ true, /* outputs to compare */ () -> new Double[] {x}) } catch (Exception e) { failToPreserve(); } } ``` data:image/s3,"s3://crabby-images/b6447/b6447bd2924f834b2fb2fa1d83f52e93c2818485" alt="width:900px" --- # Classification Performance data:image/s3,"s3://crabby-images/35537/35537196630252354bd0add64eb9b674329afe55" alt="width:700px" - BERT-LR: Haoye Tian et al., "Evaluating representation learning of code changes for predicting patch correctness in program repair", ASE 2020 --- # Developer Patches data:image/s3,"s3://crabby-images/13a10/13a101fd0aaa3f4f1e2875fc236419feac0665f1" alt="width:700px" --- # Correct Answer Ratio | Top 50% Students | Bottom 50% Students | |:---:|:---:| | data:image/s3,"s3://crabby-images/33634/3363408d7ce0a0d57d37fa5329277d516e76b129" alt="width:600px" | data:image/s3,"s3://crabby-images/392f7/392f764dd5d62c9e385f71950cf4ef91f5d1ae30" alt="width:600px" | --- # Manual Time Cost | All students | Students who submitted correct answers | |:---:|:---:| | data:image/s3,"s3://crabby-images/a37b9/a37b94e0c8261304effb75bc20aa3cdf60ebbd78" alt="width:600px" | data:image/s3,"s3://crabby-images/9f08c/9f08ce6e32b7e84f3a92111247c434ffd20aba12" alt="width:600px" | --- # Sentiment | All students | Top 50% students | |:---:|:---:| | data:image/s3,"s3://crabby-images/9172e/9172e9cc7e5c384cb5f650c9e5459b98cf8a623a" alt="width:600px" | data:image/s3,"s3://crabby-images/572d1/572d1af4da1f758086c8ab365c8f711dd709f0e3" alt="width:600px" |