First Look At the ACID3 Browser Test 133
ddanier writes "Now that all major browsers have mastered the ACID2 test (at least in some preview versions), work on ACID3 has begun. The new test will focus on ECMAScript, DOM Level 3, Media Queries, and data: URLs. 100 tests will be put into functions each returning either true or false depending on the result of the test. The current preview of ACID3 is still missing 16 tests."
From the summary: (Score:5, Funny)
Re:From the summary: (Score:5, Funny)
Re:From the summary: (Score:5, Funny)
discourse in social choice using selective spelling excuses?
Re: (Score:1)
DiscusseS (Score:1)
Re:From the summary: (Score:5, Funny)
Re: (Score:2)
T<span style='display: {can_prove(P = NP)};'>h</span>e
Acid3 is hard.
I bet some devs are really pissed now (Score:2, Insightful)
Re:I bet some devs are really pissed now (Score:4, Informative)
Re:I bet some devs are really pissed now (Score:5, Informative)
Camino 1.0.3 crashes when starting the test
Safari 2.0.4 doesn't even get started. It says I need to enable JavaScript, which is enabled.
Re: (Score:2, Informative)
Re:I bet some devs are really pissed now (Score:5, Insightful)
I would be glad to receive bug reports with an easy to use test case. It saves me the trouble of determining if it is a bug or not, coming up with a test case, the pain of communicating back and forth with the customer trying to find out what they are doing and how the bug is being triggered, etc. Also, this test suite will improve compatibility with other browsers so it will reduce bug reports in the long run.
Why the heck would they be pissed?
Re: (Score:2)
Because their manager is pissed that they have to spend more money shoring up their shoddy product and is taking it out on the dev team?
Re:I bet some devs are really pissed now (Score:4, Insightful)
> Camino 1.0.3 crashes when starting the test
> Safari 2.0.4 doesn't even get started.
Those aren't the current versions of any of those browsers--not even close in the case of Camino and Safari--so that's not a terribly interesting test list.
Re: (Score:1)
Camino... I'll give you that one. I apparently need to upgrade.
Safari is the latest version available for Tiger, which is the latest available version of the OS that can run on the laptop I have here.
Re:I bet some devs are really pissed now (Score:4, Informative)
Safari 3 is available for both Tiger and Leopard. The 10.4.11 update [apple.com] includes Safari 3.
Not in 2.0.0.x. Try Minefield. (Score:1)
Re: (Score:2)
Re:I bet some devs are really pissed now (Score:5, Informative)
After prompting me if I wanted to open empty.txt, it segfaulted my Konqueror with this backtrace:
Using host libthread_db library "/lib/tls/i686/cmov/libthread_db.so.1".[Thread debugging using libthread_db enabled]
[New Thread -1232832304 (LWP 8079)]
[KCrash handler]
#6 0xb609a9a1 in ?? () from
#7 0xb5f325d4 in ?? () from
#8 0x081e1f38 in ?? ()
#9 0xbfcde5a4 in ?? ()
#10 0xbfcde588 in ?? ()
#11 0xb60fe4fd in DOM::NodeFilter::acceptNode () from
Backtrace stopped: frame did not save the PC
I think we have a zeroday on our hands boys!
Re: (Score:2)
mov ebx, dword [eax]
where eax == 0.
It's a crash, but fortunately, a non-exploitable one.
konqi and 'fox on Ubuntu64 7.10 (Score:2)
Konqueror-kde4 gives 1/100 (!) - Konqueror 4.00.00 (KDE 4.0.0)
Konqueror FAIL - crashes, screencap shows that it says "scripting must be enabled" even when it is
FWIW
Re:I bet some devs are really pissed now (Score:5, Insightful)
The ACID3 test won't be a test for standards compliance either. The way I see it it's just a tool to motivate developers to work TOWARDS standards compliance.
The ACID3 test should, therefore, not be seen as a new set of standards. It's just a different subset of standards.
Re: (Score:3, Insightful)
AFAIK, the purpose of the ACID tests were basically to demonstrate a specific set of rendering bugs, supposedly bugs chosen because they were common complaints of web developers. So the purpose wasn't to test standards compliance, but to give browser developers a target to hit in order to help web developers with some of their more common problems.
Re: (Score:3, Insightful)
Re: (Score:2)
Re: (Score:3, Insightful)
It's hard enough to write one of these tests (have you seen their source code?!), let alone write it in such a way that when it fails it presents a clear message explaining why it isn't rendering correctly and giving helpful hints to the rendering engine developers.
Re:I bet some devs are really pissed now (Score:5, Interesting)
http://hixie.ch/tests/adhoc/ [hixie.ch]
The Acid tests are easier for the less technically inclined to get a hold of. In practice, the browser vendors take Acid tests and turn them into small tests of the kind you describe before fixing them. For Acid2, I was the one who did a number of those small tests for Opera (I worked for Opera at the time) -- you can see them here:
http://www.hixie.ch/tests/evil/acid/002/opera001.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera002.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera003.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera004.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera005.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera006.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera007.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera008.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera009.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera010.html [hixie.ch]
http://www.hixie.ch/tests/evil/acid/002/opera011.html [hixie.ch]
They're not as exciting as the smiley face, so they don't get the media's attention in the same way.
Re: (Score:3, Insightful)
Not quite a new "subset" ... (Score:3, Insightful)
Re:I bet some devs are really pissed now (Score:5, Informative)
http://www.webdevout.net/browser-support-summary?IE7=on&FX2=on&OP9=on&uas=CUSTOM [webdevout.net]
Re: (Score:1, Interesting)
Re:I bet some devs are really pissed now (Score:5, Informative)
Which is why the GP shouldn't be modded as "Insightful."
The ACID Tests are meant to test certain parts of the proposed standards.
Passing the Test doesn't imply standards compliance.
BUT
Standards compliances DOES imply passing the tests.
Re: (Score:2)
>:O
humans (Score:2)
Re: (Score:3, Informative)
Everything in the ACID3 test is at an implementable stage (look at Anne's blog post in the summary (i.e., RTFA)), and has been since 2004.
Re:I bet some devs are really pissed now (Score:5, Insightful)
Re: (Score:2)
bad assumption (Score:2)
Various Scores (Score:5, Informative)
Final scores of course are subject to change on the final test:
* - script takes long enough to run that browser prompts you to kill it.
Re:Various Scores (Score:5, Informative)
Re: (Score:1)
Re: (Score:2)
Re: (Score:1)
And the IE6 version, you don't even have to compare it to the reference rendering.
Re:Various Scores (Score:5, Informative)
I don't know what versions you're using, but at least for elinks (and links), they both support javascript. Just has to be compiled in.
eix elinks
* www-client/elinks
Available versions: 0.11.2 0.11.2-r1 0.11.3 {X bittorrent bzip2 debug finger ftp gopher gpm guile idn ipv6 javascript lua nls nntp perl ruby ssl unicode zlib}
Homepage: http://elinks.or.cz/ [elinks.or.cz]
Description: Advanced and well-established text-mode web browser
eix ^links$
[I] www-client/links
Available versions: (2) 2.1_pre26 2.1_pre28-r1
{X directfb fbcon gpm javascript jpeg livecd png sdl ssl svga tiff unicode}
Installed versions: 2.1_pre28-r1(2)(21:18:19 11/07/07)(javascript ssl tiff unicode -X -directfb -fbcon -gpm -jpeg -livecd -png -sdl -svga)
Homepage: http://links.twibright.com/ [twibright.com]
Description: links is a fast lightweight text and graphic web-browser
So while they do support javascript, they don't support iframes, and the test uses 3 of those.
Re: (Score:1, Informative)
Re: (Score:2)
Re: (Score:2)
IE6 IS 100% OF THE STANDARDS!!!!1111 (Score:2, Funny)
Re: (Score:1)
Re: (Score:2)
Re: (Score:2)
Same here, although it always says it should be 'smooth', and I'd say it's nowhere close that.
# Internet Explorer 7 @ Windows XP
My eyes! My eyes!
Re:Various Scores (Score:4, Informative)
Safari 3.0.4 on Windows using WebKit-r29380 (today's nightly build), Safari scores a 70/100.
Re: (Score:2)
Firefox 3 nightly @ Linux: 63 %
Re: (Score:2)
Firefox probably has similar results. It displays a red box with a weird bunny creature inside. I looked at the "reference redering" and laughed. It's supposed to be a series of boxes, each a different rainbow color.
It should be noted that the ACID3 test is not ready, and should not really be used for comparison, and certainly should not be used by browser developers yet.
Re: (Score:2)
Konqi 3.5.8 crashes, Firefox 2.0.11 fails at 59... (Score:3, Interesting)
Safari 3.0.4 (Windows) hangs at 60, Internet Explorer 7.0.5730.11 messes up so badly the result can't be read...
The test looks interesting, for sure. And it's going to raise the game for standards compliance!
Firefox 3 Beta 2 (Windows) gets to 62 (Score:3, Informative)
... and looks somewhat like the reference image...
Opera 9.2.4 (Windows) reaches 55 (but looks horrible)...
Firefox 3 looks like the best shot at it so far.
Re:Konqi 3.5.8 crashes, Firefox 2.0.11 fails at 59 (Score:2)
Also, with the emphasis on ECMA script and animation, it'll raise standards and compliance for games.
Re: (Score:2)
More fun acid hacking for me
Re: (Score:1)
I'm wondering whether the Konqueror devs will notice and fix this, since a drop of 84% in that amount of time is definitely messed up.
Re:Konqi 3.5.8 crashes, Firefox 2.0.11 fails at 59 (Score:2)
Look, I don't mean to be getting you down about this, but I'm fairly certain this is the only time such a phrase has been uttered with such enthusiasm. Ever. Congratulations of a sort are in order.
Swell, but misses the point (Score:1)
Re: (Score:2, Insightful)
Re: (Score:2)
Because it focuses on the tip of the iceberg, the symptoms, not the disease. I wonder how many of the browsers have been tweaked to pass certain tests, instead of being engineered to meet the specs.
Re: (Score:3, Insightful)
Re: (Score:1)
Because it focuses on the tip of the iceberg, the symptoms, not the disease. I wonder how many of the browsers have been tweaked to pass certain tests, instead of being engineered to meet the specs.
Your medical analogy is apt. Black-box testing is pointless in exactly the same way that going to a doctor when you are sick is pointless. The symptoms are all you have initially to indicate there is any disease or what it's nature might be. Those with clear-box knowledge (doctors or developers) can use the symptoms as a starting point fur further inquiry to diagnose the underlying problem.
I know what theory says about testing and correctness and all that. It doesn't work that way in the real world.
Testing is the best we have (Score:2)
Yes, testing can never prove a program correct. On the other hand, do you think you'd get anywhere trying to prove that anything about any browser is correct using formal methods? Especially when the source code for most browsers is not even publicly available.
The Acid tests are also not really about finding obscure bugs, but about demonstrating which basic features work and which ones do not work. After all major browsers pass an Acid test, web developers can attempt to use the features tested by the Acid
Re: (Score:2)
Where is the reference image from? (Score:5, Interesting)
Re:Where is the reference image from? (Score:5, Funny)
Re: (Score:3)
I don't know if you're trying to be Funny(sarcastic) or Informative.
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Oddly enough, I had trouble finding a site that renders in quirks mode so I could get the exact message that the dialog displays. I tried about a half dozen sites in my bookmarks and they were all rendered in standards compliant mode. So then I tried microsoft.com [microsoft.com] and bang, I had a site that rendere
Re: (Score:1)
Re:Where is the reference image from? (Score:5, Interesting)
You don't
I remember an article by the Apple guy who made ACID2 work on Safari (I think this was the first browser to make it work). One of the steps to get it working was to fix a bug in the test, when he couldn't make the reference result fit with what the test HTML said.
Re:Where is the reference image from? (Score:5, Informative)
http://weblogs.mozillazine.org/hyatt/archives/2005_04.html#008011 [mozillazine.org] details the bug (in this case, it was the test itself that was wrong -- not the reference). The reference rendering for Acid3 is likely correct as the actual rendering isn't overly complex (the complexity is in the ECMAScript and DOM support), though with the complexity of some tests there could easily be bugs in the test again.
Re: (Score:2)
Re: (Score:1)
Re: (Score:2)
In my view a test isn't a test until it is finished and can be used as a test, so it isn't yet a test
Your view dramatically reduces the value of a test. It's very normal in any test process that every "failure" has to be evaluated to determine if the error is in the implementation, the test or even the specification. Building and running tests is how you find the issues that need to be resolved. Saying they're not tests until both the test and the specification have been proved correct is just a word game that makes it necessary to come up with some other term for the "tests" that aren't quite yet "te
Re: (Score:2)
I just wish we could get that into the heads of the big software shops... you know who you are. It's unfortunate that money turns against a good idea.
Re: (Score:1)
In fact, it always downloads a file "data006" when openig this page.
(tested with Safari 3.0.4 (5523.10.6) on Leopard)
Firefox 3 Beta passes this test.
Re: (Score:2)
You're not supposed to question Faith
Re: (Score:2)
For Acid2, I made a second version of the test that worked around all the bugs in Firefox, and then took a screenshot of Firefox.
For Acid3, I actually made the background of the reference rendering first as a simple HTML file, took a screenshot of that, made that the background of the reference.html file, and then added some text to the reference file and used absolute positioning to get the text wh
So.... (Score:2, Flamebait)
Re:So.... (Score:5, Informative)
ECMAScript, DOM Level 3 (Score:3, Informative)
Konqueror fails (Score:2)
But there's a newer version in the repository, so I'm going to upgrade and see what happens.
Re: (Score:2)
This time, it asks me what to do with a file it is trying to download, then crashes (with a different-looking requester; maybe that also changed between versions).
Re: (Score:2)
KDE4 isn't in Sid yet (it's a credit to Debian that "unstable" is actually so stable, that they have to have an "experimental" distribution). I'd grab a Kubuntu liveCD and try that, but it wouldn't necessarily prove anything: crashing in Kubuntu is hardly news.
Woo hoo! (Score:2)
Oh, wait...
Who cares? (Score:2)
Re: (Score:2)
(Based on a study I did at Google using several billion pages.)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
The authors of the Acid2 test (primarily me) didn't actually include any HTML4 parsing error handling tests. There were some CSS ones, but a far cry from all the ones I could think of. (Acid3 has even fewer.) The CSS and HTML5 standards define how you handle errors, by the way --
Re: (Score:2)
This misconception seems to come up every time Acid2 gets mentioned on Slashdot.
Broken code was part of the Acid2 test, but far from the primary focus. Read the test guide [webstandards.org] sometime. The first Acid test [cleverchimp.com] didn't use broken code at all; it simply tested implementation of the box model.
Since when is vaporware acceptable as proof? (Score:3, Interesting)
When Firefox makes news on this there are daily builds to test, source code to inspect and compile. One can see the progress first-hand.
There is no build of Microsoft Internet Explorer 8 to test. You are accepting something unverifiable as reality and thus talking about these browsers as if they're all on the same level. This suggests a new low: believing the illegal monopolist who tells you that their vaporware behaves in accordance with published publicly-implementable standards.
Re: (Score:2, Funny)
If it can't hold up, maybe it needs some work...
Re: (Score:2)
http://it.slashdot.org/comments.pl?sid=394442&cid=21757950 [slashdot.org]