gono-go_continuation_f17_beta_rc3_part_two_or_three
LOGS
15:00:38 <rbergeron> #startmeeting Go/No Go Continuation, Round Two, RC3, F17
15:00:38 <zodbot> Meeting started Thu Apr  5 15:00:38 2012 UTC.  The chair is rbergeron. Information about MeetBot at http://wiki.debian.org/MeetBot.
15:00:38 <zodbot> Useful Commands: #action #agreed #halp #info #idea #link #topic.
15:00:53 <rbergeron> #meetingname Go/No-Go Continuation F17 beta rc3 part two or three
15:00:53 <zodbot> The meeting name has been set to 'go/no-go_continuation_f17_beta_rc3_part_two_or_three'
15:00:59 <ianweller> round two: electric boogaloo
15:01:03 <rbergeron> #topic Who's up
15:01:13 <rbergeron> wwoods, tflink, adamw, nirik
15:01:15 <ianweller> oh god, i can forsee an interesting meetbot bug
15:01:16 <ianweller> hold on
15:01:26 <rbergeron> ianweller: yeah, i have fudged that one up a few times
15:01:27 * nirik is around.
15:01:29 <rbergeron> i forget when i'm sleepy
15:01:41 <ianweller> never mind, it handles it okay :)
15:01:48 * pschindl_ is here
15:02:01 <adamw> yo
15:02:28 * tflink is around
15:02:39 <adamw> both bcl and hughsie are active, just trying to get them both in here
15:03:14 <rbergeron> okay
15:03:16 <adamw> we're gonna need some intellectual firepower on preupgrade
15:03:18 <rbergeron> #chair nirik adamw tflink
15:03:18 <zodbot> Current chairs: adamw nirik rbergeron tflink
15:03:42 <rbergeron> Do we want to look at test list, new proposed, or preupgrade drama first
15:03:45 <adamw> <hughsie> adamw, i'm got a RH conf call now, sorry
15:04:31 <adamw> the last two are the same...
15:04:37 <adamw> preupgrade drama is one of the (two) new proposed
15:05:06 <adamw> or three. or what the hell ever.
15:07:03 <adamw> we're gonna need _someone_ who can take a stab at the preupgrade bug, though.
15:08:25 <rbergeron> three, yeah
15:08:29 <adamw> we can nibble around the edges
15:09:19 <rbergeron> okay, want to look at the validation list?
15:11:11 * rbergeron looks for anyone to agree with her
15:11:22 <tflink> works for me
15:11:26 <rbergeron> just to get any administrative bits out of the way :)
15:11:29 <tflink> we have to do it at some point :)
15:11:31 <rbergeron> #topic Test Results
15:11:37 <rbergeron> #link https://fedoraproject.org/wiki/Test_Results:Fedora_17_Beta_RC3_Install
15:12:11 <rbergeron> https://fedoraproject.org/wiki/Test_Results:Fedora_17_Beta_RC3_Base
15:12:22 <rbergeron> https://fedoraproject.org/wiki/Test_Results:Fedora_17_Beta_RC3_Desktop
15:12:40 <adamw> i'm happy with the coverage at this point
15:12:57 <tflink> yeah, other than preupgrade, I'm not seeing anything that is a huge red flag
15:12:57 <adamw> we can pull the missing tests from rc2, especially for kde
15:13:16 <adamw> installation and base are full except for scsi, which we have no hardware for.
15:13:30 <adamw> er, 'full' wrt beta.
15:13:46 <adamw> all fails that ought to be proposed as blockers are, i'd say
15:14:00 <adamw> i'm not horribly worried about any of the efi fails, as enough methods work
15:14:52 <adamw> they're all marked 'alpha' because we want in the big picture EFI to work at alpha, but we don't really need every single permutation to pass.
15:15:43 <rbergeron> okay
15:16:08 <rbergeron> so ... agreed we are okay with the level of test coverage as of now on RC3
15:16:14 <rbergeron> ?
15:16:26 <adamw> ack
15:16:31 <rbergeron> #agreed we are okay with the level of test coverage as of now on RC3
15:16:41 <rbergeron> #topic Blockers
15:17:07 <rbergeron> adamw: I leave it to your discretion about the tackling of preupgrade vs. other bugs
15:17:11 <rbergeron> or tflink
15:17:17 * rbergeron suspects y'all are tyring to catch someone who can give feedback
15:17:24 <adamw> yes.
15:17:29 <adamw> oh, hey bcl, thanks
15:17:37 <rbergeron> #chair bcl
15:17:37 <zodbot> Current chairs: adamw bcl nirik rbergeron tflink
15:17:56 <bcl> so, where are we at?
15:18:02 <adamw> about to start looking at blockers
15:18:17 <adamw> let's take the easy ones first
15:18:19 <adamw> #topic https://bugzilla.redhat.com/show_bug.cgi?id=741594
15:18:39 <adamw> my feelings on this can be summarized by the words BIG RED NO, accompanied by several other words not fit for print.
15:18:44 <Martix> adamw: easy?
15:18:48 * nirik clicks
15:18:54 <Martix> https://bugzilla.redhat.com/show_bug.cgi?id=741594#c14
15:19:14 <Martix> please refresh page
15:19:35 <adamw> the answer to that is to have your shit together for a test day more than one day in advance.
15:19:51 <nirik> this was a yum update on a running live media? not a install from live media?
15:19:53 <adamw> yes,
15:20:04 <Martix> adamw: I did, but dev is doing what he thinks
15:20:14 <nirik> -1 blocker.
15:20:18 <adamw> bottom line, there has never been the slightest expectation that yum update on running live would work, not in the critera, not anywhere else. it's absolutely not a beta blocker.
15:20:22 <tflink> I'm pretty much with adam on this one, but a little less extreme :)
15:20:26 <adamw> Martix: i didn't mean personally, i meant in general.
15:20:30 <bcl> yum update on live really isn't a good idea.
15:20:34 <tflink> -1 blocker
15:20:35 <Martix> adamw: ok
15:20:41 <adamw> if you're relying on your users being able to yum update a live image, You Have Already Lost. :)
15:20:46 <nirik> it might be nice to capture the attempt and warn or something.
15:20:58 <rbergeron> -1 blocker :)
15:20:59 <Martix> yum update is often workflow during Test Days
15:21:08 <nirik> unless you have a LOT of memory. ;)
15:21:16 <bcl> Martix: a full update or specific packages?
15:21:17 <adamw> Martix: it wasn't when i was running them ;)
15:21:33 <Martix> nirik: 4GB isnt lot of memory?
15:21:38 <nirik> no.
15:21:48 <Martix> adamw: Gnome Shell SW rendering?
15:22:03 <Martix> nirik: what about persistent overlay?
15:22:10 <bcl> a yum update of a small set (1) of packages *may* be ok. but I wouldn't want to support it in general.
15:22:13 <nirik> every file added, _and_ every file removed has to be kept in the dm overlay. Aside from isolated small packges being installed, yum upgrade from a running live media is doomed.
15:22:15 <adamw> Martix: yeah, i didn't run that. i did tell desktop team several times to make sure they had an updated live image ready.
15:22:26 <Martix> bcl: what is small?
15:22:29 <adamw> (1)
15:22:38 <adamw> persistent overlay is /home only, isn't it?
15:22:50 <nirik> sure, if you have usb and persistent overlay you are in much better shape.
15:23:03 <nirik> but it still goes away a lot faster than you think.
15:23:18 <nirik> it's not just keeping the current change... but everything in between.
15:23:35 <Martix> btw AFAIR yum update always worked on F14 or F15 on Live media
15:23:50 <bcl> luck
15:23:53 <tflink> yeah, I remember seeing goofy errors when I attempted to debug a kernel issue  by updating the kernel in usb+overlay live
15:24:28 <adamw> i've certainly seen it explode in 14/15
15:24:32 * satellit_laptop liveinst to 8 GB USB is OK
15:24:39 <adamw> it's obviously going to depend on how many updates happen to be available at the time
15:24:43 <bcl> anyway, not a blocker. -1
15:24:44 <tflink> it turned out to be easier to just build livecds or do an install to a USB HD
15:25:06 <Martix> adamw: it was updates after release
15:25:28 <Martix> but 100 or 200 MB of update shouldnt kill system with 4 GB RAM or overlay
15:25:44 <nirik> I suspect it will.
15:25:50 <adamw> anyhow
15:25:51 <tflink> that sounds like a really bad idea to me
15:25:52 <adamw> we have solid - votes
15:25:58 <tflink> yep
15:26:06 <nirik> 100-200MB of updates is xz compressed payload... unpacking is... larger.
15:26:13 <Martix> free space is never exhausted when I run df -h
15:26:22 <adamw> propose #agreed 741594 does not hit any of the criteria and the cited scenarios are not significant enough to worry about delaying the beta
15:26:27 <tflink> ack
15:26:48 <nirik> Martix: it's not in a df... it's in a overlay.
15:27:43 <adamw> acks welcome
15:27:46 <rbergeron> ack
15:27:54 <nirik> Martix: 'dmsetup status live-rw'
15:28:05 <bcl> ack
15:28:30 <nirik> ack
15:29:18 <adamw> #agreed 741594 does not hit any of the criteria and the cited scenarios are not significant enough to worry about delaying the beta
15:29:26 <rbergeron> moving on then
15:29:56 <rbergeron> wait, there were three, now there are five
15:30:48 <rbergeron> which one's next :)
15:31:10 <adamw> let's go for a kde party
15:31:12 <adamw> #topic https://bugzilla.redhat.com/show_bug.cgi?id=810226
15:31:46 <adamw> "Cursor invisible in KDE"
15:32:05 <adamw> i can reproduce this, but only in a VM set up with qxl as the driver but VNC as the viewer, which has never been a default config
15:32:19 <adamw> the old default was cirrus/vnc (which hits another bug we'll come to), the new default is qxl/spice
15:32:29 <adamw> you can only get cirrus/spice by manual config
15:32:33 <adamw> has anyone else poked this one?
15:32:49 <tflink> adamw: how long as qxl/spice been the default?
15:33:06 <bcl> oh, hey, I hit that.
15:33:22 <tflink> a vm I just created on F16 x64 is using cirrus/vnc by default
15:33:39 <bcl> same conditions as adamw, I was working around the shell screen corruption bug.
15:34:00 <adamw> tflink: a new machine definition?
15:34:20 <adamw> then i really give up on figuring out those frickin' defaults. how did you create it?
15:34:24 <tflink> adamw: yeah, new disk, new VM
15:34:26 <adamw> bcl: so, qxl/vnc ?
15:34:29 <tflink> created w/ VMM
15:34:30 <bcl> right
15:34:49 <adamw> if no-one can reproduce with cirrus/vnc or qxl/spice i'm inclined towards -1 on this, though it's close
15:35:03 <tflink> the only non-default thing I do is lvm backed storage instead of flatfile
15:35:28 * tflink hasn't tried, to be honest
15:36:33 <bcl> adamw: oh, and this isn't KDE specific. I see it in anaconda which uses metacity and gtk
15:36:40 <adamw> ah, that's interesting
15:37:14 <Kevin_Kofler> Looks like an issue deeper in the stack than the KDE Workspaces for sure.
15:38:05 <adamw> seeing if i can confirm with desktop
15:38:19 <adamw> yup
15:38:28 <adamw> so basically any qxl/vnc is invisible cursor.
15:39:07 <tflink> not sure if it's a beta blocker, though
15:39:19 <tflink> since it doesn't hit what we think the 2 most common configs are
15:39:24 <adamw> yeah, i'm kinda on the fence
15:39:28 <tflink> qxl/spice and cirrus/vnc
15:39:40 <adamw> asking in #virt if they have a feel for whether qxl/vnc is popular and/or supported
15:40:22 <adamw> anyone else have a feeling on this one?
15:40:47 <rdieter_work> adamw: I'd tend to agree with you, depends on how common that config is.
15:40:53 <bcl> I'd lean towards -1 -- I guess I need to learn how to use spice with qemu
15:40:55 <adamw> oh, hey, look what the first google result for 'qxl vnc' is.
15:40:55 <adamw> http://patchwork.freedesktop.org/patch/9198/
15:41:07 <rdieter_work> lolz
15:41:36 <rbergeron> folks: I have to depart now for dentist but I will be back on shortly to keep going in meeting
15:41:38 * nirik leans toward -1 as well... especially if it can be fixed in an update.
15:41:42 <rbergeron> #chair rdieter_work
15:41:42 <zodbot> Current chairs: adamw bcl nirik rbergeron rdieter_work tflink
15:41:58 <tflink> yeah, this can't be fixed on the live images - it's a host problem
15:42:01 * rbergeron would lean towards -1
15:42:04 <adamw> bcl: http://www.linux-kvm.org/page/SPICE
15:42:05 <tflink> unless I'm missing something
15:42:15 <rbergeron> oh, ew
15:42:47 <nirik> well, if you are using the live image in virt instead of installing in virt. Not sure how common either is off hand.
15:42:48 <bcl> adamw: thanks
15:43:11 <adamw> nirik: both v. common.
15:43:24 <Kevin_Kofler> If that's really the problem, it's indeed on the host end as tflink says, so not a bug in the live images at all.
15:43:30 <bcl> so if that bug is correct this has nothing to do with the media. It is a qemu bug and can be fixed later.
15:43:41 <tflink> but if this is a driver problem on the host, how can we fix it in beta?
15:43:48 <adamw> the patch is in qemu?
15:43:49 <nirik> well, I meant which is more common... but yeah.
15:44:38 <adamw> tflink: you can update the host?
15:44:39 <tflink> adamw: if it isn't, won't this affect F16, too?
15:44:45 <adamw> tflink: presumably. i can test that.
15:45:05 * tflink has a F16 VM sitting in front of him, will try
15:45:08 <adamw> if the bug's in the host, then definitely -1.
15:46:04 <adamw> yup, reproduced with a 16 desktop live.
15:46:23 <adamw> so given that, i'm a definite -1. i'll try and hack up a fixed qemu to confirm it can be fixed with a host update, but sounds convincing.
15:46:32 <bcl> same here with my qemu-kvm and f16 netboot
15:46:36 <bcl> -1 blocker
15:47:13 <tflink> -1 blocker
15:48:04 <adamw> propose #agreed 810226 only hits a non-default configuration (qxl as the graphics driver, VNC as the display agent) and the bug is on the host side, not the guest side, so it can safely be fixed with an update: we do not need to re-spin images to fix it. therefore, rejected as a blocker
15:48:15 <tflink> ack
15:48:20 <bcl> ackack
15:48:29 <nirik> ack
15:48:37 <tflink> adamw: we probably want to commonugs it, though
15:48:40 <adamw> point
15:48:44 <tflink> commonbugs, even
15:48:45 * nirik nods.
15:48:57 <adamw> #topic https://bugzilla.redhat.com/show_bug.cgi?id=810161
15:49:04 <adamw> so, this one's likely to be a bit more sticky. =)
15:49:32 <adamw> background: we had a blocker bug, https://bugzilla.redhat.com/show_bug.cgi?id=809052 , which was really bad display corruption in shell on a cirrus/VNC VM
15:50:08 <Kevin_Kofler> So now we have KDE Platform applications being corrupted instead.
15:50:16 <adamw> the update https://admin.fedoraproject.org/updates/xorg-x11-drv-cirrus-1.3.2-19.fc17,mesa-8.0.2-2.fc17 fixes it and was pulled into rc3, but it introduces a (MHO) less serious display corruption regression in KDE on cirrus/VNC
15:50:19 <Kevin_Kofler> Plus, the "fix" degrades overall display quality by reducing color depth.
15:50:46 <Kevin_Kofler> IMHO, this kind of change is landing way too late for its impact and should be pulled out.
15:50:46 <adamw> Kevin_Kofler: i don't believe that's actually true, i'm pretty sure i've heard ajax explain before that there's actually no practical difference between 16 and 24. the details escape me, though.
15:51:24 <adamw> ajax's rationale for doing it this way is that the 24bpp path is used by almost no drivers and not well tested; he thinks all drivers should be on 32 or 16
15:51:33 * adamw trying to find records
15:51:56 <Kevin_Kofler> 24 bpp is what cirrus used for ages, and in fact we went out of the way to fix Qt to work with it back in the day.
15:52:05 <bcl> adamw: so this only hits KDE in VM and only with cirrus/vnc?
15:52:12 <Kevin_Kofler> And now we get force-switched to 16 between Beta RC2 and Beta RC3. :-/
15:52:21 <adamw> so, important that we understand the issue fully: has anyone seen anything besides the pink scrollbars?
15:52:46 <adamw> i only tested briefly, enough to see the pink stuff, i didn't see anything further than that immediately but i might've missed something
15:52:47 <adamw> bcl: yes.
15:52:48 <nirik> is this a cirrus bug? or a general 16bpp issue with kwin/etc?
15:53:03 <Kevin_Kofler> Those pink scrollbars are so blatant they immediately strike a user as "broken".
15:53:04 <adamw> bcl: desktop is fine with cirrus/vnc, kde is fine with qxl/spice.
15:53:05 <nirik> I guess cirrus
15:53:08 <bcl> adamw: so does it work ok with spice/qxl?
15:53:13 <adamw> Kevin_Kofler: sure, i just want to make sure we catch if there's anything worse
15:53:26 <bcl> I'd say -1 blocker then.
15:53:30 <adamw> Kevin_Kofler: i don't want to be discussing it on the basis of 'pink scrollbars' if the bug is actually 'pink scrollbars and my baby explodes'
15:53:42 <rdieter_work> hard to say, do any other drivers use 16bpp by default?
15:53:56 <adamw> rdieter: ajax says all the other 'fallback drivers' use 16bpp
15:53:59 <Kevin_Kofler> vesa does, I think.
15:54:02 <Martix> adamw: when I was discussing next TD with jreznik he told me, there are more artefacts than just scrollbars
15:54:08 * rdieter_work only has old i945 i965 to test
15:54:11 <adamw> i'm not entirely clear what he means by 'fallback drivers' but i believe we're talking vesa, fbdev, that kinda stuff.
15:54:13 <Kevin_Kofler> I don't think we've ever seen that particular issue with vesa though.
15:54:19 <adamw> Martix: is he around?
15:54:24 <Kevin_Kofler> So I suspect a driver bug.
15:54:31 <Martix> adamw: unfortunately not now
15:54:48 <adamw> Kevin_Kofler: i tried to ping ajax/airlied to discuss it
15:55:19 <adamw> hi ajax
15:55:23 <adamw> so we're arguing about https://bugzilla.redhat.com/show_bug.cgi?id=810161
15:55:36 <adamw> the kde regression introduced by the fix for https://bugzilla.redhat.com/show_bug.cgi?id=809052
15:55:54 <ajax> oh good, choice.
15:55:59 <adamw> =)
15:56:15 <ajax> i actually found something similar in gnome while trying to fix 16bpp gears to not be translucent
15:56:15 <adamw> kev was questioning switching to 16bpp so late, any background on that? specifically the question of what drivers use what depths has come up
15:56:35 <ajax> normally we try to use 32bpp if at all possible
15:56:40 <Kevin_Kofler> Looks like cirrus is badly broken in 16 bpp then.
15:56:44 <ajax> Kevin_Kofler: no.
15:56:56 <ajax> cirrus is correct.  the desktop is broken.
15:57:07 <Kevin_Kofler> I'm not aware of ever having seen this on vesa though.
15:57:12 <Kevin_Kofler> And vesa is 16 bpp, isn't it?
15:57:17 <ajax> also no.
15:57:23 <Kevin_Kofler> Oh?
15:57:37 <ajax> if you'd let me talk...
15:57:56 <adamw> you're not both using the same keyboard, are you? :)
15:58:02 <Kevin_Kofler> ^^
15:58:23 <ajax> vesa tries for 32 then 16 then 24.  because 24bpp is easily the worst-tested path in the world.
15:58:35 <ajax> the cirrus card that qemu emulates can't do 24bpp.
15:58:36 <adamw> ah, checkboxes are half purple, there's purple spots on menu entries (File, Window etc)
15:58:40 <ajax> excuse me. can't do 32bpp.
15:58:56 * rbergeron peeks in
15:59:05 <ajax> it turns out that gnome is a little broken for 16bpp too, and it's _weird_
15:59:09 <adamw> slider widgets have purple backgrounds.
16:00:06 <adamw> let's say there's a definite lilac theme to the desktop. =) i haven't found anything that's actually unusable yet though.
16:00:07 <ajax> what happens is when cogl goes to create a glx pixmap for texture_from_pixmap, it guesses whether to do xrgb or argb based on whether the number of bits in the window's visual matches the pixmap's bit depth.
16:00:26 <ajax> but it turns out that's the wrong thing to do
16:00:42 <ajax> because the backing pixmap can often be 32bpp, due to the way composite works
16:00:58 <ajax> so you have a 16-bit visual but 32 bpp real storage
16:01:30 <ajax> so in cogl this manifests as the alpha channel being 0 a lot of the time, and gears is accidentally translucent
16:01:44 <ajax> but i'm reasonably sure you'd find something similar in the kde code
16:01:53 <Kevin_Kofler> But KDE does not use OpenGL at all on cirrus.
16:02:01 <ajax> did i say opengl?
16:02:06 <adamw> is anyone else testing this, btw? it's easy to poke
16:02:13 <Kevin_Kofler> glx, texture_from_pixmap etc.
16:02:15 <adamw> just boot up the kde live in a cirrus/vnc vm
16:02:16 <ajax> it's not a property of opengl.  it's a property of Composite.
16:02:37 <ajax> redirected windows can end up with a backing pixmap that doesn't match the X visual.
16:02:49 <ajax> if kde is still compositing even on cirrus, then you'd hit this.
16:03:03 <Kevin_Kofler> It's compositing using XRender.
16:03:07 <ajax> well then there you go.
16:03:22 <ajax> it might manifest differently but i suspect it's the same underlying conceptual bug.
16:03:50 <Kevin_Kofler> I guess we need to test by disabling effects/compositing in KWin entirely.
16:04:05 <adamw> how do i do that?
16:04:15 <rdieter_work> so, we have possible workaround 3, disable desktop effects
16:04:18 <Kevin_Kofler> KDE System Settings, there's a Desktop Effects entry there.
16:04:22 <adamw> ah, got it.
16:04:25 <rdieter_work> adamw: ALT-SHIFT-F12
16:04:31 <rdieter_work> should toggle effects on/off
16:04:53 <Kevin_Kofler> Upstream falls back to XRender when there's no suitable OpenGL (and llvmpipe is not "suitable" under KWin's current definition, due to both corruption issues and performance issues).
16:05:39 <ajax> corruption you'd probably fix if you did the moral equivalent of http://pkgs.fedoraproject.org/gitweb/?p=cogl.git;a=blob;f=cogl-1.8.2-lp-no-framebuffer-blit.patch;h=c6564772d3c45abc5f3db6c8188a801c290c8427;hb=HEAD
16:06:04 <ajax> wouldn't fix the performance bit, of course
16:06:13 <adamw> turning effects off doesn't seem to fix things.
16:06:16 <adamw> anyhow
16:06:23 <ajax> well hell.
16:06:27 <adamw> we don't really need to know how to fix it in this meeting...question is, is it serious enough to be a blocker
16:06:52 <tflink> which gets us back to the question of how common the verious VM settings are
16:07:07 <ajax> qemu defaults to cirrus because they're idiots.
16:07:12 <adamw> well since you seem to bleedin' well always get cirrus/vnc as your default...=)
16:07:51 <adamw> ajax: i've long thought they switched to qxl/spice as default with f16, but tflink says he gets cirrus/vnc as default even for a new 64-bit VM on his f16 machine. so i'm really not sure what the story is there.
16:08:02 <adamw> tflink: also the question of just how bad we think the bug is, to be honest.
16:08:04 <bcl> adamw: I'm -1 blocker. Seems like you can use qxl/spice to get around it.
16:08:20 <Kevin_Kofler> And doesn't the switch to 16 bpp also mean that graphics will look worse than before (even if not corrupted)?
16:08:21 <bcl> I'd be +1 final blocker though.
16:08:21 <rdieter_work> imo, slight corruption no real impact , workarounds available, -1 beta blocker, can NTH later perhaps
16:08:25 <adamw> looks silly? sure. but i've yet to find a symptom which renders the desktop unusable.
16:08:26 <rbergeron> maybe jforbes knows offhand, but....
16:08:31 <ajax> i could probably fix it by getting cirrus to expose 32bpp visuals and papering it over in the shadowfb upload hook
16:08:32 * nirik is -1 blocker, +1 common bugs for sure.
16:08:51 <adamw> oh, of course if someone's on an old enough host, spice/qxl may be unavailable or really buggy.
16:09:01 <adamw> i'm not sure how good the spice/qxl in f15 is. anyone used that combo lately?
16:09:02 <ajax> which is probably not a bad idea anyway
16:09:08 <rbergeron> i would be -1 for beta, but commonbugs... final though maybe
16:09:10 <tflink> adamn: just because it always happens to me, doesn't mean that it always happens for everyone:)
16:09:42 <tflink> I'm probably -1 blocker, +1 commonbugs, +1 nth
16:09:44 <adamw> i'm a slight -1 mainly because it's just not a terrible bug, it looks silly but everything works okay
16:09:51 <adamw> +1 nth +1 commonbugs obviously
16:10:17 <Kevin_Kofler> Well, we have a fix available, revert the change. So let's exercise the NTH with that fix.
16:10:32 <adamw> not really an option, as that would re-open a blocker.
16:10:46 <Kevin_Kofler> Then that blocker needs to be reconsidered, or fixed differently.
16:10:58 <Martix> Kevin_Kofler: +1 for different fix
16:11:11 <adamw> Kevin_Kofler: sure, if we wind up slipping, we have several days to come up with something which fixes both
16:11:17 <bcl> Kevin_Kofler: Remember this is BETA, there's still time to fix it.
16:11:32 <adamw> if we don't slip, though, i'm okay with shipping this, and commonbugsing it.
16:11:47 <nirik> adamw: +1
16:11:51 <tflink> same here - the workaround is pretty easy
16:12:02 <adamw> for the record, not because this is KDE. if we had a KDE bug as bad as 809052, i'd call it blocker.
16:12:10 <bcl> yep.
16:12:19 <Kevin_Kofler> tflink: The same workaround would also fix the other bug which was accepted as a blocker and whose "fix" caused this late regression.
16:12:22 <tflink> yeah, 809052 made shell completely un-usable
16:12:35 <ajax> Kevin_Kofler: i look forward to your patch.
16:13:13 <adamw> propose #agreed 810161 is rejected as a blocker because the impact is minor (it's essentially a cosmetic bug, desktop is still usable), also it only hits one specific virt configuration so it can at least be worked around. accepted as NTH and will be commonbugs'ed if we ship with it
16:13:21 <Martix> ajax: adamw for the record, we have KDE 4.8 Test Day on Tuesday, so #810161 fix would be nice
16:13:28 <rbergeron> ack
16:13:30 <nirik> ack
16:13:30 <bcl> ack
16:13:31 <adamw> Martix: thanks
16:13:38 <tflink> ack
16:13:43 <ajax> Martix: i'll see what i can do.
16:13:52 <Martix> ajax: thank you
16:13:56 <adamw> #agreed 810161 is rejected as a blocker because the impact is minor (it's essentially a cosmetic bug, desktop is still usable), also it only hits one specific virt configuration so it can at least be worked around. accepted as NTH and will be commonbugs'ed if we ship with it
16:14:24 <adamw> #topic https://bugzilla.redhat.com/show_bug.cgi?id=810289
16:14:36 * ajax afk for lunch
16:14:37 <adamw> so, this is annoying - it's the one case bcl and i didn't think of in the free space for upgrade check discussion
16:15:08 <adamw> what happens if you don't have enough space for upgrade but you do have enough space for /usr move? answer: /usr move happens, and so even though anaconda catches that you don't have enough space for upgrade, you're stuck with a basically busted system
16:15:24 <adamw> you can't just go back to 16, as it's been /usrmoved. you have to free up space and do the upgrade.
16:15:38 <adamw> ajax: thanks, i think that's all we needed you for
16:15:59 <Kevin_Kofler> Just yet another example of how UsrMove is broken… :-/
16:16:02 <adamw> bcl: wdyt on this one?
16:16:15 <bcl> adamw: I don't think we check for space on upgrade.
16:16:27 <adamw> "5. usr-move is executed, anaconda will not upgrade your machine, bc. of missing
16:16:27 <adamw> space"
16:16:44 <bcl> And really, we can't know how much space will be needed.
16:17:32 <bcl> oh, sorry, I hadn't noticed he'd filed a bug with details.
16:17:35 <adamw> i guess that could just mean 'it fails', not 'anaconda refuses to try'.
16:17:45 <adamw> it's a bit unclear.
16:18:47 <cpuobsessed> if root and usr are separate partition then if usr free > usr + root?
16:19:37 <Kevin_Kofler> AFAIK, UsrMove actually operates on a copy of the directories, so it does need some additional space.
16:19:39 * wwoods is here if still needed
16:19:39 <adamw> bcl: so we can't really do anything better than hardwire an ugly arbitrary free space check into dracut or something here?
16:19:50 <adamw> wwoods: you will be, we'll be coming to the preupgrade bug later
16:19:54 <bcl> I'd guess that it fails during package install.
16:20:06 <adamw> Kevin_Kofler: apparently it uses hardlinks
16:20:07 <Kevin_Kofler> It could be done with basically no space requirements if it'd MOVE files from one directory into the other without using a copy, but that'd be more risky.
16:20:10 <pjones> Kevin_Kofler: they sell 3TB drives now, you know.
16:20:10 <adamw> Kevin_Kofler: so it doesn't actually need much.
16:20:18 <bcl> adamw: yeah, I don't know what else to do.
16:20:23 <pjones> the hardlink tree isn't exactly what you'd call large.
16:20:29 <adamw> i can't see that there's really much we can _do_ here
16:20:39 <adamw> since usrmove happens in dracut, the free space check needs to be before it
16:20:50 <adamw> and we can hardly go wiring 'check the installed package set and estimate size needed for upgrade' code into dracut...
16:20:58 <nirik> perhaps dracut could grow a 'unupgrade' ?
16:21:00 <Kevin_Kofler> pjones: I know. :-)
16:21:06 <bcl> Kevin_Kofler: I talked to harald and convertfs fails gracefully if it runs out of space.
16:21:10 <nirik> so if the install fails, anaconda could tell it to revert on boot?
16:21:18 <adamw> and it's hard to just set an arbitrary minimum because the disk space needed for a minimal install upgrade is going to be a lot less than for a full install
16:22:16 <bcl> right. that's why I didn't see a problem when I tried it with 1G free.
16:22:27 <Kevin_Kofler> nirik: Only really possible if you save a list of what files were where.
16:22:35 <Kevin_Kofler> Undoing a merge is harder than doing it. ;-)
16:22:36 <nirik> yeah.
16:22:44 <bcl> this isn't a new problem either, it's just that usrmove makes it slightly worse.
16:22:57 <adamw> true. and i guess it is 'slightly'
16:22:58 <pjones> yeah, you're down in the territory where %post space estimation is a problem.
16:23:02 <pjones> i.e. the weeds.
16:23:11 <pjones> %postin rather
16:23:14 <adamw> because if anaconda has no check, when it gets halfway through an upgrade and fails, you likely have a busted system anyhow.
16:23:16 <bcl> right.
16:23:34 <bcl> At best we could test for (1G?) free and prompt to continue.
16:23:48 * adamw is trying to reproduce the reporter's scenario to see exactly how anaconda fails
16:23:55 <adamw> bcl: yeah, it's a bit microsoft-y but can't see what else
16:24:57 <bcl> wwoods: do you have any feeling for metadata overhead on package installs?
16:25:21 <bcl> maybe we could look at the # of installed packages and make a guess.
16:25:39 <adamw> so if we've never had a space check in anaconda anyway, i guess i'm -1 to this
16:26:18 <adamw> because that would mean we've shipped a dozen releases which can explode on low space upgrade and everyone's just said 'boy, that fedora sure is a lil scamp'
16:26:21 <bcl> I'm -1 blocker +1 NTH but we should probably come up with something for final.
16:26:26 <tflink> would you have ended up with a busted system if this happened with earlier releases?
16:26:39 <bcl> yes, depending on where in the package list it failed.
16:26:45 <adamw> tflink: i'm pretty sure 'yes', if the only failure mode anaconda has is 'try the upgrade anyway then explode at some point'.
16:26:55 <adamw> usr move is not making things a whole lot worse, in practical terms.
16:26:58 <tflink> then -1 blocker, +1 NTH
16:28:09 <adamw> i wish my test would finish so i can confirm we don't hit some kind of anaconda check bcl has forgotten about =)
16:29:02 <adamw> well, we can circle back if my test finds something new
16:29:14 <rbergeron> -1 blocker, +1 nth....... unless adam finds something
16:29:21 <rbergeron> bad something
16:29:22 <rbergeron> :)
16:30:04 <adamw> propose #agreed #810289 is rejected as a blocker on the basis that we've never actually checked for low space on upgrade (and hence have implictly never considered it a blocking issue). anaconda will just try to upgrade and explode. so this isn't in fact a regression compared to earlier releases, and we don't have a specific requirement for a low space check in criteria or anywhere else
16:30:46 <bcl> ack
16:31:27 <tflink> ack
16:31:27 <nirik> ack
16:31:38 <adamw> #agreed #810289 is rejected as a blocker on the basis that we've never actually checked for low space on upgrade (and hence have implictly never considered it a blocking issue). anaconda will just try to upgrade and explode. so this isn't in fact a regression compared to earlier releases, and we don't have a specific requirement for a low space check in criteria or anywhere else. accepted as NTH
16:31:57 <adamw> okay, fun time :/
16:31:59 <adamw> #topic https://bugzilla.redhat.com/show_bug.cgi?id=810136
16:32:07 <adamw> so, as we feared, preupgrade is still borked.
16:32:23 <adamw> this is one of those where we've squelched exactly one bug in each tc/rc, and we're not at the end of the chain yet.
16:32:25 * tflink wants to figure out a better way to test this earlier
16:32:31 <adamw> wwoods: floor show
16:32:40 <tflink> we always seem to have problems with preupgrade at the last minute
16:32:45 <adamw> tflink: i at least figured out how you can force preupgrade to pull from some specific location, so it should be possible.
16:33:12 <tflink> and there is a TODO in the preupgrade code to make the location configurable
16:33:13 <adamw> obviously, it's important to know whether this bug will be fixed preupgrade side or anaconda side
16:33:41 <adamw> but even if it's preupgrade side, there's the possibility we'll find ANOTHER bug behind it which is in anaconda
16:34:16 <bcl> repo=hd::/var/ is missing a UUID
16:34:42 <bcl> so that points to preupgrade, not anaconda/dracut
16:35:01 <adamw> so, that would be preupgrade?
16:35:05 <bcl> yes.
16:35:20 <adamw> okay, so i still have that vm running - i can reboot it and hack a UUID in there
16:35:37 <bcl> NO guarantee that it would actually work, but you could try to edit the cmdline and point it to the same drive as the others.
16:35:49 <wwoods> it's not missing
16:35:51 <wwoods> it's not necessary
16:35:56 <bcl> oh?
16:36:11 <wwoods> "stage2=" is supposed to override repo=, so we'd ignore that
16:36:43 <wwoods> "repo=hd::.." is special preupgrade-only notation that means "use whatever device we're upgrading"
16:36:53 <adamw> okay. and changing it doesn't seem to help.
16:36:54 <wwoods> which anaconda understands
16:37:00 <adamw> next idea? :)
16:37:14 * adamw has the VM running and is taking requests
16:37:28 <wwoods> "no suitable images" is the confusing bit
16:38:02 <wwoods> add "rd.debug" to the boot args so you can examine /run/initramfs/init.log
16:38:16 <wwoods> adamw: does /tmp/ks.cfg.done exist?
16:38:43 <wwoods> uh
16:38:47 <wwoods> upgrade --root-device=UUID=None
16:39:00 <wwoods> that's definitely wrong, but outside the scope of the problem you're seeing
16:39:11 <adamw> there's only export.orig in /tmp
16:40:06 <adamw> we're not hitting another case of the parameter split bug here, are we?
16:40:07 <wwoods> adamw: that's.. confusing. I'm gonna set up my own VM so I can swear^Wplay along
16:40:25 <adamw> wwoods: it'll probably take you a while :/
16:40:36 <adamw> oh hey!
16:40:42 <wwoods> adamw: not really, since I just need a squashfs.img in the right place
16:40:48 <adamw> ah k.
16:40:52 <wwoods> adamw: very likely yes, now that you mention it
16:41:02 <adamw> i could give you root login on my machine, then you could access my VM via virt-manager...=)
16:41:39 <adamw> is the parameter split bug fixable with an updates.img (so i could test)?
16:42:02 <bcl> adamw: is preupgrade beta criteria? Or can we insist they just use anaconda?
16:42:09 <adamw> bcl: we put preupgrade in beta
16:42:20 <adamw> in our young, foolish and optimistic phase, no doubt
16:42:35 <bcl> well, really, we need to test sooner and more often.
16:42:38 <adamw> ironically, it's in beta because it's our 'recommended method' for upgrading a running system (as opposed to yum)
16:42:48 <wwoods> adamw: no. although I keep thinking it might be useful to have a way to update initramfs inside initramfs
16:42:49 <adamw> yet it always seems to turn out that we have yum upgrades basically working before preupgrade does. :/
16:42:54 <bcl> right. and we'd like to make it the only one eventually.
16:42:59 <adamw> yo dawg...
16:43:03 <wwoods> but that requires being able to cleanly restart initramfs /init, which we're still working on
16:43:41 * cpuobsessed points out he noticed it was broke in alpha :D
16:43:50 <adamw> cpuobsessed: oh, we knew
16:43:57 <adamw> cpuobsessed: but it was broken for a completely *different* reason
16:44:08 <cpuobsessed> but broken nonetheless
16:44:09 <adamw> cpuobsessed: that's the problem with preupgrade - we can basically fix one bug in it per candidate
16:44:21 <adamw> so we find one bug, fix it, then with the next compose we find the next bug, fix it...etc etc
16:44:38 <cpuobsessed> trudge trudge trudge
16:45:47 <adamw> well, shit, i'm an idiot. for the free space on upgrade test, i carefully installed a system and tweaked it down to 700MB free disk space. an *f17* system. d'oh.
16:46:11 <rbergeron> doh
16:48:27 <adamw> wwoods: any eta?
16:48:37 <adamw> sorry to take up people's time, but it's pretty important we figure out the preupgrade story
16:49:48 <wwoods> adamw: patch is already sent, reviewed, and applied
16:49:53 <wwoods> oh wait blarg
16:50:26 <adamw> i meant, eta on you reproducing and investigatin'
16:51:48 * rbergeron peeks in
16:53:06 <wwoods> adamw: so here's what you can do - boot with 'rd.debug rd.break' and grep 'stage2=' /run/initramfs/init.log
16:53:42 <adamw> roger
16:54:38 <wwoods> if it's missing the path, it's the splitsep bug
16:54:49 <adamw> there's no /run/initramfs/init.log
16:55:01 <wwoods> ...
16:55:35 <adamw> huh, it's like my rd.debug got dropped.
16:56:38 <bcl> try it w/o rd.break, I had some problems with it truncating logs a while back.
16:56:49 <adamw> okay, worked this time
16:56:52 <adamw> i moved the parameters
16:56:53 <bcl> since you're already getting the shell you shouldn't need the break.
16:57:09 <bcl> oh? that's not good.
16:57:17 <adamw> um, i get three results
16:57:49 <adamw> 27parse-anaconda-repo.sh@8(source): getarg stage2=inst.stage2=
16:58:09 <adamw> 27parse-anaconda-repo.sh@8(source): stage2=hd:UUID=blahblah:/upgrades/squashfs.img
16:58:21 <adamw> 27parse-anaconda-repo.sh@12(source): arg=stage2
16:58:35 <adamw> so that's a fail?
17:00:09 <adamw> bcl: it's possible i stuck the parameters on the initrd line first try. i am pretty tired.
17:00:12 <cpuobsessed> adamw: you forgot to rub your tummy and pat your head
17:00:13 <adamw> wwoods: ^^^
17:00:34 <adamw> cpuobsessed: sorry, one hand is typing and the other is holding the breakfast whiskey.
17:00:34 <rbergeron> buddha!
17:00:51 <adamw> in case anyone's wondering, this is the last proposed blocker
17:00:56 <adamw> so we're not blocking discussion of any other bug
17:01:36 <adamw> bcl: should i try the root= style workaround that we've used for the other cases of the getarg breakage?
17:01:51 <bcl> may as well try.
17:02:08 <bcl> though that really isn't a good solution.
17:02:13 <adamw> root=live:hd or what?
17:02:27 <bcl> yeah, the live needs to be there.
17:02:31 <wwoods> no hd
17:02:37 <wwoods> hd: is anaconda-specific
17:02:42 <wwoods> only works with things that start with inst.
17:03:18 <adamw> ...i'm getting mixed messages =)
17:03:39 <wwoods> in fact I might drop it, since it's obvious you're talking about a disk device when you say "/dev/sdb" or "LABEL=funk" or whatever
17:03:49 <wwoods> root=live:DEVICE
17:03:58 <adamw> so, live:UUID=blah
17:04:03 <wwoods> yes. and live: doesn't take paths
17:04:06 <adamw> k
17:04:22 <adamw> i'm cloning the machine first, so i have a 'clean' copy of the problem state.
17:04:35 <adamw> even if this test explodes things.
17:04:41 <wwoods> so unless the image is at /LiveOS/squashfs.img it might not work
17:05:20 <adamw> wwoods: tflink said his workaround for 810005 is "root=live:http://path/to/repo/LiveOS/squashfs.img" ?
17:05:31 <wwoods> that'd work
17:05:51 <wwoods> "root=live:DEVICE rd.live.dir=upgrades" might work too
17:05:58 <wwoods> then you wouldn't need to http-fetch it
17:06:21 <adamw> well i'm just sayin, he passed a path.
17:06:33 <wwoods> that's a URL, not a path
17:06:44 <adamw> well played, sir
17:06:48 <adamw> you win THIS round
17:07:30 <wwoods> live:{http,https,ftp,nfs} get handled differently
17:07:33 <adamw> might i invite you to a round of golf in my space lair?
17:07:55 <adamw> still cloning...
17:07:58 <wwoods> the one with all the death lasers? sounds very un-traplike!
17:08:58 <adamw> oh, yes. it has a big sign that says DEFINITELY NOT A TRAP, and you can't say fairer than that.
17:09:16 <wwoods> is everything in the RC tagged and live in the repos, or do I need to grab some updates to make images?
17:09:40 <adamw> hey, that second one seems to get further.
17:09:48 <tflink> wwoods: everything has been pushed to stable AFAIK, may not be out to the mirrors yet, though
17:09:56 <adamw> now it's trying to get the kickstart from /dev/vda2:/upgrade/ks.cfg...and goodnight.
17:10:19 <adamw> i'm not sure _everything_ is pushed, but everything image-compose-relevant ought to be
17:10:31 <adamw> we pushed anaconda and lorax and forced a tree compose
17:10:57 <adamw> so, yeah: that gets to 'fetching kickstart from /dev/vda2:/upgrade/ks.cfg' and then fails, 'Unable to process initqueue' and shell.
17:11:56 <wwoods> adamw: any /tmp/ks.cfg* exist?
17:12:11 <adamw> yeah, /tmp/ks.cfg and /tmp/ks.cfg.done and /tmp/ks.info .
17:12:17 <adamw> so i guess it actually fails after kickstart.
17:12:29 <wwoods> okay then
17:13:06 <wwoods> try: cd /lib/dracut/hooks/initqueue/finished
17:13:17 <wwoods> . /lib/dracut-lib.sh
17:13:38 <wwoods> for f in *; do . $f; echo $f: $?; done
17:14:12 <wwoods> one or more of those scripts is returning non-zero - figure out which
17:14:57 <adamw> it doesn't like the filenames...
17:15:11 <adamw> the two 'devexists' ones with lots of slashes in them just give 'file not found'
17:16:08 <wwoods> okay. what's in those scripts?
17:16:48 <wwoods> also, "devexists" should be like, devexists-\xXXdev\xXXroot or something. if it's just a bunch of slashes, maybe the device name got lost
17:17:10 <adamw> yeah, it's like that
17:17:32 <adamw> all three seem to return 0, but i fiddled a bit...
17:17:36 <adamw> lemme reboot and retry
17:18:13 <adamw> actually, the x2fdev one gave 1 on that last try
17:18:25 <adamw> yeah, seems to be giving 1 every try now...
17:18:27 <wwoods> what are the contents of those files?
17:18:43 <adamw> the one that fails just has one line
17:18:47 <wwoods> should be something like "[ -e "DEVICE" ] || warn ..
17:18:52 <adamw> [ -e "/dev/mapper/live-rw" ]
17:19:10 <wwoods> and /dev/mapper/live-rw doesn't exist, I take it
17:19:21 <adamw> you would be correct, sir
17:19:30 <wwoods> so probably when you dropped into the shell it said  "/dev/mapper/live-rw does not exist"
17:19:39 <adamw> indeed it did, missed that.
17:19:40 <adamw> sorry.
17:21:21 <adamw> so, big picture, so far we're sure the parameter interpretation bug is the first thing breaking it, but we could theoretically patch preupgrade to work around it, and now we're hitting a different bug?
17:22:08 <wwoods> seems that way
17:22:20 * nirik sighs. a nasty Matryoshka of bugs.
17:23:41 <adamw> any idea what this second bug is?
17:23:55 <adamw> nirik: ooh, you should change preupgrade's Summary line to that. :P
17:23:56 * Kevin_Kofler thinks we should make direct yum the one supported upgrade method and drop all the others.
17:24:00 <Kevin_Kofler> Really it's what works best.
17:24:23 <Kevin_Kofler> We've now been discussing how to fix preupgrade for over an hour.
17:24:35 <Kevin_Kofler> And the DVD upgrades are known for their upgrade path issues.
17:25:01 <Kevin_Kofler> Why do we keep insisting that the best working upgrade method is "unsupported"?
17:25:14 <wwoods> because it has its own host of problems which you are conveniently ignoring.
17:25:29 <wwoods> but hey, support whichever you want
17:25:46 <wwoods> they've all got their own problems
17:26:13 * adamw would like to try and get to the smallest matroshka before we make a final vote/determination
17:26:25 <adamw> so we have some degree of confidence whether we can make preupgrade work with the rc3 images or not
17:28:30 <rbergeron> yesplz
17:31:17 <wwoods> adamw: does 'anaconda_live_root_dir' show up in init.log? if so, what are the args?
17:32:03 <adamw> whoops, didn't boot with rd.debug, just a sec
17:34:14 <adamw> wwoods: nope, no result from grep.
17:35:27 <wwoods> adamw: can you copy the init.log out of there and fpaste it or something?
17:35:37 <wwoods> mount the working disk and copy it in there or something
17:35:47 <adamw> i was already working on it =)
17:35:59 <wwoods> aces
17:37:09 <adamw> well shit. /dev/vda2 is already mounted ro as the install source and i can't mount the other disk as it's an LVM
17:37:09 <adamw> gr
17:38:51 <adamw> okay, a usb stick works. yay dracut.
17:39:29 <adamw> wwoods: http://fpaste.org/peao/
17:41:45 <wwoods> oh. this is when you're doing root=live:UUID...
17:42:25 <wwoods> well, this needs poking at too
17:43:44 <adamw> sorry, you mean it might just be caused by the 'workaround'?
17:45:38 <wwoods> adamw: is the file in /upgrades/squashfs.img on /dev/vda2?
17:46:02 <adamw> /dev/mapper/live-rw ?
17:46:09 <wwoods> no
17:46:30 <wwoods> is /upgrades/squashfs.img the file on /dev/vda2
17:46:34 <wwoods> does that file exist on that partition
17:46:38 <adamw> oh, i see.
17:46:55 <adamw> it's upgrade/squashfs.img
17:46:58 <adamw> not upgrades/squashfs.img
17:47:00 <adamw> but it's there
17:47:06 <wwoods> there's your problem then
17:47:09 <adamw> ah.
17:47:21 <wwoods> rd.live.dir=upgrade, not upgrades
17:47:28 <adamw> d'oh =)
17:47:31 <adamw> shoulda caught that.
17:47:33 <adamw> retrying
17:48:55 <adamw> well, we have stage2
17:48:58 <adamw> but:
17:49:47 <adamw> An error occurred mounting device /dev/vda2 as /boot: mount failed: (32, 'mount: /dev/vda2 is already mounted or /mnt/sys/image/boot busy\n   /dev/vda2 is already mounted on /run/initramfs/live'). This is a fatal error and the install cannot continue.
17:49:52 <adamw> sic
17:50:03 <adamw> it does have that weird cut-off message, that's not a typo
17:50:56 <wwoods> that's probably because of the upgrade line being wrong
17:51:11 <adamw> oh, you mentioned that earlier
17:51:20 <wwoods> change ks.cfg so the upgrade --root-device=DEVICE is correct
17:51:23 <wwoods> and see if that works
17:51:33 <wwoods> that's a preupgrade problem, not an anaconda one
17:51:48 <wwoods> but also: if you can get an init.log from trying to boot with the default args, that'd be helpful
17:51:56 <adamw> so it should be device=UUID=thesameUUIDasonthekernelcmdline ?
17:52:15 <adamw> uh, so an init.log from this state it's in now?
17:52:18 <wwoods> yeah. or whatever device you're trying to upgrade
17:53:14 <wwoods> no, you changed the boot args from what anaconda set - I'm looking for init.log after booting with "repo=hd::... stage2=hd:UUID:...:" etc.
17:53:30 <adamw> oh, without the root= workaround?
17:53:33 <wwoods> yes
17:53:42 <adamw> oh, okay. i thought we already knew the fix for that, but sure
17:53:47 <wwoods> no, we don't
17:53:51 <adamw> ok
17:53:56 <adamw> i thought it was the getparam bug
17:54:18 <wwoods> I don't know. that's why I want th elog.
17:54:34 <wwoods> what I got from you earlier was inconclusive
17:56:24 <adamw> ah ok
17:56:40 <adamw> wwoods: http://fpaste.org/Ayn6/
18:00:29 <adamw> wwoods: so just to double-check, for my Tottering Pile Of Workarounds path, we're up to 'upgrade --root-device=UUID=f9f8667a-5806-41be-a96d-93bbac0334e4' in the ks.cfg, right?
18:00:34 <adamw> that's the correct syntax?
18:00:50 <wwoods> should be
18:01:01 <adamw> k
18:01:52 <wwoods> adamw: root=anaconda-disk:UUID=f9...4e4
18:02:00 <wwoods> no path. ayup, that's the splitsep error
18:02:07 <adamw> ok
18:02:43 <adamw> still get the vda2 error after correcting the upgrade line
18:03:20 <adamw> i can see the 'corrected' line in /run/install/ks.cfg so it looks like it's being picked up
18:04:52 <rbergeron> tottering!
18:05:01 <adamw> i certainly am
18:05:47 <adamw> rbergeron: so, status is: we know what the first bug is, it's 810005, same parameter splitting bug we discussed yesterday
18:06:04 <adamw> once i work around that we can make it to stage2, which promptly explodes trying to mount something wot is already mounted
18:06:22 <adamw> wwoods: /dev/vda2 is mounted at /run/initramfs/live , of course.
18:07:02 <adamw> and anaconda is trying to mount it at /mnt/sysimage/boot .
18:07:15 <wwoods> well yeah, but anaconda is supposed to recognize this situation and not attempt to mount it again
18:07:24 <wwoods> is 'preupgrade' in the boot args?
18:07:37 <adamw> yup.
18:07:56 <adamw> BOOT_IMAGE=/upgrade/vmlinuz preupgrade repo=.......
18:08:04 <adamw> (that's cat /proc/cmdline)
18:10:15 <tflink> btw, let me know if you want something tried again - I have a resettable system ready for (pre)upgrade
18:12:08 <adamw> wwoods: any logs that would help?
18:17:28 * adamw going for a quick shower, brb
18:17:36 <wwoods> adamw: the traceback from that error would be helpful
18:18:14 <adamw> wwoods: it's an anaconda error dialog, it doesn't hit the traceback handler and there's no /tmp/anaconda-tb-*
18:18:20 <adamw> wwoods: is there somewhere else i might find it?
18:18:46 <wwoods> not really. can you tell me what the last step mentioned in anaconda.log is?
18:19:03 <adamw> movoing (1) to step upgrademount
18:19:09 <adamw> upgrademount is a direct step
18:19:30 <wwoods> 's what I figured
18:19:33 <adamw> then it mounts lv_root as /mnt/sysimage, and next tries to mount /dev/vda2 as /mnt/sysimage/boot
18:19:59 <adamw> program.log shows a resize2fs as the last command before the failed mount
18:24:42 <adamw> wwoods: where's the code that's supposed to recognize the preupgrade cmdline parameter? I don't see it
18:26:38 <adamw> feh. i'll go take that shower. =)
18:27:00 <wwoods> adamw: storage.log might be helpful, actually. but yeah I'm still tracing stuff here.
18:29:02 <adamw> i'll get it in a sec
18:36:29 <adamw> wwoods: http://fpaste.org/YVHo
18:36:40 <adamw> could be the use of root= is breaking this, i guess? i'll test with a network sourced root=
18:39:25 <wwoods> that's a pretty good theory actually
18:40:57 <adamw> yeah, with network sourced root it's got further
18:40:58 <rbergeron> theories are good, good theories aee awesome
18:41:06 <rbergeron> :)
18:41:08 <adamw> it's doing packages now
18:41:43 <adamw> kinda hard to tell if it's using the preupgrade-downloaded packages, but eh
18:41:45 <rbergeron> at some point we need to call it one way or the other so people know what to do this weekend :/
18:41:51 <wwoods> well, that's not really a solution though, since you're just dodging the mount
18:41:52 <adamw> rbergeron: we're still diagnosing
18:41:58 <adamw> wwoods: sure, just as a triage
18:42:03 <rbergeron> not pressuring just... saying so nobody can say it wasnt said
18:42:11 <rbergeron> adamw: i know :)
18:42:42 <adamw> well...so right now you actually can do a preupgrade with at least one hacky workaround (possibly two, not sure if the 'fix' to the kickstart is required)
18:43:00 <adamw> and the hacky workaround requires network access for the start of install at least
18:43:48 <adamw> well, maybe a bit early to declare that. it didn't get to bootloader install yet.
18:53:11 * adamw taps fingers
19:01:46 <adamw> wwoods: any new thoughts your end? does it sound like we won't be able to get a working local workaround for rc3?
19:01:53 <adamw> or is there some way we could finesse it?
19:11:13 <adamw> my upgrade is cleanup now
19:11:37 <wwoods> I don't know. I'm trying to figure out how we used to avoid the "/boot is already mounted" thing
19:12:53 <adamw> i can run a 15->16 preupgrade and try to watch what happens, but it might take a while :/
19:14:15 <wwoods> yeah I'm reading the source and trying to bug dlehman about it
19:15:08 * adamw starts cloning his 15 master
19:20:08 <adamw> okay, well, the upgrade with the remote root workaround succeeds
19:20:10 <adamw> so, yay, i guess
19:20:26 <adamw> i should test whether the kickstart fix was needed, though
19:21:16 <adamw> wwoods: are you likely to need anything from the machine that succeeded, with the ks fix and remote root workaround? or can i blow it away now?
19:30:40 <adamw> so, just passing root=live:http://path/to/squashfs.img seems to be good enough as a workaround - the 'error' in the kickstart doesn't seem to break it
19:31:11 <adamw> but that does require a working network connection.
19:31:54 <adamw> wwoods: i'm kicking off an f15 preupgrade now
19:39:02 <wwoods> isn't f15 where we first combined the images, so we had a big install.img?
19:39:40 <adamw> i really don't remember that far back
19:40:00 <adamw> so...you think we have a good enough understanding to make a call now?
19:40:35 * adamw updates bug
19:40:39 <adamw> ping rbergeron
19:40:41 <adamw> ping nirik
19:40:48 <nirik> yes?
19:40:53 * nirik reads up
19:41:14 <wwoods> ayup, F15 is where we combined the images
19:41:45 <rbergeron> hi
19:41:48 <nirik> so, work around, but it needs network... but really you should have net anyhow doing preupgrade, right?
19:41:48 <wwoods> so we gotta go back to F14 to figure this out, and yeah, we used to copy install.img into /tmp and unmount
19:41:51 <wwoods> which is why it used to work
19:41:56 * rbergeron looks for spot
19:42:04 <wwoods> nirik: not during the upgrade, usually, but sure
19:42:13 <nirik> yeah
19:42:20 <wwoods> it's not really workable
19:42:34 <wwoods> you'd need to bring up whatever crazy-ass network the person might be on inside initramfs
19:42:43 <wwoods> which means you have to plug into a wired network to run preupgrade
19:42:52 <wwoods> 'cuz we don't do wireless
19:42:52 <wwoods> etc
19:44:04 <nirik> ;(
19:44:22 * rbergeron reiterates the pouty face
19:44:47 <adamw> oh yeah that's a point
19:44:53 <adamw> you need a *wired* connection
19:45:37 <adamw> #agreed 810136 turns out to be a dupe of 810005, moving discussion
19:45:42 <adamw> #topic https://bugzilla.redhat.com/show_bug.cgi?id=810005
19:46:02 <adamw> okay, so i updated #810005 with a summary of the current state as I grok it: https://bugzilla.redhat.com/show_bug.cgi?id=810005#c6
19:46:20 <adamw> now we need to make a call. please god let's just make a call and i can get out of this room.
19:46:39 <tflink> I don't know, we could debate it a bit longer ...
19:46:43 <adamw> =)
19:47:11 <adamw> what's everyone's feeling on the acceptable-ness of the workaround?
19:47:27 <adamw> it does mean you can't really do preupgrade via wireless :/
19:47:41 <spot> yuck.
19:48:04 <nirik> do we have preupgrade critera? or just 'should work by beta' ?
19:48:14 <wwoods> or VPN, or with proxies, or basically anything except "wired network with DHCP"
19:48:15 <tflink> nirik: it's specifically required
19:48:32 <adamw> just in case everyone's thinking we should fudge it to make the whole validation sprint worth it, i'll stick my neck out and say +1 blocker. really, strictly going by the protocol, this workaround isn't good enough to consider the criterion not breached, imo.
19:48:44 <spot> +1 blocker from me, as much as i hate it
19:48:48 * nirik nods sadly. slip and +1 blocker here too.
19:48:52 <tflink> "The installer must be able to successfully complete an upgrade installation from a clean, fully updated default installation (from any official install medium) of the previous stable Fedora release, either via preupgrade or by booting to the installer manually. The upgraded system must meet all release criteria"
19:48:57 <adamw> yup.
19:49:02 <tflink> yeah, I'm +1 blocker
19:49:24 * nirik plays the sad trombone.
19:49:37 <adamw> spot: it's a pita to slip, but on the plus side, it means we can fix some nasty nths. we're not really at a point of diminishing returns.
19:49:51 <rbergeron> nasty easyfix nths?
19:49:52 <spot> adamw: thats very positive of you. ;)
19:50:10 <adamw> rbergeron: the other paths that hit 810005
19:50:11 <rbergeron> it's because his breakfast whiskey turned into lunch whiskey
19:50:14 <adamw> that NetworkManager crash thingy
19:50:35 <adamw> i think i had something else on my list, i'll have to look for it.
19:50:44 <rbergeron> at two weeks i want to be verrry careful to not introduce new awesome
19:50:48 <adamw> rbergeron: point.
19:50:48 <rbergeron> gnome 3.4?
19:50:53 <adamw> 3.4 was in rc3
19:50:58 <adamw> it worked out fine, shockingly enough. no problem.
19:51:03 <rbergeron> ohhhright
19:51:39 <adamw> propose #agreed 810005 is accepted as a blocker per criterion "The installer must be able to successfully complete an upgrade installation from a clean, fully updated default installation (from any official install medium) of the previous stable Fedora release, either via preupgrade or by booting to the installer manually. The upgraded system must meet all release criteria". the known workaround has too many limitations to be considered 'good enough'
19:51:52 <tflink> ack
19:52:06 <rbergeron> uck
19:52:07 <rbergeron> :)
19:52:12 <rbergeron> but ack
19:52:38 <nirik> ugh ack ugh. ;)
19:52:50 <tflink> we sounds like cavepeople
19:53:23 <adamw> #agreed 810005 is accepted as a blocker per criterion "The installer must be able to successfully complete an upgrade installation from a clean, fully updated default installation (from any official install medium) of the previous stable Fedora release, either via preupgrade or by booting to the installer manually. The upgraded system must meet all release criteria". the known workaround has too many limitations to be considered 'good enough'
19:53:33 <adamw> rberge you want to finish up the bureaucracy?
19:53:38 <adamw> we should make an official no-go call
19:53:50 <rbergeron> well
19:54:00 <rbergeron> blockers = no go but :)
19:54:12 <rbergeron> #topic Go or no-go?
19:54:27 <rbergeron> #info we have a blocker.
19:54:31 <nirik> sadly, no go
19:54:35 <adamw> per our policy, QA votes no-go as there is an open unresolved blocker.
19:55:00 <adamw> rbergeron: oh, on the nth front, it'd be nice to fix that kde corruption.
19:55:18 <rbergeron> #info qa votes no-go, ass do various others from engineering, fesco, pm
19:55:26 <rbergeron> adamw: agreed
19:55:32 <tflink> and pxe boot and the missing livecd fixes
19:55:49 <rbergeron> adamw, wwoods: we do have a fix for 810005?
19:55:52 <adamw> tflink: we'll get pxe boot with the parameter fix
19:56:00 <adamw> rbergeron: i believe there's one on the anaconda list, yeah
19:56:05 <adamw> we'll need a fix for teh second bug too though
19:56:07 <rbergeron> or when can we get to rc4, given holiday weekend and etc
19:56:10 <tflink> adamw: yep, but it'll still be fixed where it doesn't work w/ RC3
19:56:30 <adamw> i'm happy to do the qa crap for it over the weekend, holiday or no, if wwoods can get anaconda fixes in
19:56:40 <adamw> dunno where dgilmore stands
19:56:49 <tflink> oh yeah, there's a holiday this weekend
19:57:07 <nirik> dgilmore is out today I know for sure, not sure if he's around the rest of the weekend or not.
19:57:13 <rbergeron> i mean i'll be around all weekend but i dont know about  others
19:57:16 * nirik should be around mostly if he can help with anything.
19:57:29 <adamw> tflink: missing livecd fixes, you mean the efi thing?
19:57:30 <rbergeron> holiday means kids have candy and crap from grandparents and are fully occupied
19:57:34 <tflink> adamw: yeah
19:57:41 <adamw> yeah, that'd be nice.
19:58:23 <rbergeron> that said i'm not entirely helpful beyond making sad faces and meeting notes and delivering sad news
19:58:27 <adamw> #agreed we are no-go on F17 Beta RC3 sadly
19:58:36 <rbergeron> oh, thx
19:58:55 <rbergeron> #action rbergeron to update schedule.... again
19:59:32 <rbergeron> i can send out a slip mail in prob an hr
19:59:47 <rbergeron> blocker meeting tomorrow?
20:00:02 <tflink> yep, I'll send out the announcement as soon as I see your email re: slip
20:00:02 <rbergeron> or monday in qa meeting or ad hoc as we are hailed :)
20:00:09 <rbergeron> okay
20:00:19 <rbergeron> #action rbergero to send slip mail
20:00:22 * tflink has afeeling that we're in for some lively discussion around the kde 16bpp thing
20:00:38 <rbergeron> #action tflink to do blocker meeting mail following slip notice
20:02:47 <rbergeron> any other loose ends?
20:03:09 <rbergeron> #action qa heroes of awesome to get some rest plz
20:03:10 * adamw is going golfing
20:03:11 <rbergeron> :/
20:03:20 <kalev> nirik: if dgilmore is out today, could you do a stable push with the NTH fixes that were included in RC3?
20:03:23 <rbergeron> #action adamw to get a hole in one
20:03:42 <nirik> kalev: if they have enough karma.
20:03:58 <rbergeron> we may want to cherry pick those - adamw, tflink, spot, nirik, thoughts?
20:03:58 <nirik> kalev: I couldn't get it to push the gnome update (I can check with lmacken on why)
20:04:28 <adamw> cherry pick what? the rc3 packages? yeah, we've been doing that for this release, pushing the stuff that was pulled into the TCs/RCs stable
20:04:39 <tflink> I thought that we got most of them, though
20:04:40 <adamw> we were already working on it last night, just had a problem with the gnome update as nirik says
20:04:53 <kalev> nirik: ah, lmacken did some manual hackery on that update, I bet he knows what the issue might be
20:04:56 <nirik> there's a few others that didn't have enough karma.
20:05:01 <rbergeron> okay
20:05:04 <adamw> we'll get those karma'ed and pushed.
20:05:07 <nirik> kalev: it just ignored me trying to push it. ;(
20:05:13 <adamw> and i'll try and pull together an nth list later.
20:05:20 <adamw> things it's worth fixing.
20:05:47 <rbergeron> adamw: thanks, i assume in a ticket nirik can watch for (or dgilmore)
20:05:58 <adamw> well, just via nth process at first.
20:06:10 <rbergeron> okay
20:06:27 <rbergeron> i'll just be quiet and let you guys do your work and golfing :)
20:07:16 * kalev goes to file a new NTH bug.
20:07:35 <nirik> so, shall we close out this meeting?
20:07:38 <nirik> (finally) ;)
20:07:53 <adamw> yup
20:08:46 <adamw> thanks for the work everyone, esp anaconda team
20:08:56 <adamw> #info we will re-convene tomorrow or monday
20:08:59 <adamw> #endmeeting