For getting started I really liked Toby Segarin's Programming Collective Intelligence. It was my introduction to this area before I went on to produce After the Deadline.
I read that one, I liked the fact that he builds up each example from first principles. It's hard to find explanations that bridge theory and practice.
Good stuff - I added http://szeliski.org/Book/ to my reading list, thanks for sharing the link. There's a huge overlap for certain classes of problems. CV in many ways resembles the same problems with large online data streams, noisy, time critical, huge volumes of data - feature extraction is problematic. Hybrid solutions usually required.
I've found that reading books from other ML domains helps out in understanding the application and getting ideas on how to approach the problem.
As I experienced it, Szeliski's book is better as a reference as it covers lots of material (just see the number of citations at the end). I don't think it's an easy read without reading (some of) the cited papers (or having background knowledge).
I found the Stanford course almost assumed too much of a stats background to make it easily accessible. Starting with the math foundations is sound, but scary for people who don't dream in LaTeX :)
These are really, really basic tools and books. Once you're past this you can get a copy of some good Springer books (e.g. "Recommender Systems Handbook") and follow up on the papers and studies referenced.