Reading the question and the comments, there may be a conceptual misunderstanding : the attenuator WILL attenuate any noise presented on its input (even from just a 50 ohm source impedance), to the same extent it attenuates the signal.
However it also generates noise of its own, which may be represented as the noise from a perfect resistor equal to its own output impedance, and this is added at the output to the (attenuated) input signal and noise. So if input and output Z are both 50 ohms, the net result is attenuated signal + marginally increased noise (i.e. NF = attenuation).
But if its output impedance is lower, the added noise is also lower, thus improving the noise voltage as Andy states.
So represent the attenuator as a perfect attenuator (attenuating noise) in series with a Johnson noise voltage source equal to the output impedance. The rest is just applying the formulae.
EDIT: re: updated question.
(1) There is nothing special about 290K except that it's a realistic temperature for the operation of a passive circuit. The reason they chose it is that the article quotes a noise floor ( -174dBm/Hz) which is correct for a specific temperature : yes, 290k.
(2) While any resistance in the attenuator will contribute noise, I realise that it is not a satisfactory explanation as to why you get the same noise out of an attenuator, because (as Andy says) you could make a capacitive attenuator which is not a Johnson noise generator. So we have to look a little deeper, and remember these noise sources are the statistics of the individual electrons that make up the current.
So, let's say we build a (50 ohm in, 50 ohm out) attenuator, and attempt to cheat Johnson by using a capacitive divider. That implies a node within the attenuator which conducts some of the input current to ground. At this node, we have two current paths; a fraction of the current flows to output, the rest to ground. What determines which path an individual electron will take? Essentially, chance. Collectively? Statistics. So this is a noise source.
Or let's just add series capacitance to provide enough attenuation : we thereby avoid dividing the current flow and eliminate the noise source, right? At the cost of reducing the signal current; our statistics now operate with a smaller sample size and consequently greater variance : more noise.
These results are the best you can do, there is no way round them.
Your problem is combining the voltage sources. This is incorrect, first because you can't add uncorrelated noise to each other, second because we don't even need to do worry about the other resistor's power generation for this problem.
Since we are only looking at the power that one resistor transfers to another, we look only at the voltage it generates and transfers to the other.
simulate this circuit – Schematic created using CircuitLab
Now, we look at the voltage that would appear on the transferred resistor, which would be exactly half.
$$
V_\mathrm{transferred} = \frac{\sqrt{4k_BT \Delta FR}}{2}
$$
Now with power:
$$
P_\mathrm{transferred} = \frac{V^2}{R}
$$
$$
P_\mathrm{transferred} = \frac{4k_BT \Delta FR}{4R}
$$
$$
P_\mathrm{transferred} = k_BT \Delta F
$$
Hope this helps!
Best Answer
If the question was targeting also the susceptibility to external noise, than the answer accepted was not complete. There is much more involved here.
I can recommend an excellent book on the subject, The Circuit Designers Companion.
You will want to read at least the first two chapters, grounding and wiring.
Decoupling capacitors have their role in reducing the electrical noise radiated out by the circuit. Narrow but possibly high current power supply peaks are contained within the small area near the high speed components, instead of pulling the current all the way from the power supply.
However, if the question was also how to prevent the EMI (Electromagnetic Interference) from the outside to play havoc with your circuit, that there are many other factors involved.
One of the most important things you should take care of is the cable and signal routing. The ground references should be kept separated, and if you had several circuits boards their grounds should be connected in s single point (star topology grounding).
High speed or high current lines should be kept separated from the low level signal lines. If such cables (or PCB traces) have to cross their paths, it should be done at right angle, minimizing the length of path running in parallel, and forming a stray capacitance.
Analog and digital inputs should be protected by filter components, and protection diodes. Output components switching high currents with inductive loads should also be protected by schottky diodes and filter components. Very often the software plays important role. For example, some communication protocols can adjust the signal slew rate (signal edge rise / fall time) to reduce the radiated interference.
There are many other measures, besides obvious shielding, keeping the 'electrically dirty' parts away, orienting the transformer so then it does emit it's magnetic field through the low voltage input stages of some sensitive amplifier. Avoiding or at least keeping the signal path loops short and narrow is always a good practice. Some beginners would route the PCB in a way that there is a power supply (or ground) trace around a board, just in case something needed to be connected. It it is fine if this was a true ground plane, but if it is just a wider track then it should be broken at some point, or it will serve as an antenna (both receiving and transmitting noise). I hope you have the picture, this subject is broad and involves much, much more than spreading few capacitors around the board.