In a "purely digital" link where you set an output to "high" and an input the other end of a line is read as "high" then the probability error is purely to do with the SNR of the line. What is the probability that a HIGH can be interpreted as a LOW? By introducing a higher level protocol with error detection and correction you effectively negate most of the SNR errors and the question is now "What is the probability that the protocol cannot correct corrupted bits?"
So yes, the CODEC (or protocol) can be used (and is used) to negate the effects of SNR-induced signal corruption.
As for the second part...
If you assume 1 bit of information is transmitted per quantization level, and 1 bit is received per quantization level, then yes, increasing the quantization level will increase the number of bits sent at any one time. However, the SNR of the transmission medium will then have a greater effect on those now smaller quantization steps, so although you reduce the quantization noise, you now increase the SNR noise.
However, if you don't assume 1 bit per quantization level, but have multiple quantization levels per bit, then you can increase the number of quantization levels and keep the overall bitrate the same, but have more detail about each bit, so can make a better informed decision about what value that bit is.
For instance, you can think of a simple digital link with 2 states (HIGH and LOW) as a 1-bit quantized system. For simplicity we'll call it 1V for HIGH and 0V for low.
Now, you could then have it that anything received >= 0.5V is a HIGH and anything < 0.5V is a LOW. That's 1 bit quantization. 0.5V would be HIGH, but 0.499999999999V would be LOW. That's an infinitesimally small margin for noise.
However, increase the receiving quantization to 2 bits, say, would give you more detail. It would give you 4 voltage levels to consider - 0V, 0.33V, 0.66V and 1V.
You could now say that anything > 0.66V is a HIGH, and anything less than 0.33V is a LOW. You have now introduced a "noise margin". Anything that falls between those values is discarded as noise. The bitrate remains the same, but the overall SNR has fallen.
Then of course you can add a "schmitt trigger" to it (or software equivalent), whereby you toggle the value depending on a transition. When the input rises above 0.66V you see the value as HIGH, and keep it as HIGH. Only when it then drops down below 0.33V do you then switch it to LOW.
For systems where you have discrete voltage levels you could sample them at a higher resolution, and the line-induced noise would occupy the least significant bits of that sampled value. Discarding the noisy bits down to the resolution of the sent data can then reduce the noise in the system. Also taking multiple samples and averaging them, which in effect cancels the random noise out, (known as "oversampling") can reduce the noise as well.
None of those techniques affect the bitrate as such since you're not adding any extra information to the sent values.
Data rate is the speed at which bytes (or chunks of data) are sent down a channel.
The bandwidth is how fast the bits that make up that data are transmitted.
Sampling rate is the frequency at which an incoming signal is read to measure its shape.
Take for example a typical 9600 baud serial connection.
The bandwidth is 9600 bits per second. Each byte, though, has extra bits with it (start, stop, parity, etc). So for a typical 8N1 format there's 10 bits used for every 8 bits sent.
So the data rate for 9600 baud would be 960 bytes per second.
The sampling rate would be the rate the receiver looks at the signal to see if it's a 1 or a 0 - typically at least 2x the bandwidth (see Nyquist-Shannon sampling theorem), so 19.2kHz.
Best Answer
I'll add another answer into the mix.
So $$ \text{bitrate} = \text{number of bits transmitted per second} $$
$$ \implies \text{bitrate} = \text{(Symbols Per Second)} \times \text{(Bits Per Symbol)} $$
$$ \implies \text{bitrate} = f_s \times \log_2 L = 2B \log_2 L $$
The capacity tells us the maximum number of error free bits that can be transmitted over a channel. The capacity of a channel is given by
$$ C = \frac{f_s}{2} \log_2 \left ( 1 + S/N \right) $$
so for a noiseless channel (\$N = 0\$) we have \$C = \infty\$. So in a noiseless channel there is no upper limit on the number of error free bits we can transmit, any bitrate is achievable.