Professional Documents
Culture Documents
Monograph ON Applied Physics - Ii (PH-1007) : BY Anamol Gautam Rajesh Mohni
Monograph ON Applied Physics - Ii (PH-1007) : BY Anamol Gautam Rajesh Mohni
ON
APPLIED PHYSICS - II
(PH-1007)
BY
ANAMOL GAUTAM
RAJESH MOHNI
1
Preface
Anamol Gautam
&
Rajesh Mohni
2
Syllabus
Section A
Crystal Structure: Space lattice, unit cell and translation vector, miller indices, simple
crystal structure, bonding in solids, experimental x-ray diffraction method, and powder
method.
Free electron theory: Elements of classical free electron theory and its limitations,
quantum theory of free electrons, Fermi level, density of states, Fermi-dirac distribution
function , Thermionic emission , Richardson’s equation.
Section B
Band Theory of Solids: Origin of energy bands, Kronig Penney model (qualitative), E-K
diagrams, Brillouin zones, concept of effective mass and holes, classification into metals,
semiconductors and insulators, fermi-energy and its variation with temperature.
Section C
Photoconductivity and Photovolatic: Photoconductivity in insulating crystals, variation
with illumination, effect of traps, application of photoconductivity, photovoltaic cell and
their characteristics.
Section D
Laser: Spontaneous and stimulated emission, laser action, characteristics of laser beam-
concept of coherence, types of laser based on pumping techniques, He-Ne laser ,
Semiconductor laser (simple ideas) with applications.
Fibre Optics: Optical communication: communication through open space optical wave
guides with special reference to propagation of light fibres, numerical aperture, single
mode and multimode fibres, applications
3
CONTENTS PAGE NO.
SECTION A
SECTION B
SECTION C
SECTION D
4
CHAPTER 1
CRYSTAL STRUCTURE
The three states of matter are solids, liquids and gaseous. Solids have used shapes
while the other two states do not have any shape. Solids further exist in two states
crystalline and amorphous. The crystalline solids show periodicity in the arrangement of
atoms while amorphous solids are irregular in shape. The crystalline solids are further
divided into two types single crystal and poly crystalline solids, if the entire solid consists
of only one crystal i.e. the periodicity of the arrangement of atoms extend up to infinity,
the crystal is termed as single crystal. In polycrystalline materials a number of aggregate
crystallites are separated by well defined boundaries. The small crystallites are known as
grains and the boundaries are known as grain boundaries. Materials like rubber, glass etc.
are amorphous. Diamond, rock salt, mica, quartz etc. are crystalline. Many properties of
materials are found to depend upon the structure of the crystal. For this reason in this
chapter we will study basic concepts and simple structures of solids.
Crystallography
The study of the geometric form and physical properties of crystalline solids by
using X-Rays electron beams and neutron beams etc. constitutes the science of
crystallography.
Since in crystalline solids the atoms and molecules are arranged in regular and
periodic manner, so most of the solids are crystalline in nature because energy released
during the formation of ordered structure is more than that released during the formation
of a disordered structure. Thus crystalline is the low energy state and most of the solids
preferred this state.
5
Fig.1 Crystal Structure
To define space lattice, let us consider point O as origin and two vectors a→ and b→,
called fundamental translational vectors in two dimensions. To obtain any point P, we
draw a vector T→ from O to P. Then
T = n1 a→ + n2 b→
where n1 and n2 are integers.
Similarly, in three dimensional lattice
(1)
The application of this operation (1) to any point r→, results in the point r→
(2)
Such translational vectors which produce a translation operation containing
integral coefficients are called primitive translational vectors.
If point r→ is obtained by using equation
Such translation vectors which contain non-integral coefficients are called non-primitive
translation vectors.
6
The generation of a crystal from two dimensional lattice and a basis is as shown
in fig.2. The basis consists of two atoms, represented by O and • having orientation as
shown in fig.2. To obtain the crystal structure the basis is placed on each lattice point
such that the centre of the basis coincides with the lattice point. The number of atoms in a
basis may vary from one to several thousands, but the number of space lattices possible is
only 14. Hence due to different types of basis available, a large number of crystal
structure can be obtained from just fourteen lattices.
7
Consider the unit cells ABCD and EFGH displace these portions towards any
direction we find that only one lattice point can lie in the area of the parallelogram.
Displaced parallelograms are shown by dotted lines and are containing points C and F
respectively. Such unit cells which contain only one lattice point are known as primitive
cell. A primitive cell is also a minimum volume cell.
The other unit cells viz. MNOP and PQRS are not primitive since these contain
more than one lattice points.
Therefore, we conclude that the entire lattice can he derived by repeating a unit
cell by means of translation vectors as edges of the unit cell.
Primitives
The three sides of the unit cell are known as primitives. In fig.4; a, b and c are
called primitives.
TYPES OF LATTICES
Bravais in 1848 introduced the concept of space lattice to describe the structure of
crystals. According to him there are five lattice types or unit cells in two dimensions and
fourteen in three dimensions.
(1) Two dimensional lattice
There are in all five Bravais lattice types in two dimensions, i.e. oblique, square,
hexagonal, rectangular and centred rectangular. These are shown in fig.5(a).
8
Fig. 5(a) Bravais lattice types in two dimensions
9
Fig.5 (b) space lattice and crystal system.
10
From different crystal structures given above we find that
1. In triclinic arrangement
(i) No two of the three axis are at right angle to each other.
(ii) Repetitive intervals along each axis are different.
2. In trigonal (rhombohedral) arrangement
(i) The three axis are equally inclined to each other at angles other than a right angle
(ii) The repetitive intervals along the three axes are equal.
3. In hexagonal arrangement
(i) The three axes in one plane are 120° from each other and the fourth axis normal to this
plane
(ii) The repetitive intervals along the three coplanar axis are equal but the interval along
the fourth axis is unique.
4. In a simple monoclinic arrangement
(i) One of the axes is at right angle to the other two but the other two axis are not at right
angles to each other
(ii) The repetitive intervals along each axis is different.
5. In a simple tetragonal arrangement
(i) The three axes are at right angle to each other
(ii) the repetitive intervals along two axes are equal and the interval in the third direction
is different from that in the other two.
6. In simple cubic arrangement
(i) The three axes are at right angle to each other.
(ii) The repetitive intervals along the three axis are equal.
7. In Orthorhombic arrangement
(i) The three crystal axes are at tight angle to each other
(ii) The repetitive interval along each axis is different,
11
Fig. 6 Crystal directions
(i) Find the intercepts of the plane along the a, b and c axis in other words determine the
coordinates of the intercepts
12
Important features of Miller indices
(1) In cubic crystals the direction [hkl] is perpendicular to the plane (hkl), but this
is not true in other crystal systems.
(2) When Miller indices are of double digit, these are separated by commas e.g.
(5, 12, 15).
(3) If a plane is parallel to a coordinate axis then the intercept of the plane is
infinity and hence index is zero.
(4) If a plane passes through origin then the Miller indices are defined for another
plane parallel to this plane and making non-zero intercepts since all the parallel equally
spaced planes have the same index numbers (hkl).
(5) The planes (2 2 6) and (1 1 3) represent the same plane i.e. ratio of the indices
is only important,
(6) The crystall directions of a family < hkl > need not be parallel to each other
similar is the case with members of a family of planes {hkl} some important planes with
their Miller indices are shown in fig.8
13
DISTANCE BETWEEN LATTICE PLANES IN A CUBIC CRYSTAL
Let oa, ob and oc are mutually perpendicular lines and form the Cartesian system
of coordinates as shown in fig.9. Consider two planes parallel to each other one passing
through origin (not shown) and other plane parallel to this plane is ABC. If this plane is
represented by Miller indices (hkl), then the coordinates of A, B and C will be (a/h,0,0),
(0,b/k,0) and (0, 0, c/l) respectively.
Let ON = d be the distance between the two planes. Now equation of plane ABC
in intercept form can be written as
(3)
and perpendicular distance of a point O (0, 0,0) from this plane is given as
we have, used the formula that perpendicular distance of point (x1, y1, z1 ) from plane
(4)
Hence distance between two parallel lattice planes is
(5)
Alternative method
Let α, β and γ be the angles made by ON with oa, ob and oc axes respectively.
Hence we can write
(6)
From fig. it is clear that
Now
14
(7)
Hence from eq. (6) & (7) we get
(8)
(9)
(10)
CRYSTAL STRUCTURE
Face-centered Cubic or Cubic Close-packed Structure
The above close packing of cubic symmetry leads to face-centred cube. Here also
the fraction of the total volume filled is 0.74. In this case the first two planes go in the AB
sequence; the third however is displaced relative to both A and B into positions C. A
fourth plane has the A position and the repetition cycle ABCABC……is established. A
unit cell of this crystal structure is shown in fig.10 (a).
This cell is not the smallest possible cell, but it does have one convenient feature ;
the cell is a cube. An atom occupies each corner of the cube and the centre of each face ;
hence the name face-centred cube (fcc). This structure is symmetrical with respect to
rotation by 900 about any one of the cube edges. The more conventional way to draw this
cube is pictured in fig.10(b). The face-centered cubic structure is also close-packed, each
atom having 12 nearest neighbours.
12 is the maximum value of the coordination number in a structure of identical
atoms. In other words no structure with coordination number greater than 12 is possible.
Packing fraction in this ease is also 0.74 as proved below.
(a) (b)
Fig. 10
15
The atoms are in contact along the diagonal of the faces. Let R be the radius of
sphere, then diagonal of the face
= 2R + R+ R = 4R.
Number of spheres in one unit cell =6/2 + 8 * 1/8 = 4
= 0.74
Since fraction of the total volume filled is the same in hcp and fcc structures
hence these are the closest packed structures.
The atoms are in contact along the diagonal of the faces. Elements having face-
centred cubic structure are Cu, Ag, Au, Ca, Al, Pb and Pt.
16
Say BC = a
Now from triangle ABD
17
Diamond Structure
The diamond structure is a combination of two inter-penetrating face-centred
cubic sublattices. The unit cell is sketched in fig.14. One sub-lattice has its origin at the
point 0, 0,0, the other at a point one quarter of the way along the body diagonal (at the
point a/4, a/4, a/4). The diamond cubic structure is loosely packed, since each atom has
only four nearest neighbours.
Number of spheres in one unit cell =8.
Now the diagonal is 8R where R is radius of sphere. In diagonal there are two half
spheres.
If ‘a’ is a side of the cube then diagonal will be √3 a.
BONDING IN SOLIDS
The arrangement of atoms in any solid material is determined by the character,
strength and directionality of the binding forces, cohesive forces or chemical bonds.
These bonds are made of attractive and repulsive forces that tend to hold atoms (i.e., ions,
radicals, molecules) or atomic units at a different spacing such that the opposite forces
just balance; and the process of holding them is known as bonding.
Since the particular bonding pattern within a material plays a large part in
determining the physical, chemical, and electrical properties of that materials, the
materials engineer must possess a sound working knowledge of the types of bonding
found in engineering materials.
18
(11)
where α is the proportionality constant or attraction and β for repulsion. The solid curve
in Fig.14(a) show the potential energy.
The forces between the atoms can be derived directly from Eq.11 by recalling that the
force is the derivative of potential energy. This is expressed as:
(12)
The net force is represented by the solid curve in Fig.14(b).
At large separations, the atoms do not interact with each other, so that V=O and
F=O. As the atoms approach each other, they exert attractive forces on each other,
primarily because of the positive and negative charges in the atoms. When the atoms are
separated by only a few atomic diameters, the repulsive forces between the like charges
of the nuclei, however, start to assert themselves although somewhat more slowly. At
some separation, called the equilibrium separation, re, the forces of attraction just equal,
the forces of repulsion, and the potential energy is at a minimum.
The forces of repulsion are the result of interpenetration of outer electron shells
between atoms, ions, and molecules when they approach one another at interatomic or
intermolecular distances. The forces of attraction are due to the interaction between outer
electrons of the two atoms, resulting in the formation of a sufficiently stable aggregate
which can be considered as an independent molecular species.
19
The attractive forces in primary bonds are directly associated with the valence
electrons. The outershell, which contains the valence electrons, is in a high energy state
and is relatively unstable. If it can acquire more electrons to bring the total up to eight, or
lose all its electrons to another atom, it can become stable. This is how atomic or primary
bonds are formed. In fact, each is a direct consequence of the exchange or sharing of
valence electrons which lie in the s-and p- orbitals. In other worth, a primary bond is
formed only if there is a pronounced lowering of the energies of the electrons involved
during the formation of the bond. Since a primary bond does involve a substantial
lowering of the energies of the bond electrons, it follows that such a bond must be
stronger and more stable than secondary bonds.
There are three principal types of primary bonds ionic, covalent, and metallic.
These bonds are distinguished on the basis of the positions assumed by the bond electrons
during the formation of the bond. vander Wads bonds and hydrogen bonds are typical
examples of secondary bonds and they result from intermolecular (dipole) attractions.
The natures of the first four types of bonding in solids are illustrated in Fig.15
Few materials have pure bonds of one type or the other. In fact mixed bonds are mostly
found in common materials.
20
BOND ENERGIES
The bonding energy or the cohesive energy is defined as the energy of formation
of one mole of a substance from its atoms or ions. It is but opposite in sign to the energy
of dissociation of the substance. The bond energy can be calculated as the energy of the
atoms or ions at the equilibrium spacing in the crystal structure, using the state of infinite
separation of the atoms or ions as the zero of potential.
The strength of a bond is best measured by the energy required to break it, that is,
the amount of heat which must be supplied to vaporize the solid (infinite separation) and
hence separate the constituent atoms. If we measure the heat of vaporization of one
kilogram mole this must be the energy used in breaking N bonds (N0 =Avogadro’s
number) Figures derived in this way are tabulated in Table 2 together with the type of
bonding for the solid concerned. It is clearly seen from this table that primary bonds are
stronger than secondary ones as referred to earlier.
The melting points and the boiling points of the elements are dependent on the
strength of the bond. Generally, the stronger the bond, the higher are the melting and
boiling points.
21
IONIC BOND
An ionic bond is rarely the attractive force existing between a positive and a
negative ion when they are brought into close proximity. These ions, of course, are
formed when the atoms involved lose or gain electrons in order to stabilize their outer
shell electron configuration. Elements are classified as either electropositive or
electronegative, depending upon whether they tend to lose or gain electrons in order to
achieve this stable outer-shell electron configuration. The definition of ionization
potential provides that ionization energies or potentials measure the amounts of energy
required to remove the successive outer electrons from an atom. The first I.P. of various
elements listed in Table 3 shows that, in general, it requires less energy to strip one
electron from a metal atom than from a nonmetallic atom. This rule also applies to the
removal of the second and subsequent electron in most cases. Thus, metals are termed
electropositive while nonmetals are electronegative, these terms referring to the abilities
of the elements to form positive and negative ions respectively under favorable
conditions during them chemical reactions.
Let us consider the combination which takes place between the metal sodium and
nonmetal chlorine to form sodium chloride (common salt). The sodium atom has a single
electron in its outer shell and this transfers to join the seven electrons in the outer shell of
the chlorine atom as illustrated in Fig.16. This type of atomic interaction, involving the
outright transfer of one electron from one atom to another, leads to the formation of ions
which are held together by electrostatic attraction. Because of the electrostatic nature of
the binding force, the bond between atoms is said to be ionic or electrovalent, and the
valence exhibited is said to an electrovalence.
Besides this attraction, the process of adding an extra electron to the chlorine
atom is accomplished by an appreciable decrease in the potential energy.
Ionic bonds are nondirectional, so that in a crystal, a cation tends to surround
itself by as many anions as possible and vice versa. The Coulomb term for a crystal i then
dependent on the mutual attractive as well as repulsive interaction or a given ion with all
the other ions in the crystal.
The major requirement in an ionically bonded material is that the number of
positive charges equals the number of negative charges. Thus, sodium chloride has a
22
composition NaCI. Magnesium chloride has composition of MgCI2, because the
magnesium atom can supply two electrons from its valence shell but a chlorine atom can
accept only one. Other solids having ionic bonding are potassium oxide (K2O) and
lithium hydride (LiH).
Yet other examples of ionic bonding occur in the compounds: cupric oxide
(CuO), chromous oxide (CrO2) and molybdenum fluoride (MoF2) showing that metallic
element need not be from Group I or II of the periodic table but that any metal may
readily become ionized by losing its valence electrons.
COVALENT BOND
A covalent, homo polar or electron-pair bond is formed when pairs of electrons
are shared by several atoms (rather than by electron transfer to make two different ions)
and have their energies lowered as a result of this sharing. Stable covalent bonds are
formed between many nonmetallic elements since the atoms of these elements usually
possess half-filled outer electron shells which resist the direct electron transfer required
for the formation of an ionic bond. An excellent example of covalent bonding is found in
the chlorine molecule. Here, the outer shell of each atom possesses seven electrons. Each
chlorine atom would like to gain an electron, and thus forms a stable, octet. This can be
done by sharing of two electrons between pairs of chlorine atoms, thereby producing
stable diatomic molecules. In other words, each atom contributes one electron for the
sharing process.
The natures of the covalent bonds in the molecules of chlorine, nitrogen, and
hydrogen fluoride are illustrated in Fig.17. All molecules with halogen atoms (e.g., Cl, F
Br, I, etc.) exhibit the same phenomenon.
In line with modern ideas involving probability distribution of electrons, the
covalent bond maybe considered to be a region of high elctron density existing between
two atoms. This can be best demonstrated in terms of hydrogen molecule (Fig.18). When
two hydrogen atoms are very far apart, they do not interact and the lone electron of the
atom stay in their respective 1s ground states. When the atoms are brought together, the
two electron-density-distribution clouds start to overlap. The electron of one atom comes
under the influence of the positive charge of the nucleus of the other and vice versa. This
results in a change in the electron density distribution around the two nuclei. The
23
quantum-mechanical calculation is roughly equivalent to adding these two atomic
orbitals (AO) to give the bonding, that is molecular orbital (MO). With the formation of
the molecule, the density of electron distribution, which is a measure of the probability of
finding an electron in any spherical shell, increases at points between the nuclei along the
internuclear axis. In other words, the electrons spend more time between the two nuclei
where they are commonly shared by the positive charge. The Fig.19 illustrates another
covalent bond in a chlorine molecule seen as a region of high electron density between
the two chlorine atoms.
24
Since pairs of electrons are shared in covalent bonds there is some degree of
orbital overlap between the bonded atoms (Fig.18). This overlap is limited by the
electrostatic repulsion of the positively charged nuclei, the resultant shapes of the bond
orbitals determining whether the bond will be directional or nondirectional in space with
respect to the nuclei (or ion cores). The greater the degree of orbital overlap the stronger
the bond, since extensive overlap means greatly lowered energy levels of the bond
electrons.
In atoms that have certain energy levels close together; these levels sometimes
interact as a preliminary to the formation of molecule with other atoms and exhibit a
bonding phenomenon known as hybridization, which is an important feature of the
molecular-orbital theory. The molecular orbitals of methane, diamond, graphite, ethylene
having carbon in each of them illustrate hybridization and directional bonding.
The covalent bond found in a wide variety of materials since it can be formed
between atoms of the same or different elements. Stable covalent bonds are formed in the
solid state primarily between. Non-metallic atoms like nitrogen, oxygen, carbon and
chlorine families. Other elements like silicon, germanium, arsenic, and selenium form
bonds that are partly covalent, partly metallic; the transition metals (Columns IIIb-VIIb in
the periodic table) are also thought to have a certain amount of covalent character to their
bonds. They also include many of the organic compounds such as benzene, alcohol,
turpentine, chloroform, and members of the paraffin series which contain mainly the
elements carbon, hydrogen and oxygen. An important example of covalent bonding is
found in polymeric materials which include rubbery elastomers, high-strength fibres, and
a variety of structural materials denoted by the term plastics.
Generally speaking, the number of covalent bonds that can be formed by an
element is determined by the number of electrons that can be added to the valence shell
without exceeding 8. According to this rule the maximum number of covalent bonds is
(8-N),where N is the number of valence electrons.
A covalent bonding is very strong so that the cohesive energy of a solid such as
diamond, which is entirely carbon, very large and in agreement with this fact valency
solids are usually very hard and have high melting points.
The strength of individual bonds in covalent molecules depends on such factors as
degree of ionic character, bond order (single bond, double bond, etc., indicated by the
sign —), and resonance of bonds between alternative positions. Table.4 gives some
examples of experimentally determined bond energies for neighbouring atom pairs in
typical molecules. Using these values one can estimate the total bonding energy of
materials such as polymers and judge their relative stabilities.
METALLIC BOND
As its name signifies, metallic bonding is the principal force holding together the
atoms of a metal. This type of bond is characteristic of the elements having small number
of valence electrons, which are loosely held; so that they can be easily released to the
common pool. The bonding results when each of the atoms of the metal contributes its
valence electrons to the formation of an electron cloud (frequently referred to as an
‘electron gas’) that pervades the solid metal. The electrons are pictured not as definite
points but as a “cloud” whose density represents the probability that electrons will be
found at the given position in the atom. It is seen that the ten innermost core electrons
25
tend to be concentrated in the vicinity of the nucleus, while two valence electrons have
their most probable positions near the surface of the atom.
At the simplest level, in metallic bond the metal ions are bound together in a solid
metal by the interaction of all the valence electrons—the electron cloud or gas that may
be considered to move freely. A schematic picture of the metal ions and that of the
electron cloud is illustrated in Fig.20. Valence electrons occupy a definite energy level
and move about freely in the crystal structure of the metal. Therefore, a metal crystal
should be considered to consist of positively charged ions, and surrounded by a cloud of
free electron. Since the negative electron cloud surrounds each of the positive ions that
make up the crystal structure, strong electrical attraction holds the metal together.
A metallic bond, thus conceived can exist only between a large aggregate of
metallic atoms and must therefore be nondirectional. This type of bond causes a
significant lowering of the energy levels of the electrons involved since they are in close
proximity to several ion cores at all times. Another view of the metallic bond is to regard
it as a covalent bond brought about by atoms in close proximity alternately sharing and
losing electrons in their outer orbitals.
In general, the fewer valence electrons an atom has and the more loosely they are
held, the more metallic the bonding. However, the transition metals, such as ‘iron, nickel,
and tungsten exhibit a fair degree of covalence due in part to their incomplete “d”
electron orbitals. A special characteristic of metal bonding is the fact that every positive
26
23ion is equivalent. Thus, the metal can behave in a definite ductile fashion under stress
because a group of positive ions break their bonding at one location, slip to a new
position and establish their bonds.
This picture of the metallic state is essentially that proposed in the classical free
electron theory of Drude and Lorentz, formulated, before the advent of quantum
mechanics. More recent work has substantially modified the details of the above model
but the main assumption, i.e., the idea of “free” electrons in the metal is still useful.
COMPARISON OF BONDS
The comparision between various bonds are explained hereunder
Ionic And Metallic Bonds
In ionic bonding, a metallic element losses from the outer electron shell of its
atom a number of electrons equal to its numerical valency. These lost electrons are
transferred to the other electron shells of the nonmetallic atom or atoms with which the
metal is combining. In this a complete octet of electrons is left behind in the metallic
particle and completed in the nonmetallic particle. Thus, ionic bond results from an
electrostatic attraction between positive and negative ions which are derived from the free
atoms by the loss or gain of electrons, ideally, ionic bonds are formed by combining a
highly electropositive atom (metal atom which tends to lose electrons and become
positive ion) with electronegative atom (nonmetallic atom which tends to acquire
electrons and become negative ions). The bond itself is nondirectional since each positive
ion attracts all neighbouring negative ions and vice-versa; The bond in sodium chloride is
one of the good examples of ionic bond.
In metallic bonding atoms of the same metal loose their valence electron to form
an electron “cloud” throughout the space occupied by the metal atoms. The valence
electrons lie much further from the nucleus than do those in non metals. Thus the wide
orbits of the valence elections allow them to pass to regions remote from their parent
atoms. Likewise, they never associate with any one atom in the metal but drift through
the whole assembly as a kind of electron gas. Only the valence electrons behave in this
manner, for electrons in the inner shell lie much closer to the nucleus and are unable to
move outside its field of influence. Consequently, as far as the inner electrons are
concerned, an atom of a metal is much the same as an isolated atom, because these
electrons are held firmly within the atom and remain in their characteristic quantum shell.
27
CHAPTER 2
X-RAY DIFFRACTION
X-RAY DIFFRACTION
X-rays are electromagnetic waves like ordinary light, therefore, they should
exhibit interference and diffraction. The wavelength of x-rays is of the order of 0.1 nm,
so that ordinary devices such as ruled diffraction gratings do not produce observable
effects with x-rays. In 1912, German physicist Laue suggested that a crystal which
consisted of a three-dimensional array of regularly spaced atoms could serve the purpose
of a grating. The crystal differs from the ordinary grating in the sense that the diffracting
centers in the crystal are not in one plane. Hence the crystal acts as a space grating rather
than a plane grating.
On the suggestion of Laue, his associates, Friedrich and Knipping succeeded in
diffracting x-rays by passing them through a thin crystal of zinc blende. The diffraction
pattern obtained consists of a central spot and a series of spots arranged in a definite
pattern around the central spot. This symmetrical pattern of spots is known as Laue
pattern and it proves that x-rays are electromagnetic radiation. A simple interpretation of
the diffraction pattern was given by W.L. Bragg. According to him, the spots are
produced due to the reflection of some of the incident x-rays from the various sets of
parallel crystal planes (called Bragg’s planes) which contain a large number of atoms.
BRAGG’S LAW
Consider a ray PA reflected at atom A in the direction AR from plane 1 and
another ray QB reflected at another atom B in the direction BS. Now from the atom A,
draw two perpendiculars AC and AD on QB and BS respectively. The two reflected rays
will be in phase or out of phase depending on the path difference. When the path
difference (CB + BD) is a whole wavelength λ , or multiple of whole wavelength (nλ),
then the two rays will reinforce each other and produce an intense spot. Thus condition of
reinforcement is:
CB+BD=nλ
From fig.1 CB = BD = d sinθ
where, θ is the angle between the incident ray and the planes of reflection (glancing
angle). Therefore,
2d sinθ=nλ (1)
where d is the interplanar spacing of planes and n = 1,2, 3 ... stand for first order, second
order, third order... maxima respectively. Equation (1) is known as Bragg’s law. Different
directions in which intense reflections will be produced can be obtained by giving
different values to θ; i.e.,
for first maximum, sinθ1=λ/2d
for second maximum, sinθ2 =2λ/2d
for third maximum, sinθ3=3λ /2d and so on.
28
Fig.1Reflection of x-rays from lattice planes in a crystal
29
Fig. 2 Bragg’s X-ray spectrometer
30
Fig. 3 Variation of ionization current with glancing angle.
This procedure may also be followed for other crystal systems such as tetragonal,
hexagonal, etc. Table A gives the values of ratios between interplanar distance of the
planes (100), (110) and (111) for the cubic structures.
Thus we see that sodium chloride crystal has a face centered cubic structure.
31
on the axis of a cylindrical camera. The photographic film is mounted round the inner
surface of the camera, covering nearly the whole circumference in order to receive the
beams diffracted up to 1800.
The powder specimen of the crystal can be imaginated to be a collection of
random oriented tiny crystals, presenting all values of glancing angles to the incident
beam. For a given wave length and a given value of d, there can be only one value of θ
(glancing angle) which satisfies the equation 2d sinθ= nλ where n = 1. Such reflected
beams emerge out from the specimen in all directions inclined at an angle 2θ with the
direction of the incident beam, because millions of tiny crystals in the specimen are
random oriented. The reflected rays will be on the surface of a cone, vertex at the
specimen, base on the photographic film and having a semi-vertical angle 2 θ. The traces
obtained on the photographic film will be as in Fig. 5.
Let L be the radius of the cylindrical camera. The direct beam strikes the film at θ.
Suppose a spectrum with glancing angle θ is found at A which is at a distance R from θ .
Then θ = R/2L. Using the value θ in the Bragg’s equation and knowing the value of λ, d
(spacing of the plane involved) can be calculated. The powder method has been
employed in the study of micro crystalline substances like metals, alloys, carbon,
fluorescent powders and other forms where single crystals are not available.
ROTATING CRYSTAL METHOD
In this method a small and well-formed single crystal is mounted perpendicular to
the beam. The film is generally held in a cylindrical camera. Now when the crystal is
rotated slowly, and during the time of exposure, successive planes pass through
orientation, which are necessary for Bragg’s reflection, each producing a spot on the film.
The position on the film when developed indicates orientation of the crystal at which the
spot was formed. The data obtained from these spots give information about the structure
of ordinary and complex molecules.
32
LAUE’S METHOD
In this method a single crystal is held stationary on a goniometer so that θ remains
fixed. A beam of x rays of continuous wavelength is incident on this single crystal. The
crystal specimen have the dimensions of 1mm * 1mm * 1 mm and the beam x rays have
their wavelength varying from 0.2 to 2Å. A pinhole arrangement produces a well
collimated beam. The two flat films receives the diffracted beams. One receives the
transmitted and the other film receives the reflected x-rays. The arrangement is shown in
fig 7 (a). The diffraction pattern obtained on the films consists of a series of spots, fig.7
(b). The pattern will show the symmetry of the crystal. This method is widely used to
orient crystals for solid state experiments.
33
CHAPTER 3
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
Temperature
53
CHAPTER 4
54
Fig.1. Schematic electron occupancy of allowed energy bands for an insulator, metal,
semimetal, and semiconductor. The vertical extent of the boxes indicates the allowed
energy regions; the shaded areas indicate the regions filled with electrons, In a
semimetal (such as bismuth) one band is almost filled and another band is nearly
empty at absolute zero, but a pure semiconductor (such as silicon) becomes an
insulator at absolute zero. The left of the two semiconductors shown is at a finite
temperature, with carriers excited thermally. The other semiconductor is electron-
deficient because of impurities.
(a)
where, for periodic boundary conditions over a cube of side L,
(b)
The free electron wave functions are of the form
(c)
they represent running waves and carry momentum p = ħk.
The band structure of a crystal can often be explained by the nearly free electron
model for which the band electrons are treated as perturbed only weakly by the periodic
potential of the ion cores. This model answers almost all the qualitative questions about
the behavior of electrons in metals.
We know that Bragg reflection is a characteristic feature of wave propagation in
crystals Bragg reflection of electron waves in crystals is the cause of energy gaps. (At
Bragg reflection wavelike solutions of the Schrodinger equation do not exist, as in Fig.
2.) These energy gaps are of decisive, significance in determining whether a solid is an
insulator or a conductor.
We explain physically the origin of energy gaps in the simple problem of a linear solid
of lattice constant a. The low energy portions of the band structure are shown
qualitatively in Fig. 2, in (a) for entirely free electrons and in (b) for electrons that are
nearly free, but with an energy gap at k = ± π/a. The Bragg condition (k + G)2 = k2 for
diffraction of a wave of wavevector k becomes in one dimension
(d)
55
where G= 2πn/a is a reciprocal lattice vector and n is an integer. The first reflections
and the first energy gap occur at k ± π/a. The region in k space between -π/a and π/a is
the first Brillonin zone of this lattice. Other energy gaps occur for other values of the
integer n.
The wavefunctions at k = ± π/a are not the traveling waves exp(iπx/a) or
exp(-πx/a) of free electrons. At these special values of k the wavefunctions are made up
of equal parts of waves traveling to the right and to the left. When the Bragg reflection
condition k = ±π/a is satisfied by the wavevector, a wave traveling to the right is Bragg-
reflected to travel to the left, and vice versa. Each subsequent Bragg reflection will
reverse the direction of travel of the wave. A wave that travels neither to the right nor to
the left is a standing wave : it doesn’t go anywhere.
The time-independent state is represented by standing waves, We can form two
different standing waves from the two traveling waves exp( ±iπx/a), namely
(e)
The standing waves are labeled (+) or (-) according to whether or not they change sign
when -x is substituted for x. Both standing waves are composed of equal parts of right-
and left-directed traveling waves.
Fig.2.(a) Plot of energy ε versus wavevector k for a free electron. (b) Plot of energy
versus waveveetor for an electron in a monatomic linear lattice of lattice constant a,
The energy gap Eg shown is associated with the first Bragg reflection at k ± π/a; other
gaps are found at ± nπ/a, for integral values of n.
This function piles up electrons (negative charge) on the positive ions centered
x = 0 a, 2a, . . . in Fig. 3, where the potential energy is lowest.
Fig.(3a) pictures the variation of the electrostatic potential energy of a conduction
electron in the field of the positive ion cores. The ion cores bear a net positive charge
because the atoms are ionized in the metal, with the valence electrons taken off to form
56
the conduction band, The potential energy of an electron in the field of a positive ion is
negative, so that the force between them is attractive.
For the other standing wave ψ (-) the probability density is
which concentrates electrons away from the ion cores. In Fig.. 3b we show the
electron concentration for the standing waves ψ(+), ψ(-), and for the traveling wave.
When we calculate the average or expectation values of the potential energy over
these three charge distributions, we find that the potential energy of ρ(+) is lower than
that of the traveling wave, whereas the potential energy of ρ(-) is higher than the
traveling wave. We have an energy gap of the width Eg if the energies of ρ(-) and ρ(+)
differ by Eg. Just below the gap at points A in Fig. 2 the wavefunction is ψ(+), and just
above the gap at points B the wavefunction is ψ(-).
Fig.3 (a) Variation of potential energy of a conduction electron in the field of the ion
cores of a linear lattice. (b) Distribution of probability density ρ in the lattice for |ψ(-)|2
ά sin2πx/a ; |ψ(+)|2 ά cos2πx/a; and for a traveling wave. The wavefunction ψ(+) piles
up electronic charge on the cores of the positive ions, thereby lowering the potential
energy in comparison with the average potential energy seen by a traveling wave. The
wavefunctlon ψ(-) piles up charge in the region between the ions, thereby raising the
potential energy in comparison with that seen by a traveling wave, This figure is the
key to understanding the origin of the energy gap.
2.Kronig-Penney Model
The essential feature of the behaviour of electronic potential may be studied by
considering a periodic rectangular well structure in one dimension, which was first
discussed by Kronig and Penney in 1931. It is assumed here that the potential energy of
an electron, when it moves in one dimensional perfect crystal lattice, is represented in
57
the form of rectangular walls as shown in the fig.4. The period of the potential is (a +b).
In regions where 0 < x < a, the potential energy is assumed to be zero and in the regions
-b< x < 0, the potential energy is V. Though this is a highly artificial model but it is a
model for which the Schrodinger equation can be solved explicitly in terms of
elementary functions. The Schrodinger equation for the two region can be written as
Fig. 4
(1)
And (2)
Let the energy E of the electrons under consideration be less than V0, then let us put
(3)
Where α and β are real quantities. The two equations then become
(4)
(5)
Now from the Bloch theorem the solution of wave equation for a periodic potential will
be of the form of a plane wave modulated with the periodicity of the lattice of the form
(6)
where Uk (x) is the periodic function in x with period (a + b).
Since
So,
58
(7)
(8)
Now to solve these differential equations assume the solution of the form
So that
So that
(9)
Similar, (10)
where A, B, C, D are constants. These constants are so chosen that they satisfy the
conditions of continuity and periodicity of wave functions. Now because of continuity
at x = 0 the two wave-functions and their derivatives must have the same value at x = 0,
(11)
And (12)
and because of periodicity of function U (x), the values at x = a must be equal to those
at x = -b.
or
(13)
(14)
Now these four equations (11) to (14) have a solution only if the determinant of the
coefficients of A, B, C, D vanishes, i.e.,
59
This determinant on solving leads to the following equation
(15)
In order to express the relation in a more simplified form Kronig and Penney
suggested delta functions such that V0 →∞ and b→0 but the product V0b or β2b
remains finite. Such a function is known as delta function.. Within this limit equation
(15) reduces to
(16)
Between when b→0, sinh βb→ βb; cosh βb→1
Equation (16) is further written as
(17)
where the quality P is defined by the expression
(18)
since V0 tends to infinity i.e. P is a measure of the quantity V0b, which is the area of the
potential barrier.
The physical significance of this quantity is that if P is increased, the area of the,
potential barrier is increased and the given electron is bound more strongly to a
particular potential well. When P →0, the potential barrier becomes very weak which
means that electrons are free electrons. In this case we obtain from equation (17),
(19)
which resembles the result obtained by considering the Sommerfield model of a
metal.
60
Fig. 5. Graph showing the function [(P sin αa)/ αa] + cos αa plotted against αa for
P = 3π/2.
2. When αa increases the first term on the left side of (17) decreases so the width
of allowed energy bands is increased and forbidden regions become narrower.
3. The width of allowed band decreases with the increasing value of P i.e., with
increasing the binding energy of electrons. When P is infinite, the allowed energy bands
are infinitely narrow and are independent of k because when P→∞, the allowed energy
ranges of α a reduce to points given by
(20)
i.e., E is independent of k.
The energy levels in this case are discrete and the electron is completely bound. It is
caught between the potential wells and moves only in one cell of width a.
61
Hence we conclude that in one dimensional problem for the limiting case the
spectrum of permissible energy values is found to consist of continuous regions
separated by finite intervals. By varying the quantity P i.e. V0 b from 0 to ∞ we pass
from the case of free to that of bound electrons and can thus study the changes in the
allowed and forbidden ranges of energy and the wave function.
From (17) it is also possible to obtain the energy E as a function of the wave number k
which leads to the following conclusion.
The discontinuities in the E vs k curve occur at k = (nπ/a) where n = 1, 2, 3,….
These k values define boundaries of the I, II etc. Brillouin zones. From k = π/a to -
π/a there exists the first Brillouin zone. The second zone consists of two parts: one from
+ π/a to + 2π/a and the second from -π/a to -2π/a.
Each portion of the curve may be called a band. The characteristics of the curves
are that:
1. They are horizontal at the top and bottom.
2. They are parabolic near the top and the bottom with curvatures in opposite
directions.
3. (d2E/dk2) is positive in the lower portion of the band and negative for the upper
portion.
(21)
62
Fig. 7. (a) Plot of B versus k for free electron.
(b) Plot of B versus wave vector k for an electron in a monatomic linear
lattice of lattice constant a. The energy gap Eg shown is associated with the first
Bragg reflection at k = ± π/a.
where G = ± (2nπ/a) is a reciprocal lattice vector. The first reflection and the first
energy gap occur at k = ±π/a, other energy gaps occur for other values of integer n in
(21)
The reflection at k = ± π/a arises because the wave reflected from one atom in the
linear lattice interferes constructively with the wave from a nearest neighbour atom.
The phase difference between the two reflected waves is just ± 2π for these two values
of k. The region in k space between -π/a and + π /a is called first Brillouin zone.
At k = ± π/a the wave functions are not the traveling waves eiπx/a and e-iπx/a of the
free electron model. We shall show that the solutions of these particular values of k are
made up equally of waves traveling to the right and to the left. The solutions are
standing waves. This can be understood in the following manner: When the Bragg
condition is satisfied, a wave travelling in one direction is soon Bragg- reflected and
then travels in opposite direction. Each subsequent Bragg reflection reverses the
direction of travel again. The only time-independent situation is formed by standing
waves. We can form two different standing waves from the travelling waves eiπx/a and e-
iπx/a
(22)
The standing waves are labelled (+) and (-) according to whether they are even or odd
when -x is substituted for x.
The two standing waves ψ(+) and ψ(-) correspond in a lattice to the different values of
energy. This is the origin of the energy gap. We recall that in quantum mechanics the
probability density ρ of a particle is |ψ|2. For a pure travelling wave ψά eiπx/a , so that ρ ά
eiπx/a .e-iπx/a =1, a constant. But ρ is not constant for linear combination of plane waves.
Consider linear combinations ψ(+) in (22); for we have ρ ά cos2πx/a. Fig. 8 (a) indicates
the variation of the electrostatic P.E. of the conduction electron in the field of positive
ion cores of monatomic linear lattice. The ion cores bear a positive charge, as in
forming the metal each atom has lost one or more valence electrons to form the
conduction band. The potential energy of an electron in the field of a positive ion is
negative. In fig. 8 (b) we sketch the distribution of electron density corresponding to the
standing waves ψ(+), ψ(-) and to traveling wave.
63
The traveling wave eikx distributes electrons uniformly over the line: standing wave ψ(-)
ά Sinπx/a distributes electrons preferentially midway between the ion cores; and the
standing wave ψ(-) ά Cosπx/a distributes electrons on the ion cores, where the potential
energy is lowest . On calculating the average value of potential energy over the three
charge distributions, we expect to find the potential energy of ψ(+) lower than that of a
travelling wave, whereas the potential energy ψ(-) of is higher than that of traveling
wave. If the potential energy of ψ(+) and ψ(-) differ by an amount Eg , we have an
energy gap of width Eg between the two solutions at k = π/a or between the two at k = -
π/a. In fig. 7 (b) the wave function at A will be ψ(+) and the wave function at B will be
ψ(-).
Fig. 8(a) Variation of P.E. of conduction electron in the field of ion cores of
linear lattice.
(b) Distribution of probability density ρ in the lattice for |ψ(+)| 2 ά sin2 πx/a; |ψ(+)|
2
ά cos2πx/a. and for a travelling wave. The wave function ψ(+) piles up lowering the P.E.
in comparison with the average potential energy seen by travelling wave. The wave
function ψ(-) piles up charge in the region, between the ions and removes it from the ion
cores, thereby raising the P.E. in comparison with that seen by travelling wave.
4. Brillouin Zones.
We have seen in the Kronig Penney model that the energy discontinuities in a
monatomic one dimensional lattice occur when the wave number k satisfies the relation
k = nπ/a, where n is any positive or negative integer. Thus it is the value of wave
number which is important for the energy discontinuities. In one dimensional
monatomic lattice a line representing the value of k is divided up by energy
discontinuities into segments of length π /a as shown in fig. 9. These line segments are
known as Brillouin zones; the segment.
-π/a <k < π/a
is the first Brillouin zone, the two segments
- π/a <k <π/a and π/a < k < 2π/a
64
Fig. 9
The meaning of these wave number diagrams is that if we start at the origin of k-
space and consider a series of states with continuously increasing k, then E varies
continuously with k as long as k lies within the zone, but when k reaches the surface of
the zone, there is an abrupt increase in energy. The explanation of this abrupt increase
65
of energy has already been discussed previously. For any one direction in k space the
relation between B and k is of the same general form as already studied but the position
and magnitude of the gaps, a1 b1, a2 b2 ... depend upon the direction concerned. This
gives the possible type of Brillouin zone.
(a) Zones do not Overlap
Here the energies of states at the top of the first band i.e., ar may for all directions
be less than the energy of any state for any direction at any bottom of the second band
i.e., as in fig. 12.
(2) Zones Overlap
Here the energy gap Eg is present at the top of the first band for any direction but
some of the states at the top of first band i.e., ar have energy values higher than those of
the lowest state situated at the bottom of the second zone i.e., br. This condition of
affairs is shown in fig.13.
Fig. 13
(23)
66
The total number of possible wave functions in any energy band is equal to the
number of unit cells N.
(24)
But
(25)
i.e. the velocity of free electrons is proportional to k.
Now on the band theory the energy versus k curve is shown in fig. 14. Because it is a
symmetrical curve having points of inflexion at M and N, the value of (dE/dk) is
different for different portions of the curve. Since the curve is horizontal at A, O and B
Fig.14 Fig. 15
we have for these portions (dE/dk) =0. At the points N and M, (dE/dk) is the maximum
but negative at M and positive at N. The velocity versus k curve can therefore be
obtained from the E versus k diagram which is of the type as shown in fig. 15. It is
important to note here that the absolute value of velocity reaches a maximum for k = k0
where k0 corresponds to the point of inflexion in the E - k diagram and beyond this
point the velocity decreases with the increasing energy which is quite different in
behaviour from that of free electrons.
67
electron for a time dt, the electron gains certain energy dE which is the work done on
the electron by the electric field in the time dt i.e.,
(26)
Because
Now
Which gives
(27)
Now if ‘a’ be the acceleration of the electron, we have
(28)
For free electron, we have
(29)
These two expressions (28) and (29) suggest that on the band theory, the electron
behaves as if it
(Note that if energy is given by E = (ħk2/2m)*, the same expression for m* can be
obtained). The effective mass is thus determined by (d2E/dk2). This shows that since for
the lower positions of E-K curve (d2E/dk2) is +ve, m* is +ve and it will have a –ve
value for the upper position. At the points of inflexion, (d2E/dk2) = 0 and hence m*
becomes infinite. Physically speaking this means that in the upper half of the band the
electron behaves as having a negative mass or behaving like particles with +ve charges.
One can arrive at the same conclusion by considering the v-k curve and making use of
27 when an electric field is applied,
the wave vector increases linearly with time. Until the velocity reaches its maximum
value, the electron is accelerated by the field; beyond the maximum the same field
produces a decrease in v, i.e., the mass must become negative in the upper part of the
band.
68
Fig. 16.m*- k curve
This factor fk determines the extent upto which the electron in a k-state is a ‘free
electron’. If m* is large, fk is small, i.e., the particle behaves as a heavy particle. When
fk =1 then electron behaves as a free electron.
(30)
where summation extends over occupied states in the band.
The number of states in the range dk for one dimensional lattice of length L is
(31)
In the shaded region of fig. 17. i.e., within limits - k1 to + k1.
(32)
69
Since two electrons occupy each of the state, the term on the right hand side of
equation (32) is to be multiplied by 2. Hence,
70
1. That the existence of the energy gaps ∆E between allowed band should be
somewhat confirmed.
2. We should get some experimental confirmation of the fact that allowed bands
are partly full in metals and completely full in insulators.
Study of resistivity variation with temperature, light absorption and photo-conductivity
in semiconductors and insulators indicate the existence of energy gap ∆E.
Confirmation of (2) comes from soft X-ray spectroscopy. Let us consider X- ray
emission first, using the energy level diagram 18 (a). ‘A’ represents the energy level of
a core electron say 1s (K-shell) state. Such core levels are sharp. Suppose a suitable
bombardment in an X-ray tube removes the core electron. The level can be filled from
the electrons in the higher ‘allowed’ band which is shown partly full.
For v > v2 there will be no emission; since there are no levels in higher band from
where transition can take place at v2, sudden emissions of all transitions will take place
which therefore becomes the emission edge. v1 <v <v2 will correspond to the emission
band; for v <v1 there will be no emission
because again there are no possible transition levels. The observed emission band
width therefore measures the extent to which the allowed band is full. Now consider the
absorption of X-rays by the same substance. Fig. 18 (b) shows that for v <v2 there
should be no absorption; at v = v2 there should be an absorption edge’ at v > v2 there
should be an ‘absorption band’. Absorption therefore also gives the evidence for the
existence of the unfilled levels in the allowed band.
Absorption and emission edges coincide in frequency in metals.
Emission, bands for non-metals are different from metals and are what can be
expected from full zones. They have no sharp emission edges.
Emission bands in metals are also not perfectly sharp as transitions from filled to
unfilled levels occur in metals over a region proportional to kT at the Fermi levels. As a
consequence, the width of the emission edge also varies with temperature.
71
PROBLEMS
1. Show that if in the potential model used in connection with the Kronig — Penney
calculation. V0 is allowed to become infinitely large and b is allowed to approach zero,
such that the product P = remains fixed, ten allowed energy values are given
by the solution of equation
Also show that this leads to allowed and forbidden energy bands.
While discussing Kronig —Penney model it has been shown that equations (11)
to ( 14) have non-vanishing solutions if and only if
(15)
Within energy range 0 < E < V0, ± β is imaginary and putting β = iγ. Noting that cos ix
= coshx and sinix = sinhx. Eqn. (15) can be written as
Substituting
72
CHAPTER 5
PHOTOVOLTIC
SOLAR CELLS AND PHOTODIDOES
Optoelectronic devices convert optical energy into electrical energy and vice
versa. The phenomenon of converting optical radiation into electrical energy is called
photovoltaic effect. Two important devices based on the photovoltaic effect are solar cell
and photo detector. The inverse of photovoltaic effect is termed electroluminescence, i.e ,
the phenomenon of light emission when subjected to electrical signal. The LEDs and
lasers found good examples of this effect.
PHOTOVOLTIC EFFECT
We know that if a photon with energy hν which is greater than or equal to band
gap Eg is incident on a semiconductor, absorption of light can take place with consequent
generation of an EHP. In other words, in order to generate EHPs in a particular
semiconductor, the wavelength of the incident radiation must be lower than a
characteristic value for the material, called the cut-off wavelength λc. The ability of the
material to absorb light is also dependent on another parameter, called the absorption
coefficient α. α has the unit of inverse of length while (1/ α) called the effective
absorption length, is essentially a measure of the material thickness within which the
incident photon is absorbed by the semiconductor. In fact, the fraction of the incident
radiation which is absorbed within a distance x from the surface is given by [1-exp(-αx)].
Therefore, if α is larger, more of the incident radiation is absorbed within the distance x.
Also, the fraction of the incident radiation absorbed within a distance (1/ α) is then
evaluated as [1-exp(-1)] = 0.632. An absorption coefficient of 104 cm-1 corresponds to an
absorption length of 1µm, which means that 63.2% of the incident radiation is absorbed
within 1µm from the surface. The absorption coefficients for different semiconductors
are plotted as a function of the photon energy in Fig.1 For indirect band gap
semiconductors, the value α a at a given photon energy is usually lower for materials with
higher Eg, as can be seen from the plots for Ge and Si. However direct band gap
materials (such as GaAs) have much larger values of α even though their Eg may be
larger than indirect band gap materials, as seen from the plots for GaAs and Si. This
means that for a given photon energy, a much thinner layer of GaAs (than silicon) is
needed to absorb the light.
When a p-n junction is illuminated with light of photon energy greater than Eg
photons are absorbed in the semiconductor and EHPs are generated both in the n-region
and the p-region of the junction. For the EHPs to contribute towards current in the
external circuit, the generated electrons and holes must be separated before they
recombine. This is achieved if the EHPs are generated in the depletion layer, where the
electric field sweeps away the electrons and holes in opposite directions. The photo-
73
Fig. 1 Absorption coefficients as functions of photon energy for various
semiconductor materials
generated minority carriers, which are generated within one diffusion length from the
depletion layer edge, can also diffuse to the depletion region without recombining. They
are then swept across- the junction due to the electric field present in the depletion region
as shown in Fig. 2. Due to the direction of the electric field being from the n-region to the
p-region, the holes flow towards the p-region and electrons to the n-region. Since the
direction of this photo-generated current IL is opposite to that in a forward-biased diode,
we can write an expression for the total current in an illuminated diode as
(1)
In other words, the total current of the diode is lowered by an amount IL when
illuminated.
The I-V characteristics of an illuminated p-n junction diode are depicted in Fig. 3.
An interesting point to be noted is that the I-V characteristics pass through the first, third,
and fourth quadrants. Depending on the intended application, the diode can be made to
operate so that either power is delivered to the device (operating in the first or the third
quadrant) or supplied by the device to the external circuit (operating in fourth quadrant).
When the diode is used as solar cell, it is made to operate in the fourth quadrant and work
as a battery. On the other hand, to make use of the diode as a photodetector, it is usually
operated in the third quadrant. However, though the underlying principle is the same, the
actual structures are very different for these two devices
74
Fig. 2 An illuminated p-n junction showing the generation
and subsequent separation of EHPs
Solar Cell
A solar cell is basically a p-n junction, which can deliver power to the external
circuit when illuminated. Arrays of such junctions are used to supply electrical power to
space satellites and also for remote and rural areas. Solar cells are preferred over
conventional batteries for these applications as they are environment-friendly and provide
maintenance-free services for long time. However, the high cost of solar cells has so far
curtailed its widespread use for terrestrial applications.
Solar cells are usually large area devices so that a greater amount of optical
energy is absorbed and output power is high. In order to increase the light-generated
current IL, a large number of the minority carriers generated outside the depletion legion
have to reach the edge of the depletion region. This requires the diffusion lengths Ln, and
Lp to be large. As seen from the discussion on fabrication of diodes, the top layer is
produced by diffusion or ion-implantation with a high impurity concentration. High
impurity concentrations create lattice damage which introduces recombination centres.
This makes the lifetime of carriers very short in the top layer and consequently the
diffusion length is small, hence, in a practical solar cell, the topmost layer is kept very
thin so as to allow more light to reach the base. On the other hand, the diffusion length in
75
the substrate (or base) is large and the carriers generated in this region mostly contribute
to useful photocurrent. Also, n+p structures, with the n+ layer at the top, are preferred.
This is because the diffusion constant of electrons in the p-base (Dn) is larger than that of
holes in the n-base (Dp) of a p+n junction. Larger diffusion constant means larger
diffusion length and therefore an improvement in IL as more carriers generated outside
the depiction region can diffuse to this high field region. Usually, in a practical solar cell
the surface concentration of the n-region is kept below 1020 cm-3 with a junction depth
less than 0.2 µm. The surface of the solar cell is coated with antireflection coating in
order to minimize reflection losses.In general, the antireflection materials used are Si02,
TiO2, and Ta205. The series resistance of the solar cell has to be made very small so that
power is not lost due to losses in the cell itself. To achieve this, thin contact fingers are
provided on the n-region. Fig.4(a) shows the structure of a solar cell.
Fig.4 (a) Structure or a solar cell (b) I-V characteristics of an illuminated solar cell
showing the point of maximum power.
Solar cells are usually calibrated at AMI or AMO illumination. The AMI
spectrum represents sunlight incident at the sea level on a clear day with the sun at the
zenith and the incident optical power in this case is about 92.5 mW/cm2. On the other
hand, the incident optical power tot AMO is 135 mW/cm2 and is considered to be the
intensity of the solar spectrum just outside the earth’s atmosphere. White AMO
calibration is generally used for solar cells in space satellites, AMI is usually meant for
cells in terrestrial applications.
The fourth quadrant 1-V characteristics of a solar cell is shown in Fig.4(b) The
three most important parameters of a solar cell are the open circuit voltage Voc, the short
circuit current Isc, and the fill factor FF. Since in a solar cell, current flows from the n-
region to the p-region, that is, in a direction opposite to that of a conventional diode,
putting V = 0 in Eq. (1) we obtain the short circuit current Isc, as
(2)
Again for I = 0, we have the open circuit voltage Voc from Eq (1) as
76
(3)
However, no power is delivered when the solar cell is operating at either of these two
points. So, the operating point of the cell should be selected such that the output power
becomes maximum. The output power is given by
(4)
The condition for maximum power is obtained from dP/dV = 0. Hence, differentiating
Eq. (4) and equating it to zero, we get the expressions for the voltage and the current at
the point of maximum power as
(5a)
(5b)
The optimum load resistance Rop for maximum power is thus given by rearranging Eq.
(5b) as
(6)
The Fill factor of the solar cell is defined as
(7)
where Pm = VmIm is the maximum power output of the solar cell. For a well-designed cell,
FF usually lies in the range 0.7-0.8. The efficiency of the solar cell is given by
(8)
where Pin = incident optical power.
Thus, it is evident that to realize a solar cell with high efficiency, it is not only
necessary to have high Voc and Isc but also a high FF. Solar cells with η= 15% under
AM1 illumination are commercially available.
To improve the efficiency of solar cells IL has to be increased. This can be
achieved by increasing the collection of photo-generated carriers. One way of improving
the carrier collection is to heavily dope the back of the cell so as to realize an n+pp+
structure. The potential barrier at the back pp+ junction forces the minority carriers to be
confined to the p-region. The decrease in the availability of minority carriers reduces
recombination at the back contact and enhances the chance of collection of the generated
carriers. Other techniques employed to increase IL include surface texturing, use of solar
concentrators, and so on.
The efficiency of a solar cell depends, to some extent, on the material used. For a
material with larger band gap Voc is usually higher as Io reduces drastically with increase
77
in Eg see Eq. 3. On the other hand, with an increase in Eg, some part of the. incident light
for which hv is now less than Eg, no longer contributes to IL, thereby reducing ISC, see
Eq. 2. Therefore, there is an optimum band gap for the highest possible efficiency. GaAs
solar cells have higher efficiency than their silicon counterparts, and are therefore
preferred for space applications, where cost is not of much concern. On the other hand,
silicon is still preferred for terrestrial applications, for their lower cost. Very high
efficiency cascade solar cells, consisting of a series of junctions of materials with
different band gaps, which efficiently about different portions of the solar spectrum, have
also been demonstrated primarily for space applications.
Photodiode
In a photodiode the incident optical signal generates EHPs that gives rise to a
photocurrent. The diode is reverse biased at less than its breakdown voltage (Vr<VBR ) ,
so that under dark conditions, only a very small current flows, usually referred to as the
dark current of the photodiode. Now, when light is incident on the diode, a large
photocurrent flows. This change in current allows the photodiode to detect the presence
of light and hence the photodiode is also termed a photo detector. The dark current must
he made small and the photocurrent large as much as possible so that the photodiode can
detect weak signals and the signal-to-noise ratio (SNR) is large.
Most photodiodes have a thin p+ layer at the top unlike solar cells where the n+
layer is at the top. This is because a diode with the p+ layer at the top has a faster time
response compared to the diode with the n+ layer at the top (In a solar cell, the response
time is not important). For a well designed photodiode, most EHPs are generated within
the depletion region. Due to the electric field, the electrons move towards the n-region
and the holes towards the p-region. Now the mobility of boles is much less compared to
that of electrons. So with a p-layer at the top, the slower carriers (holes) have to move a
much shorter distance to the p-contact.
An important parameter of a photodiode is the quantum efficiency η, which is
defined as the number of useful EHPs generated per incident photon. The term ‘useful
EHPs, refers to those EHPs which do not recombine and therefore contribute to current in
the external circuit. The value of η may be less than 1 if all the light is not absorbed
within the depletion layer or a diffusion length from it. Another related parameter of a
photodiode is the responsibility R. It is the ratio of the photocurrent to the incident optical
power and is expressed in units of Amperes/Watt. If Poptical is the incident optical power
at a wavelength λ (corresponding to a photon energy hν), and IL is the photo-generated
current then η and R are expressed as
(9a)
And
(9b)
78
In Eq. (9a).
α = absorption coefficient
Wp, = thickness of the top layer.
W = depletion width, and
Lp = minority carrier diffusion length in the underlying n-layer.
It has been assumed that only those EHPs which are generated in the depletion
layer and one diffusion length away from it in the n-region contribute to the photocurrent.
From Eq. 9a), we see that for high efficiency, Wp must be small and the depletion layer
width W must he large. Physically this means that most of the incident photons are
absorbed within the depletion region rather than in the neutral p- and n-regions. As there
exists a strong electric field in the depletion region. as soon as anEHP is generated within
this region, electrons are swept into the n-region and holes into the p-region eventually
contributing to photocurrent. Also a large W results in a small capacitance and improves
the speed of response of the diode. However, if W is too large. the transit time delay of
the carriers dominates and hence a compromise has to he made.
HELP DESK 1
Equation (9b) suggests that for a given quantum efficiency, the responsivity increases
with wavelength. What is the physical reason for this?
let us consider two ideal photodiodes, P1 and P2, having quantum efficiency η
equal to 1. Let us assume that the same number of photons is incident on P1 and P2 per
unit time. Therefore, it follows that the photo-generated current is also the same in P1 and
P2, that is IL1= IL2. Let λ1 and λ2 be the incident wavelengths on P1 and P2 respectively,
and λ1 > λ2. Since each photon incident on P2 has higher energy than those incident on P1
the incident optical power on P2 (Poptical 2) is greater than the incident power on
P1(Poptical).
Now, since
And
79
and a smaller radius of curvature results in a smaller breakdown voltage. A guard ring of
larger junction depth is usually employed in p-i-n photodiodes as shown in Fig.5(a) to
increase the break down voltage and reduce the dark current.
Photodiodes, which are operated close to the avalanche breakdown voltage to
enable avalanche multiplication of photo—generated carriers, are called avalanche
photodiodes. The avalanche multiplication process increases the photocurrent and hence
the responsivity of the detector. However, since the avalanche multiplication process is
statistical in nature, there is a fluctuation in the responsivity due to the avalanche
multiplication noise.
80
Example 1
81
CHAPTER 6
Introduction
A charge separation constitutes an electric dipole and the polarization represents
the average density (per unit volume) of the electric dipole moment (a vector quantity) in
a given direction while the electric polarization has to do with the static positioning of
the charges, the magnetization is due the motion of these charges.
A plane current loop is equivalent to a magnetic dipole so far as the magnetic
effects both produced and experienced by them are concerned. Electrons in an atom move
in certain prescribed fashions about the nucleus of the atom. The electronic motion can be
described by assigning a certain angular momentum to each electron. Consequently the
magnetic properties of a substance are determined by the state of angular momentum
which the electrons may possess.
Let us consider an electron with rest mass m0 moving around the nucleus in a
circular orbit of radius a and angular frequency ω. The angular momentum of the election
about the z axis is
where is the unit vector in the z direction . Since the electron goes round every 2π/ω
second, the curent generated by the electronic motion is thus given by
Where Φ is a unit vector in Φ direction. The angular momentum associated with the
electronic motion is then given by
82
This is the value of an elementary permanent magnetic dipole.
Besides permanent dipoles, magnetic moments can also be introduced by the application
of an external magnetic field; the principle involved in the operation of a Betatron is that
the electrons are accelerated into circular orbital motions by a time varying magnetic
field and the magnetic moment is induced due to change in the orbital motion of the
electrons.
When a substance is placed in a magnetic field H, a magnetic moment M per unit
volume results; M is called the intensity of magnetization. For isotropic materials M and
H are parallel vectors and the susceptibility defined by M = χ. H is then a scalar quantity.
In anisotropic substances χ is a tensor.
Diamagnetism
Diamagnetism is associated with the tendency of electrical charges partially to
shield the interior of a body from an applied magnetic field. Classically, diamagnetism is
related to changes in the orbital motion of electrons known as Lamor precession that
occur when atomic systems are placed in a magnetic field. Atoms or ions in which the
orbital and spin angular momenta cancel in pairs have J = 0 and have no permanent
magnetic moment to cause Para magnetism.
It may be recalled that the current induced in a closed electrical circuit by a
magnetic field is always in such a direction as to tend to keep the total flux unchanged.
Thus the circuit has in effect a negative susceptibility. The effect is retained even in
systems of charges that must be treated by quantum mechanics and is responsible for
diamagnetism. Since all atoms or ions produce a diamagnetic contribution to the total
susceptibility, although it may be marked by the other types; it is a consequence of the
magnetic moment induced in the atoms by an external field, In this respect diamagnetism
may be compared with the electron polarization in an electric field. Both are essentially
independent of temperatures. There exists however an essential difference; in die
electrical case the induced moment lies along the direction of an applied field leading to
83
the positive electrical susceptibility; in the magnetic case the induced moment produces a
negative susceptibility.
Theory
Assume an arbitrary direction for the angular momentum vector Ma relative to the
magnetic field H.
Fig. 1
(1)
The magnetic field produces a torque It µ X H on the dipole so that
(2)
This is the equation of motion of a vector Ma precessing about H with an angular
frequency
(3)
where ωL is called the Larmor frequency
(4)
Substituting for ωL
(5)
2
where <ρ > is the mean square radius of projection of the orbit on a plane perpendicular
to H.
When the treatment is extended to solid containing Z electrons, one obtains for
diamagnetic susceptibility
(6)
84
(7)
where it is assumed that the charge distribution of the atom is spherically
symmetrical, i.e
It should however be noted that the diamagnetic susceptibility is determined
essentially by charge distribution in atoms.
With the introduction of quantum orbital theory the Langevin’s expression was
corrected by substituting the value of <ρ >2 deduced on Bohr’s theory conception.
The value of <ρ >2 for hydrogen-like atom is
(8)
where a0 is the radius of inner most orbit in the hydrogen atoms (normal state) and
Z is the effective nuclear charge.
The atomic diamagnetic susceptibility is
(9)
where a0 =0.528 x 10-8 for hydrogen atom in normal state.
Paramagnetism
Paramagnetism is related to the tendency of a permanent magnet to align itself in
the direction of magnetic field such that its dipole moment is parallel to the field. In
atomic systems the permanent magnetic moment is associated with the electronic spin as
in atoms, molecules and lattice defects possessing an odd number of electrons, the total
spin cannot be zero. Thus a paramagnetic substance has a non-vanishing angular
momentum. It may also be the permanent moment of unfilled atomic shells that arises
from a combination of spin and orbital moment.
85
Where i is the current in the loop and A is the area enclosed by the circulating
current. Imagine for simplicity that the current is caused by a single electron in a circular
orbit of radius r about the central nucleus.
Then the current around the orbit is the amount of charge which passes any point in the
orbit per unit time and is
(11)
The angular momentum of the electron in the circular orbit of radius r is
(12)
Then
(13)
Since the charge on electron is negative so µm and L are oppositely directed vectors. Thus
the relation between magnetic moment and angular momentum is
(14)
From this expression it is clear that magnetic moment and angular momentum are related
through a constant [e/2m] termed as gyromagnetic ratio.
Now further, since angular momentum of electron in its orbit was found to be a multiple
of (h/2π), the magnetic moment of an electron with angular momentum (h/2π) is called
Bohr magneton µB.
(15)
The orbital angular momentum can also be related to quantum numbers. Recall that the
quantum numbers l and ml refer to the electron in its orbit.
Total orbital angular momentum is given by
(16)
where l = 0,1,2….
The number m1 refers to the component of angular momentum in z-direction; it satisfies
the relation
(17)
(18)
and (19)
The spin of the electron also results in a contribution to the magnetic moment now
according to the hypothesis of Uhlenback and Goudsmit the electron has an intrinsic
magnetic moment due to this intrinsic angular momentum. The magnitude of this
moment µz, spin is found experimentally to be the same and is twice as predicted by
eq.14 , Thus
86
(20)
where s is the spin quantum number. Since s has the value and +1/2 and -1/2 ; the
magnitude of spin magnetic moment is always i.e, spin moment always has the
magnitude of one Bohr magneton. Thus the energy split between Sz = 1/2 and Sz = -1/2
states in a magnetic field is found to be 2µBH in contrast to µBH as predicted in classical
model.
The magnetic moment of a multi-electron atom is merely the sum of the magnetic
moments of all the electrons including both orbital and spin moments. Each electron
contributes an independent vector quantity to the total magnetic moment of the atom,
Since all filled shells have zero total angular momentum, they also have zero total
magnetic moment. In particular atoms or ions which possess only filled shells have no
permanent moment and hence they cannot be paramagnetic.
(21)
But
(22)
If
(23)
Then
(24)
Or
(25)
87
Ferromagnetism
Ferromagnetics are qualitatively distinguished from other substances by the
following properties:
(1) They acquire a relatively high magnetization in weak fields. Thus, for instance, in a
field of 10 gauss, the specific intensity of magnetisation of antimony (diamagnetic) is
about - 0.8 x l0-6 and of cobalt (paramagnetic) is about 0.6 x 10-3 while for soft iron it is
about 200.
(2) They do not exhibit a linear proportionality between the magnetization and the field
strength.This means that their susceptibility χ is not constant but varies with the field so
that the magnetic properties of a ferromagnet cannot be specified in a simple manner as
in the case of para and diamagnets.
(3) Their magnetization is not a unique function of the field strength but depends also on
the field to which they have been subjected previously.If the field is varied cyclically, the
well known hysteresis curve is obtained.
(4) They can exit in a permanently magnetized state even when there is no external field
(residual magnetism
(5) They lose their spectral characteristic above a certain critical temperature and then
behave as paramagnetic substances. Early investigations had shown that a piece of iron
raised to dull red heat was not attracted by a magnet and that a magnet similarly heated
lost the whole of its magnetism. Hence it followed that iron loses its special magnetic
properties if its temperature is raised above a certain value.
Hopkins found the critical temperature for iron to be 740°C and for nickel to be
40°C. He also found that below the critical temperature, the susceptibility increased with
rising temperature in weak fields, but reverse was the case in strong fields. Curie and
others extending the observations far beyond the critical temperature found that
susceptibility of ferromagnetics varied both with field and the temperature in a
complicated way which could however be reduced to paramagnetics. Researches
conducted at very low temperature down to that of liquid air indicate that in the case of
Fe and Ni, if the magnetizing field is small the susceptibility decreases as the low
temperatures are approached: but the field is large it increases.
Hence ferromagnetism is probably to be regarded as a special manifestation of
Para magnetism limited to a range of temperature below Curie point.
88
Considering a gm molecule of the substance and if ρ is the density ; M the
molecular weight : σ the gm. molecular magnetic moment and σ0 the gm. molecular
saturation moment, then according to classical Langevin’s theory,
(27)
And
(28)
When the external field is zero, i.e., H = 0,
Then
(29)
But
(31)
Thus we obtain two simultaneous equations (27) and (31) which are solved
graphically. Considering the curves corresponding to the two equations, the equation (27)
gives curve (A) and equation (31) gives a straight line (B) of slope [RTM/Aρσ02].
The curves (A) and (B) intersect each other at the origin 0 and at another point S. so that
the solutions are σ/ σ0 = 0 or SN.
But σ/ σ0 = 0 cannot be a true solution since there is spontaneous magnetization.
Here the only correct solution is σ/ σ0 = SN, that is to say, a point of intersection other
than the origin. It can be easily shown that S corresponds to a stable state of spontaneous
magnetization because if the molecules in the domain assume the state C the local
magnetization is less than that corresponding to equilibrium.
Fig.2
89
Domains Theory
In order to explain the fact that a piece of ferromagnetic material may exist in the
non-magnetized state, whereas a weak magnetic field may produce saturation
magnetization in the same specimen, Weiss introduced the domain hypothesis. Each
domain is spontaneously magnetized, the magnetization being appropriate to the
temperature T of the specimen. The over-all magnetization is given by the sum of the
domain vectors, and thus may vanish under certain circumstances; an example is given in
(Fig.3a). Magnetization of a specimen may occur either by the growth of one domain at
the expense of another, i.e., by the motion of domain walls (Fig.3b), or by rotation of
domains (Fig.3c). A representative magnetization curve is given in Fig.4, indicating the
predominant processes in the different regions. We may note here that originally it was
thought that the well-known Barkhausen jumps were due to the rotation of a complete
domain and that the size of the Barkhausen discontinuities was a measure of the size of
the domains. However, experiments by Williams and Shockley show that the Barkhausen
jumps are mainly associated with irregular fluctuations in the motion of the domain walls
rather than with domain rotation.
Fig.3. The domain structure (a) corresponds to the non- magnetized state; (b) represents
magnetization due to wall motion; in (c) the magnetization is due to rotation of the
domain vectors from an “easy” to a “hard” direction.
The most direct experimental evidence for the existence of domains is provided
by the so-called “Bitter powder patterns.” A drop of a colloidal suspension of
ferromagnetic particles is placed on the carefully prepared surface of the specimen; since
there are strong local magnetic fields near the domain boundaries, the particles collect
there and the domains may be observed under a microscope.
90
Fig.4. Typical magnetization curve of a virgin specimen indicating the
predominant processes taking place in the different regions. When the field is reversed at
C, the dashed curve is obtained; Hc, is called the coercive force.
The physical origin of domains may be understood from the general
thermodynamic principle that the free energy E -TS of a solid tends to reach a minimum
value. As a result of the high degree of order in the magnetic system, except in the
vicinity of the Curie temperature, the entropy term may be neglected for our purpose;
thus, minimizing the energy E of the system should be sufficient to understand the
existence of domains. To illustrate the essential features of this point of view, we refer to
Fig. 6, representing a cross section through a ferromagnetic single crystal.
91
closure domains are necessarily magnetized along a hard direction. In iron and nickel, on
the other hand, it is possible to have both the closure domains and the dominant domains
magnetized along easy directions.
Fig. 6. Magnetization curves at 38°C for a single crystal of iron for different directions of
the field relative to the crystal axes.
Summarizing the ideas discussed above we may say that domain structure has its
origin in the principle of minimum energy. It will be evident that the number of domains
and the domain structure will depend to a large extent on the shape and size of the crystal
under consideration. The size of the domains for a particular domain structure may also
be obtained from the principle of minimum energy. The volume of domains may vary
between, say,10-2 to 10-6cm3.
Hall Effect
Measurements of conductivity alone are not sufficient for determination of both
the number of conducting charges n and their mobility µ. Neither do they permit
determination of the sign of the dominant charge carrier. However it is quite fortunate
that a simple experimental procedure is available to measure the concentration of
majority carriers. This is the Hall Effect which gives additional Information required. If a
magnetic field is applied to a system in which a current is flowing there is a tendency for
the charge carriers to be deflected laterally. Now imagine applying an external electric
field E along the axis of the specimen, and then electrons will drift in the opposite
directions. If we apply a magnetic perpendicular to the axis of the specimen, the carriers
will then tend to be deflected to one side. At the surface of the crystal they will not drift
into a space but a surface charge will be developed. The surface charge then will give rise
to a transverse electric field, known as Hall field and this will cause a compensating drift
such that the carriers remain within the specimen.
Thus the Hall effect is observed when a field is applied at right angles to a
conductor carrying a current as shown in fig.7.
92
The magnetic field gives rise to an electric field in a direction mutually
orthogonal to the direction of current and magnetic field. The reason for this effect is
apparent when the forces on the current carrier are considered. The electric field E which
produces the current I, causes a force of magnitude eE to act on the electrons. In the
presence of the magnetic field, a magnetic force proportional to the magnetic field
strength H and the electron’s average velocity v also on the electron. This force is at right
angles to the directions of H and v, and therefore each electron is deflected towards one
side of the conductor. When the electrons reach the surface of the conductor, an electrical
charge is built up there, which in turn produces an additional electric field. Under
equilibrium conditions, the sideways force on the moving carriers due to this field just
balances that arising from the field and the electrons can again move freely down the
conductor.
Fig.7
(1)
93
(2)
This Ey is called Hall voltage, where vx is average drift velocity.
Further, current density may be expressed in terms of number of electrons n per unit
volume
(3)
where RH is known as Hall coefficient. This expression for Hall coefficient is in e.s.u. and
in e.m.u. it will be
(4)
Substituting this value of vx in eq.(2), we get
(5)
Equating the two values of Ey from (3) and (5), we get
(6)
And
Also
(7)
(8)
Or
(9)
94
Importance of the Hall Effect
1. The sign of the current carrying charge is determined.
2. The number of charge carriers per unit volume can be calculated from the Hall
coefficient RH.
3. The mobility is measured directly.
4. It can be used to determine the electronic structure of the substance i.e., whether these
are metals, semi-conductors or insulators.
5. The knowledge of Hall voltage developed enables us to measure high unknown
magnetic field provided we know the Hall constant for the slab used for it.
6. It gives the concept of negative mass as suggested by Bleaney.
Experimental Determination of Hall Coefficient
We have
(10)
Where VH is the Hall voltage.
Thus the value of Hall coefficient can be calculated by knowing i and Hz and by
measuring experimentally the Hall voltage, VH developed across C and D which is
measured either by sensitive galvanometer or better by means of a calibrated
potentiometer.
For accuracy of the measurement the length of the specimen should be large as
compared to its thickness.
95
CHAPTER 7
SUPERCONDUCTIVITY
SUPERCONDUCTIVITY
Superconductivity was first discovered in 1911 by the Dutch Physicist Heike
Kamerlingh Onnes. Onnes observed that as purified mercury is cooled, its resistivily
vanishes abruptly at 4.15 K. Fig. I shows the result of the experiments conducted by
Onnes. Heike Kamerlingh Onnes was honored in 1913 with the Nobel Prize in physics
for superconductivity.
In many of the metals and alloys, dc electrical resistivity vanishes entirely below a
certain temperature, the temperature depending upon the substance. The zero resistivity is
known as superconductivity. The temperature at which the normal metal passes into
superconducting state is called the transition temperature or critical temperature Tc. The
materials whose resistivities become immeasurably small or actually become zero below
a critical temperature Tc. The known as superconductors. Fig. 2 (a) shows the resistivity
of normal metal and Fig 2(b) shows the resistivity of pure and impure superconductor.
Above critical temperature Tc, the material is in the familiar normal state. But
below Tc it enters an entirely different superconducting state.
The most sensitive measurements have shown that the resistance of these
materials in the superconducting state is at least 1016 times smaller than their room
96
temperature values. Superconductors having a critical temperature Tc , above 77K
(boiling Point of liquid Nitrogen) are particularly interesting because they do not require
liquid helium (b.pt 4 K) or liquid N2 (b.pt 20 K) for cooling. There are some metals like
cesium that become superconducting only if a large pressure is applied to them.
(a) (b)
Fig. 2. Schematic representation of the resistivity of the
(a) Normal metal, (b) Pure and impure superconductor as a function of temperature T.
Properties of Superconductors
(1) At room temperature, the resistivity ρ of superconducting materials are greater than
other elements (as shown in Fig. 3).
97
Critical Field
The minimum applied magnetic field necessary to destroy superconductivity and
restore the normal resistivity is called the critical field Hc. When the applied magnetic
field exceeds the critical value Hc, the superconducting state is destroyed and the material
goes into the normal state.
Hc depends on the temperature. Fig. 4 shows the critical field Hc as a function of
temperature. A specimen is superconducting below the curve and normal above the
curve,
For a given substance, value of Hc decreases as temperature increases from T = 0 K to Tc
(1)
Eq. (1) is the phase boundary between the normal and superconducting
state,
98
Meissner Effect
Meissner and Ochsenfeld found that if a superconductor is cooled in a magnetic
field to below the critical temperature Tc (transition temperature), then at the transition,
the lines of induction are pushed out. “The expulsion of magnetic flux from the interior of
a piece of superconducting material as the material undergoes the transition to the
superconducting phase is known as Meissner effect.”
Fig.5 shows the normal sphere at T> Tc and superconducting sphere at T< Tc showing
the expulsion of magnetic lines of induction. Meissner effect is reversible, when the
temperature is raised from below Tc the flux suddenly penetrates the specimen after it
99
reaches Tc and the substance is in the normal state. i.e. a superconductor exhibits perfect
diamagnetism. Because of diamagnetic nature, superconducting materials strongly repel
external magnets. It leads to a levitation effect.
100
(a) (b)
Fig.6. Magnetization versus magnetic field for (a) Type land (b) Type 11
superconductors.
(a) (b)
Fig.7 ρ verses H for (a) type 1 and (b) type 11 superconductors.
Isotope Effect
In 1950, CA. Reynolds and E.M. Maxwell found that the critical temperature
varies with the atomic mass ma, according to the relation:
maα . Tc = Constant
here α is a materials constant and is known as isotope effect co-efficient and is given as
101
The value of α varies from 0.45 to 0.50 for many superconductors but there
notable exceptions to these values where the value of α is quite small like Nb3Sn, Mo etc.
Tc for mercury varies from 4.185 K to 4.16 K when m0 changes ma changes from 199.5
to 203.4 amu
Fig. 8 shows the experimental result for tin. Since Debye temperature θD, is proportional
to the velocity of sound which depends on ma -1/2; therefore the transition temperature can
be related to the Debye temperature i.e.
102
CHAPTER 8
LASER
INTRODUCTION
The word LASER is an acronym for Light Amplification by Stimulated Emission
of Radiation. It is a device to produce a strong, monochromatic, collimated and highly
coherent beam of light and depends on the phenomenon of “stimulated emission’
The laser is the outgrowth of MASER which is a device that amplifies microwaves. The
first successful maser, an ammonia maser was built by Gordon, Zeiger and Townes in
1954. Townes, Basov and Prokhorov were awarded the Nobel Prize in Physics in 1964
for their invention of maser. In 1958, Schawlow and Townes showed that the maser
principle could be extended into the visible region and in 1960, T.H. Maiman built the
first laser using ruby as the active medium. Helium-Neon laser was developed by Au
Javan and his associates in 1961. This type of laser had been the dominant laser for the
next 20 years until cheap semiconductors have appeared. In these lasers the electric
discharge was used to create lasing. Also in 1961 the first semiconductor laser was
created by Robert Hall.
ABSORPTION OF RADIATION
If an atom is initially in a lower state 1, it can rise to a higher state 2 by absorbing a
quantum of radiation (Photon) of frequency v given
where E1 and E2 are the energies of the atom in the states 1 and 2 respectively, (Fig. 1).
This process is known as absorption of radiation
Fig.1 Absorption
The probable rate of occurrence of the absorption transition 1 — 2 depends
on the properties of states 1 and 2 and is proportional to the energy density u(v)
103
of the radiation of frequency v incident on the atom. Thus
SPONTANEOUS EMISSION
Let us now consider an atom initially in the higher (excited) state 2. Excited state with
higher energy is inherently unstable, hence atom in excited state does not stay for longer
time and it jumps to the lower energy state 1 emitting a photon of frequency ν. This is
spontaneous emission of radiation. If there is an assembly of atoms, the radiation emitted
spontaneously by each atom has random direction and a random phase and is therefore
incoherent from on atom to another.
104
P21 = A21 + B21 u(v) (2)
Before After
(5)
105
Einstein proved thermodynamically that the probability of (stimulated) absorption is
equal to the probability of stimulated emission
i.e. B12 = B21
Thus , we have
The equilibrium distribution of atoms among different energy states is given by using
Boltzmann’s Distribution Law according to which
Consequently,
(6)
(7)
We get
(8)
This is the relation for the ratio between the spontaneous emission and induced
emission coefficients and show that probability of spontaneous emission increases rapidly
with the energy difference between the two states.
106
POPULATION IN VERSION AND PUMPING
Usually the number of particles N2 i.e., population of higher energy level is less
than the population N1 of lower energy level which is essentially possible by supplying
energy from outside the system. Otherwise, atoms would eventually radiate, fall to the
lowest energy state, and stop emitting altogether. Therefore, as required by conservation
of energy, all laser systems must be connected to external energy sources to maintain this
non-thermal equilibrium situation. Making N2> N1 i.e., the number of particles N2 in ore
in higher energy level than the number of particles N1 in lower energy level is called
population inversion or inverted population. The states of system, in which the population
of higher energy state is more in comparison to the population of lower energy state are
called negative temperature states. Here it should be clearly understood that the negative
temperature is not a physical quantity but it is a convenient mathematical expression,
signifying the equilibrium state of the system. When the system has a number of states,
then one of them may have negative temperature with respect to other states.
A system in which population inversion is achieved is called an active system.
The method of raising a particle from lower energy state to higher energy state is called
pumping. This can be done by a number of ways:
For example; atoms can be energized by electron impact in sufficiently intense gaseous
discharges, known as ‘electrical pumping’. The electrical pumping is suited to gas and
semiconductor lasers.
We can also supply energy by powerful lamps or a laser beam whose light
populates excited states by photon absorption, known as optical pumping. This method is
particularly suited to solid state or liquid lasers (dye-lasers). The availability of efficient
and powerful laser sources at many wavelengths has made laser pumping both attractive
and practical.
A form of pumping somewhat similar to optical pumping involves a medium
excited by an x-ray beam, known as x-ray pumping.
A conceptually different and interesting type of pumping occurs when the
required population inversion is produced directly from an exothermic chemical reaction,
known as chemical pumping. Chemical pumping usually applies to materials in the gas
phase and it generally requires highly reactive and often explosive gas mixtures.
107
CHARACTERISTICS OF A LASER BEAM
The laser beam has certain remarkable properties which are as follows:
Fig. 4
The laser beam is coherent, with the waves all exactly in phase with one another [Fig.
4(c)1. An interference pattern can be obtained not merely by placing two slits in a laser
beam but also by using beams from separate lasers.
The term coherence basically refers to the degree of correlation between phases at
different points in a beam of light. It is of two types
(i) temporal or time coherence (ii) spatial coherence.
(i) Temporal or time Coherence. If the phase difference of waves crossing two points
lying along the direction of propagation of the beam is time independent, then a beam of
108
light is said to possess temporal or time coherence. This coherence is also known as
longitudinal coherence. It refers to spectral purity or the degree of two waves of laser
light remains in phase over many wavelengths. Fig.5 shows a beam of light traveling
along the axis X’X. P and Q are the two points lying on this line. The beam is said to
possess temporal coherence if the phase difference of the waves crossing P and Q at any
instant is always constant.
Accordingly, the light wave train emitted from each source can be characterized
by an average time T0. It is called coherent time. Also the band width of the spectral
distribution is inversely proportional to T0.
Fig.5
(i)
The average length of the wave trains is called coherent length. If the velocity of light is c
then coherent length Lc is given by
Lc = cT0 (ii)
Since ∆ω = 2π∆ν
Where ∆ν is the frequency in Hz
(iii)
(v)
109
ss
(ii) Spatial Coherence. A laser beam is said to possess spatial coherence, if the phase
difference of the waves crossing the two points lying on a plane perpendicular to the
direction of propagation of the beam is time-independent. This coherence is also termed
as transverse or lateral coherence. It is a measure of the minimum separation across the
wave from where two waves remain coherent. Fig.6 shows a beam of light travelings
along the line X’X. Here abed is a plane perpendicular to X’X. The beam is said to
possess spatial coherence if the phase difference of the waves crossing P and P’ at any
instant is always constant.
Fig.6
In Fig.7, S is a source and S1, S2, are two slits of separation l. If interference occurs for
lmax = lw then this dimension is called the spatial coherence length of the source.
From Fig7 (i) the source S is at a distance D from slits S1 and S2. The condition for
coherence of S1 and S2 is
110
Since a/D = θ, angle subtended by width of slit S. Thus, we get
lω = λ/θ
1. In this case interference fringes have not been observed for separation S1S2 as small as
1O-3 cm. Thus no constant phase relation exists between S1 and S2.
2. The laser light is almost perfect1y monochromatic i.e., highly temporarily coherent and
spatially coherent. This means that they have exactly mw frequency, direction and
polarization as the primary photon (spatial coherence) exactly the same phase and speed
(temporal coherence).
3. The laser beam does not diverge. Hence a laser beam is very narrow and can travel to
long distances without spreading.
The angular spread is given by
∆θ = 1.22λ/d
where d is the diameter of the front mirror.
4. The laser beam is extremely intense. To achieve an energy density equal to that in
some laser beams, a hot object would have to be at a temperature of 1O30 K. It is
estimated that light from a typical 1-mW laser is 10,000 times brighter than the light from
the sun at the earth’s surface
111
Fig. 8
Operation: Population inversion in semiconductor is achieved by using heavily doped
(106 atoms/m3 donor or acceptor atoms) crystal. We know that n—type semiconductor
has many highly mobile electrons whereas p-type has the less mobile positive holes.
When two types of structures are joined then the type of structure shown in Fig.9 results
As the material is heavily doped, the Fermi level of p-type falls within the valence
band while that of n—type falls within the conduction band. When no voltage is applied
the two Fermi levels lie in the some horizontal line. The two bands are separated by an
energy gap. E known as forbidden—hand. Due to this gap conduction electrons can not
occupy the holes in n—type material. If an external voltage V is applied, the conduction
electrons can he made to be attracted p—type up is made positive (i.e. junction is forward
biased). Thus, iicJ we can say that condition of population inversion is achieved. If this
condition is satisfied, electrons jump across the forbidden gap to combine with a hole,
giving rise to the emission of light.
Thus. LED is basically a junction between heavily doped p-type and n-type
semiconductor such as Ga As. LEDs are commonly available in near infra-red and red
regions of the spectrum. They have been made with a wavelength as short as the
wavelength of green light..
112
Fig.9
He-Ne Laser
The main drawback of ruby laser is that the output beam is not continuous though
very intense. For continuous laser beam, gas lasers are used. One such type of laser is He-
Ne laser. As the name implies, the laser medium is a mixture of Helium and Neon in the
ratio 10 : 1. It is encapsulated in a long narrow discharge tube as shown in Fig10 . The
pressure inside the tube is about 1 mm of mercury. The plane or concave mirrors M1 and
M2 are placed on the two sides of the tube to obtain a resonator system. One of the mirror
say M1 is highly reflecting whereas M2 is partially reflecting and partially transparent.
Laser beam comes out through the mirror M2.
113
The first few energy levels of He and Ne are shown in Fig11. The levels E2 and E3
of helium have nearly the same energy as the levels E’4 and E’6of neon. However, the
levels E2 and E3 are metastable. When discharge is passed through the mixture of gas, the
electrons collide with helium atoms and excite them to the metastable states E2 and E3.
This may be termed as pumping. The He atoms continue in these states for long time.
These helium atoms collide with unexcited neon atoms raising them (Ne atoms) to te
levels E’4 and E’3 . This results in a large number of neon atoms in the energy states E’6
and E’4 and this population inversion occurs with respect to the levels E’3 and E’5.
Therefore, the following three transitions occur:
1. E’6! E’5 producing laser of 3.39 µm
2. E’6 ! E’3 producing laser of 632.8 µm
3. E’4 ! E’3 producing laser of 1.15 µm
To eliminate the photons of wavelengths 1.15 µm. and 3.3 µm., mirrors are atoms
drop to level E’2 through spontaneous emission and then de-excited to ground level
through, collisions. Thus, they are again ready to go through the cycle. This is achieved
by making the tube narrow (—0.6 cm), so that the neon atoms are de-excited by collision
with the walls of the tube.
The power output from He-Ne is rather small (—100 mW maximum), however,
the radiation is extremely useful in a wide range of applications because, it is highly
collimated, coherent and has extremely narrow bandwidth.
114
APPLICATION
The lasers are profitably used in almost every field including fundamental
research. The common applications of lasers are as follows:
1. Metallic rods can be melted and joined by means of a laser beam (laser welding),
2. The laser beam is used to vaporize unwanted material during the manufacture of
electronic circuits on semiconductor chips.
3. CO2 gas lasers of about 100W output are helpful in surgery because they seal small
blood vessels while cutting through tissue by vaporizing water in the path of their IR
beams.
4. Lasers are used to detect and destroy enemy missiles during warfare.
5. Low power semiconductor lasers are used in CD (compact disc) players, laser printers,
laser copiers, facsimile machine etc.
6. Semi-conductor lasers are ideal for fibre-optic transmission lines in which the electric
signals that would normally be sent along copper wires are first converted into a series of
pulses according to a standard code. Lasers then turn the pulses into flashes of IR light
that travel along thin glass fibres and at the other end are changed back into electric
signals.
7. High power lasers are used to bring about thermonuclear reactions which would
become the ultimate inexhaustible power source for human civilization,
8. Lasers are also being employed for separating the various isotopes of an element.
9. Lasers are used in the production of three-dimensional images of an object in
holography.
10. Laser beams have also been used in the “inertial confinement” of plasma.
11. The narrow red laser beam is used in supermarkets to read the bar codes.
EXERCISE
1. Calculate the coherence length for laser whose line width is 1 X 10-5 nm at
IR emission wavelength of 10.6 µm.
(ans. ∆l = 11.2km)
2. Imagine that we chop a continuous laser beam ( assumed to be perfectly
monochromatic λ0 = 623.8 nm ) into ns pulses using some sort of shutter .
Compute the resultant line width, band width and coherence lengths. (ans. ∆l =
3cm. ∆ν = 1010 Hz. ∆λ = 0.013nm )
115
CHAPTER 9
FIBRE OPTICS
Fibre Optics
In case of electronic communication, the information is normally carried over in
the form of radio waves and microwaves, through copper wires and co-axial cables.
However, the information carrying capacity of these wires is restricted, due to their
limited band-width and is not efficient, as per the needs of the modern communication
techniques.
If light waves are used instead of radio waves, or microwaves, the number of
signals transmitted can be increased enormously. Technically, Grahm Bell in 1880
invented Photophone. which transmitted speech on a beam of light. By 1960, it was
established that light could t guided by glass fiber. However, the glass available at that
time heavily attenuated light propagating through them.
It was due to the invention of solid state lasers in 1970 and fabrication of low Ioss
glass fibres, that made the optical communication practical. With this the era of
photonics-started.
In addition to the use of optical fibre in communication, these have been extensively used
as sensors to detect electrical, mechanical and thermal energies. Also they are bung used
in copying machines and in medical diagnostics (Endoscopy).
Fig.1
Optical fibres
These are made up of glass or plastic conduicts as thin as a human hair which is designed
to guide light waves along the length of the fibre with the help of successive total internal
reflections from side walls of the fibre. The movement of light is in a zig-zag path and
some fraction of light may escape through side walls hut major fraction comes out from
the other end of the fibre. Fig.1
Construction
An optical fibre consists, in general, the following three regions:
116
(i) Inner-most region, which is the light guiding region is called core.
(ii) Core region is surrounded by a middle region, called cladding. The refractive index
of cladding is always lower than that of the core..
iii) The outermost region is called sheath. The sheath protects the core and cladding from
moisture, abrasion contamination and to give mechanical strength to fibre.
Optical fibres are either made as a single fibre or a flexible bundle of cable. A bundle
fibre is a number of fibres in a single jacket.
Fig.2
Basic propose of cladding is to confine the light to the core as the light falling on core
and striking core cladding interface at angle greater than the critical angle will be
reflected back to the core.
Propagation of Light Through Fibre
Main function of the optical fibre is to accept maximum light and transmit the same with
minimum attenuation.
Light gathering ability of a fibre depends on two factors:
(i) Core size
(ii) Numerical aperture
Numerical aperture of fibre is determined by acceptance angle and fractional refraction
index change.
117
Fig. 3
Let us now compute the incident angle i for which Φ ≤ C such that light rebounds within
the fibre.
Applying Snell’s law to the launching face of the fibre, we get
` (1)
If i is increased beyond a light, Φ will drop below the critical value C and the ray
escapes from the side walls of the fibre. The largest value of i occurs when Φ = C.
For the ∆ABC, we have
(2)
Putting sin i from equation (I) into equation (2), we get
(3)
But
(4)
Putting the expression (4) into (3), we get
(5)
When incident ray is launched from air medium, we have n0 = 1 Designing i (max) =θ0,
equation (5) may be simplified to
(6)
118
(7)
The angle θ0 is called the acceptance angle of the fibre. Acceptance angle may be
defined as the maximum angle that a light ray can have relative to the axis of the fibre
and propagate down the fibre.
Fig.4
The light rays contained within the cone having a full angle 2θ0 are accepted and
transmitted’ along the fibre. The cone is called the acceptance cone. (Fig. 4)
Light incident at an angle beyond θ0 refracts through the cladding and the corresponding
optical energy is lost. It is clear that the greater the diameter of the core, the larger the
acceptance angle.
(8)
This parameter is always positive as n1 n2 for the total internal reflection
conduction condition. In order to guide light rays effectively through a fibre, ∆ < 1.
Typically, ∆ is of the order of 0.01.
Numerical Aperture
The numerical aperture (NA) is defined as the sine of the acceptance angle. So we
may write NA = sinθ0
(9)
(10)
119
Numerical aperture accounts for the light gathering ability of the fibre and it measures the
amount of light accepted by the fibre. As is clear from eq. (9) numerical aperture depends
only on the refractive indices of the core and cladding materials.
Numerical aperture ranges between 0.13 to 0.50 and a larger value of numerical aperture
will mean that the fibre can accept more light from the source.
Modes of Propagation
120
In an optical fibre light travels as an e.m. wave and all the waves moving in directions
above the critical angle will be trapped in the fibre due to total internal reflections.
However, all such waves do not propagate through the fibre, and only certain ray
directions are allowed for propagations. These allowed directions corresponds to modes
of the fibre.
Fig.5
In simple form, we can understand the modes as the number of paths of light an
an optical fibre The paths are all zig-zag with the exception of light moving along axial
directions So the light rays moving through a fibre may be classified as (i) axial rays (ii)
zig-zag rays.
Since the rays get repeatedly reflected at the walls of the fibre, there occurs a
phase shift. So wave along certain zig-zag path be in phase and get intensified while
some other waves moving in other paths will be out of phase and may fade out due to
destructive interference.
The light ray paths along which the waves are in phase inside the fibre are called
modes.
Number of modes, a fibre can support depends on the ratio d/λ where ,d is the diameter of
the core and λ is the wavelength of the wave transmitted.
In general, modes are represented by an order number m. In a fibre of fixed
thickness, the higher order propagates at smaller angles than the lower order modes.
Axial ray that travels along the axis of the fibre is called zero order ray.
Types of Optical Fibre
Optical fibres are in general of two types:
(i) Single Mode fibre (SMF)
(ii) Multimode Fibre (MMF)
A single fibre has a smaller core diameter and can support only one mode of
propagation while a multimode fibre has a large core diameter and can support a large
number of modes.
Multimode fibres are further classified on the basis of index profie. An index
profile is graph of refractive index (along X-axis) and distance from the core (along Y-
axis). Index profit of a MMF can be either step index (SI) type or graded index (GRIN)
type. Index profile of SMF is usually a step index (SI) type.
121
only and so zero order mode is supported by SMF. Usually SMF is characterised by a
very small value of ∆(~ O.002).
The fibre is surrounded by some opaque protective sheath
Fig. 5
Fig.6
Grain Index Fibre (GRIN)
A GRIN is a multimode fibre which has concentric layers of refractive indices
which means that the refractive index of the core varies with distance from the fibres
axis, i.e., it has high value at the centre and falls of rapidly as the radial distance increases
from the axis.
As is shown in profile (Fig.7) such a profile causes typical periodic focussing of
the light moving through the fibre.
It is important to note that in case of GRIN fibres, acceptance angle and numerical
aperture diminish with increase of radial distance from the axis. In the case of fibres of
parabolic profile, numerical aperture is given by:
122
The convention of writing the size of optical fibre is to given the core/cladding
diameter.
Sizes of SI fibre are 50/125, 10/140, 200/230 etc.
Size of GRIN fibres are 50/125, 62.5/125, 85/125, etc. For example, a fibre has 50
µm core and 125 µm cladding.
Fig.7
Fig.8
(1)
where a ! Radius of the core
and λ! Free space wavelength.
Also we have Numerical Aperture given as:
Maximum number of modes Nm supported by an SI fibre may be got using the relation
123
In case V < 2.405, fibre can support only one mode and is called SMF.
For V> 2.405, ii is classified as MMF and it can support number of modes
simultaneously.
Wavelengths corresponding to value of V = 2.405 is called the cut off wavelength
of the fibre and is given as
Cut off wavelength λC = λV/2.405
In the case of GRIN fibres, for large values of V. we have
2. Optical fiber Sensors. In this case the basic characteristics of the optical fiber are
used, viz, the variation of refractive index of the fiber under the external forces may be
utilized for using the optical fiber as transducer.
In case the fiber is subjected to heat, the refractive index changes because of
temperature. As temperature increases, the difference of refractive index between the
core and cladding is reduced, thereby some light leaks into cladding. This characteristic
can be used for measurement of temperature using a LED as light source.
Fiber optics may also be used to measure pollution and foreign suspended particles
in the air. A beam of light is sent from one end of fiber and in case foreign particles are
present, they scatter the light and measurement of variation of intensity of light will
account of the extent of the presence of foreign particles.
Fiber optics may also be used to measure the level of a liquid in a container. A part
of fiber is suspended in the liquid and light is directed to pass through it and its intensity
is measured. A bare core loses more light when it is immersed in liquid than when it is
placed in air. A sudden change in intensity of light will indicate the liquid level. A LED
source, a photodector and a MMF is used to monitor the liquid level and is finding many
applications in the filling of petroleum tanks.
124
3. Entertainment applications. A coherent optical fiber bundle can greatly enhance the
size of the image on a TV screen. In case we have to use the conventional method of
getting larger images than the projection system because quick bulky and unmanageable.
(Endoscopy)
125
As a message in the form of an electric signal is received by the optical source,
light output will follow the path of variation of signal with the massage digital
modulation is preferred in optical communication over the analogue modulation as it
can pass through a larger distance with the same power.
However, one basic difficulty arises due to the attenuation of optical signal
overlarge distance because of various losses and dispersion in the optical fiber. So to
overcome it various repeaters are used in the transmission line to amplify and reshape the
signal.
At the receiver end an optical coupler is used to direct the light coming from the
fiber into a photo detector or, which converts light into electric current, which will
contain the message fed into the optical fiber. In the next step, the message is extracted
from this current and is amplified if the nearby. Then this message is put into a transducer
to get it either in video or radio form depending upon the nature of the original message.
Advantages
1. Cheaper. The main ingredient in glass is sand (SiO2), and there is an almost
unlimited supply of sand in the world compared to the supply of copper or
aluminum.
2. Safety. Through optical fibres, photons of light moves instead of an electrical
current. Therefore there is no chance of a spark flash, which could be dangerous.
3. Radio frequency interference (RFI). Since the fibre system carries no electrical
current. The energy transmitted through the fibre cannot radiate RF interference ,
nor can it be contaminated by any external noise or RF fields
4. Security. Because of the absence of the flow of current through the fibre, criminal
intrusion into th e system is also prevented. Confidential information cannot be
routed to unwanted receivers, nor can false information be fed into data stream,
5. Low losses. The transmission loss per unit length of an optical fibre is about 4
dB/km. Therefore, longer cables run between repeaters are feasible.
6. Wider bandwidth. Within a totally closed system, the number of signals that can
be modulated on a fibre optic light beam exceeds the number that can be
modulated on a very high frequency RF carrier by a factor of about 1000.
7. Deterioration. Glass is immune to corrosive and oxide degradation and will stand
up well in harsh environments. Moisture, toxic vapours and acids will not degrade
the glass fibres.
8. Small size and light weight. The size of the core and clad of a single fibre
conductor is much smaller than the diameter of a common copper wire conductor;
126
however, when the insulation is included, the sizes are similar. Bundles of optical
fibre cables are smaller by a factor of 10 and weight less by a factor of 14 than an
equal number of copper wire conductors.
9. Temperature. Excluding the protective insulation, the melting point of glass is
much higher than copper.
10. Long life. The life expectancy of glass fibres is predicted to exceed 100 years.
Disadvantages
1. Limited applications. All fibre optics system is limited to fixed point – to- point
ground installation. They cannot leave the ground nor be associated with a mobile
communication system.
2. Nuclear radiation. Glass, when exposed to neutron bombardment, will darken.
The harder the glass. The more quickly it will discolor.
3. Low power. Popular light –emitting sources are restricted to very low power
devices. Though, higher power devices are available but they are costly.
4. Distance. Because of the low power sources, the distance between repeater
amplifiers must be relatively short for the high date rates demanded in some
systems.
5. Modulation. There are limited ways in which the light source can be modulated
6. Fragility. The optical fibres are easily broken or damaged due to age and
vibrations
EXERCISE
1. Calculate the fractional index change for a given optical fiber if the refractive indices
of the core and cladding are 1.563 and 1.498, respectively. (ans. ∆ = 0.0415)
2. Calculate the numerical aperture and hence the acceptance angle for an optical fiber
given that refractive indices of the core and the cladding are 1.45 and 1.40
respectively. (ans. NA = 0.3775, θm = 22.180 )
3. A glass clad fiber is made with core glass of refractive index 1.5 and the cladding is
doped to give a refractive index difference of 0.0005, Find (a) the cladding index
(b) the critical internal reflection angle. (c) the external critical acceptance angle. (d)
the numerical aperture. ( ans. n2 = 1.49925, Φc = 88.20 ,θm = 2.720 , NA = 0.15)
4. An optical fiber has a NA of 0.20 and a cladding refractive index of 1.59. Determine
the acceptance angle for the fiber in water which has a refractive index of 1.33. ( ans.
θm = 8.60 )
127
Bibliography
1. Charles Kittel : Introduction to Solid State Physics
2. B.S. Sexena, R. C. Gupta & P.N. Sexena: Solid State Physics
3. A.J. Dekkar: Introduction to solid state physics
4. Arthur Beiser: Concept of modern physics
5. Nandita & Amitava Das Gupta: Semiconductor devices
6. Ashok Sharma: Solid State Physics & Electronics
7. Ben C. Streetman & Sanjay Banerjee: Solid State Electronics Devices
8. S.O. Pillai: Solid State physics.
9. Rakesh Dogra: Essentials of physics.
10. S. K. Hajra Chaudhary: Material Sci. & Progress.
11. Navneet Gupta & A. K. Tayagi: Fundamentals of Engg. Physics
128