>Sample-2
MASDASHALEAALEQMDGIIAGTKTGADLSDGTCEPGLASPASYMNPFPVLHLIEDLRLALEMLELPQER
AALLSQIPGPTAAYIKEWFEESLSQVNHHSAASNETYQERLARLEGDKESLILQVSVLTDQVEAQGEKIR
DLEVCLEGHQVKLNAAEEMLQQELLSRTSLETQKLDLMTEVSELKLKLVGMEKEQREQEEKQRKAEELLQ
ELRHLKIKVEELENERNQYEWKLKATKAEVAQLQEQVALKDAEIERLHSQLSRTAALHSESHTERDQEIQ
RLKMGMETLLLANEDKDRRIEELTGLLNQYRKVKEIVMVTQGPSERTLSINEEEPEGGFSKWNATNKDPE
ELFKQEMPPRCSSPTVGPPPLPQKSLETRAQKKLSCSLEDLRSESVDKCMDGNQPFPVLEPKDSPFLAEH
KYPTLPGKLSGATPNGEAAKSPPTICQPDATGSSLLRLRDTESGWDDTAVVNDLSSTSSGTESGPQSPLT
PDGKRNPKGIKKFWGKIRRTQSGNFYTDTLGMAEFRRGGLRATAGPRLSRTRDSKGQKSDANAPFAQWST
ERVCAWLEDFGLAQYVIFARQWVSSGHTLLTATPQDMEKELGIKHPLHRKKLVLAVKAINTKQEEKSALL
DHIWVTRWLDDIGLPQYKDQFHESRVDRRMLQYLTVNDLLFLKVTSQLHHLSIKCAIHVLHVNKFNPHCL
HRRPADESNLSPSEVVQWSNHRVMEWLRSVDLAEYAPNLRGSGVHGGLIILEPRFTGDTLAMLLNIPPQK
TLLRRHLTTKFNALIGPEAEQEKREKMASPAYTPLTTTAKVRPRKLGFSHFGNIRKKKFDESTDYICPME
PSDGVSDSHRVYSGYRGLSPLDAPELDGLDQVGQIS
>Sample-3
MGLTSTWRYGRGPGIGTVTMVSWGRFICLVVVTMATLSLARPSFSLVEDTTLEPEDAISSGDDEDDTDGA
EDFVSENSNNKRAPYWTNTEKMEKRLHAVPAANTVKFRCPAGGNPTPTMRWLKNGKEFKQEHRIGGYKVR
NQHWSLIMESVVPSDKGNYTCVVENEYGSINHTYHLDVVERSPHRPILQAGLPANASTVVGGDVEFVCKV
YSDAQPHIQWIKHVEKNGSKYGPDGLPYLKVLKAAGVNTTDKEIEVLYIRNVTFEDAGEYTCLAGNSIGI
SFHSAWLTVLPAPGREKEITASPDYLEIAIYCIGVFLIACMVVTVILCRMKNTTKKPDFSSQPAVHKLTK
RIPLRRQVTVSAESSSSMNSNTPLVRITTRLSSTADTPMLAGVSEYELPEDPKWEFPRDKLTLGKPLGEG
CFGQVVMAEAVGIDKDKPKEAVTVAVKMLKDDATEKDLSDLVSEMEMMKMIGKHKNIINLLGACTQDGPL
YVIVEYASKGNLREYLRARRPPGMEYSYDINRVPEEQMTFKDLVSCTYQLARGMEYLASQKCIHRDLAAR
NVLVTENNVMKIADFGLARDINNIDYYKKTTNGRLPVKWMAPEALFDRVYTHQSDVWSFGVLMWEIFTLG
GSPYPGIPVEELFKLLKEGHRMDKPANCTNELYMMMRDCWHAVPSQRPTFKQLVEDLDRILTLTTNEEYL
DLSQPLEPYSPCYPDPR
>Sample-4
MEPSSETGMDPPLSQETFEDLWSLLPDPLQTVTCRLDNLSEFPDYPLAADMSVLQEGLMGNAVPTVTSCA
PSTDDYAGKYGLQLDFQQNGTAKSVTCTYSPELNKLFCQLAKTCPLLVRVESPPPRGSILRATAVYKKSE
HVAEVVKRCPHHERSVEPGEDAAPPSHLMRVEGNLQAYYMEDVNSGRHSVCVPYEGPQVGTECTTVLYNY
MCNSSCMGGMNRRPILTIITLETPQGLLLGRRCFEVRVCACPGRDRRTEEDNYTKKRGLKPSGKRELAHP
PSSEPPLPKKRLVVDDDEEIFTLRIKGRSRYEMIKKLNDALELQESLDQQKVTIKCRKCRDEIKPKKGKK
LLVKDEQPDSE
>Sample-5
MAVVIRLQGLPIVAGTMDIRHFFSGLTIPDGGVHIVGGELGEAFIVFATDEDARLGMMRTGGTIKGSKVT
LLLSSKTEMQNMIELSRRRFETANLDIPPANASRSGPPPSSGMSSRVNLPTTVSNFNNPSPSVVTATTSV
HESNKNIQTFSTASVGTAPPNMGASFGSPTFSSTVPSTASPMNTVPPPPIPPIPAMPSLPPMPSIPPIPV
PPPVPTLPPVPPVPPIPPVPSVPPMTPLPPMSGMPPLNPPPVAPLPAGMNGSGAPMNLNNNLNPMFLGPL
NPVNPIQMNSQSSVKPLPINPDDLYVSVHGMPFSAMENDVRDFFHGLRVDAVHLLKDHVGRNNGNGLVKF
LSPQDTFEALKRNRMLMIQRYVEVSPATERQWVAAGGHITFKQNMGPSGQTHPPPQTLPRSKSPSGQKRS
RSRSPHEAGFCVYLKGLPFEAENKHVIDFFKKLDIVEDSIYIAYGPNGKATGEGFVEFRNEADYKAALCR
HKQYMGNRFIQVHPITKKGMLEKIDMIRKRLQNFSYDQREMILNPEGDVNSAKVCAHITNIPFSITKMDV
LQFLEGIPVDENAVHVLVDNNGQGLGQALVQFKNEDDARKSERLHRKKLNGREAFVHVVTLEDMREIEKN
PPAQGKKGLKMPVPGNPAVPGMPNAGLPGVGLPSAGLPGAGLPSTGLPGSAITSAGLPGAGMPSAGIPSA
GGEEHAFLTVGSKEANNGPPFNFPGNFGGSNAFGPPIPPPGLGGGAFGDARPGMPSVGNSGLPGLGLDVP
GFGGGPNNLSGPSGFGGGPQNFGNGPGSLGGPPGFGSGPPGLGSAPGHLGGPPAFGPGPGPGPGPGPIHI
GGPPGFASSSGKPGPTVIKVQNMPFTVSIDEILDFFYGYQVIPGSVCLKYNEKGMPTGEAMVAFESRDEA
TAAVIDLNDRPIGSRKVKLVLG
>Sample-6
MDNKNIDPNFNPERFLETQKYKVIVTALVFLLLFIVFLMVAFKKAFFAQANMPTLVMSKQDTATRGTIYS
QDNYSLATSQTLFKLGFDTRFLNPDKEDFFIDFLSIYSNIPKKSLKDAINTKGYTILAYDLTPNTAANLR
DLNKKFLTFGVFQNFKDARDKVWQKQGLNIEVSGVSRHYPYQNSLEPIIGYVQKQEENKLTLTTGKKGVE
KSQDHLLKAQQNGIRTGKRDVSFNFIQNHSYTEVERLDGYEVYLSIPLKLQREIETLLDKAKDKLKAEEI
LVGIINPKSGEILSLASSKRFNPNAIKTSDYESLNLSVAEKVFEPGSTIKPIVYSLLLDKNLINPKERID
LNHGYYQLGKYTIKDDFVPSKKAVVEDILIQSSNVGMIKISKNLNPEDFYNGLLGYGFSQKTGIDLSLEA
TGKIPPLSAFKREVLKGSVSYGYGLNATFLQLLRAYAVFSNEGKLTTPYLVQRETAPNGDIYIPSPKPTF
QVINPKSARKMKETLIKVVRYGTGKNAQFEGLYIGGKTGTARVAKNGSYSAQSYNSSFFGFAEDERQVFT
IGVVILGSHGKEEYYASKIAAPIFKEITEILVRYNYLSPSIAIQNALEKNRFKIK
>Sample-7
MSRRKPASGGLAASSSAPARQAVLSRFFQSTGSLKSTSSSTGAADQVDPGAAAAAAAAAAAAPPAPPAPA
FPPQLPPHIATEIDRRKKRPLENDGPVKKKVKKVQQKEGGSDLGMSGNSEPKKCLRTRNVSKSLEKLKEF
CCDSALPQSRVQTESLQERFAVLPKCTDFDDISLLHAKNAVSSEDSKRQINQKDTTLFDLSQFGSSNTSH
ENLQKTASKSANKRSKSIYTPLELQYIEMKQQHKDAVLCVECGYKYRFFGEDAEIAARELNIYCHLDHNF
MTASIPTHRLFVHVRRLVAKGYKVGVVKQTETAALKAIGDNRSSLFSRKLTALYTKSTLIGEDVNPLIKL
DDAVNVDEIMTDTSTSYLLCISENKENVRDKKKGNIFIGIVGVQPATGEVVFDSFQDSASRSELETRMSS
LQPVELLLPSALSEQTEALIHRATSVSVQDDRIRVERMDNIYFEYSHAFQAVTEFYAKDTVDIKGSQIIS
GIVNLEKPVICSLAAIIKYLKEFNLEKMLSKPENFKQLSSKMEFMTINGTTLRNLEILQNQTDMKTKGSL
LWVLDHTKTSFGRRKLKKWVTQPLLKLREINARLDAVSEVLHSESSVFGQIENHLRKLPDIERGLCSIYH
KKCSTQEFFLIVKTLYHLKSEFQAIIPAVNSHIQSDLLRTVILEIPELLSPVEHYLKILNEQAAKVGDKT
ELFKDLSDFPLIKKRKDEIQGVIDEIRMHLQEIRKILKNPSAQYVTVSGQEFMIEIKNSAVSCIPTDWVK
VGSTKAVSRFHSPFIVENYRHLNQLREQLVLDCSAEWLDFLEKFSEHYHSLCKAVHHLATVDCIFSLAKV
AKQGDYCRPTVQEERKIVIKNGRHPVIDVLLGEQDQYVPNNTDLSEDSERVMIITGPNMGGKSSYIKQVA
LITIMAQIGSYVPAEEATIGIVDGIFTRMGAADNIYKGQSTFMEELTDTAEIIRKATSQSLVILDELGRG
TSTHDGIAIAYATLEYFIRDVKSLTLFVTHYPPVCELEKNYSHQVGNYHMGFLVSEDESKLDPGAAEQVP
DFVTFLYQITRGIAARSYGLNVAKLADVPGEILKKAAHKSKELEGLINTKRKRLKYFAKLWTMHNAQDLQ
>Sample-8
MEYTYQYSWIIPFIPLPVPMLIGVGLLLFPTATKNLRRMWAFPSIFLLSIVMILSVYLSIQQINRSFIYQ
YVWSWTINNDFSLEFGHLIDPLTSIMLILITTVGILVLFYSDNYMSHDQGYLRFFAYMSFFNTSMLGLVT
SSNLIQIYIFWELVGMCSYLLIGFWFTRPSAATACQKAFVTNRVGDFGLLLGILGLYWITGSFEFRDLFQ
ILNNLIYNNEVPFLFLTLCAFLLFAGAVAKSAQFPLHVWLPDAMEGPTPISALIHAATMVAAGIFLVARL
LPLFIIIPYIMNLISLIGIITVLLGATLALAQKDIKRGLAYSTMSQLGYMMLALGMGSYRAALFHLITHA
YSKALLFLGSGSIIHSMEAIVGYSPDKSQNMVLMGGLKKHVPITKTAFLVGTLSLCGIPPLACFWSKDEI
LNDSWLYSPIFAIIACSTAGLTAFYMFRIYLLTFEGHFNVHFQNYNGQKSSSCYSISLWGKEVPKTIKNH
FCLLSLLTMNNNERASFFSNKTYQIDGNGKNRIHPFITITNFVTKNTFSYPHESDNTMLFSIVILVIFTL
FVGVVGIPFAFNQEEIHLDILSKLLNPSINLLHPNSNNSVDWYEFVTNASFSVSIAFFGIFIASFLYKPI
YSSLQNLNLLNSFSKRGPNRILGDRIRNGIYDWSYNRGYIDAFYTISLTQGIRGLAELIHFLDRRVIDGI
TNGFGLTSFFFGEGIKYVGGGRISSYLLLYLLFVLIFLLIYSFLFFF
2. For an unknown sequence
>Sample
MYGGENREKRTKASRPTKDSIITRREAQSTSPHVTFVCDSEGAETSVRHSKSSDVHCGGVRLFSDETVNA
VVPNSTPVESFNGAGANYWRNMDNMVVDRLSLSMDDISVMRLRGCRATGLSQGCCGASVSTSYVLPPSLY
ASPFEQLLDIGALRGCYSYHDTGNTILNGGEDSVDNLAAAAAAVDATVTMHDVGVEMANDNDKNNNIHDD
GDTPCGVRGDRGVQTPGLKLGCAPRIFSEALSSLHLENHDNLDAMISQRPGKNAVTPPASSRPSTTSSKN
HTPAFQPFSSWKFPVLGKVDSAPAVSLQRADLIGEGEKGAWHNGFQKEVNAAAAAGGGGGGGIPGARCGA
VNCSDNGDRCGYGAGGDDDDGDNDKSVSLLEGQEYQGYKKRLRFMYAIYERHALQEGRINNNINISQRDT
NRNGSNALALHTSLQCPSPTFTTTWVPSGYYSLGTRCSIHHPNKQVVPVVSPLLNSLMSRQRDECPRSCT
VVMDPSIVALIERRPVLQTTIFASHTYRQLRRQIKQQKLQSSGERGYGPDATPFLPHVEDSTRQQDMCSG
GVAGGISNVAAREKSPLKKLWATERARRLNSKVATGTTPVAAATVAAGETSSAEPAAVPLMSREEPPNLV
HHRVLTQVNSWNSKVHTIDGINRQVDNEADDLVVYVGMTLMGWLEVVDLLGAGTFGQVFLCKDLRIANGC
FMHPMEIEGEDFQYWQCSHEYIPFSDPSIMPTHPSLVAVKVVKSRALFEQQSVLEAEMLVCIGAQTPSQK
DHGPLQNEGFGAAVHTTEPPQVDPRCNYVAKVYAHGICYGHHCIVMERYGANLFEYVQSRGFKGLPMYYI
QTIGKKILLALTLLHDECRVVHCDIKPENVLLTLDSCISTVTIHGSGGPVGSNGSGAKATLEASGVLLAS
SVRKPCLSTRLEASMSNTIDVPLPAPLPLRLHRAVPLEKVHDKTRRGETNTDGEGIGDGGPREVPSGSVI
PPLHIKLIDFSSSAYVGGCVYTYVQSRYYRAPEVIIGAGYGPPIDVWSTGCFLAELLLGLPLLPGSCDYH
QLYLMEEMLGPLPTSLLAQGRLTHDYYDAEDAEPERESTASGSSTLLKTKTKGQSSFRLLREEEYRARHG
QKQPVEWRCYFQYHTLAELVRRCMLTAEEKRMAIGCSPIASVGDISEEEVEQQKPIKTILDEMMQQRLWL
YDLLKKMLHGDPSKRPTAREALAHSFFTHTPEYAKPYLPLPE
i. Perform BLASTp to identify the organism. Go to the GenPept page (take a snapshot).
ii. Mention the accession number, locus, definition, sequence length, name of the
organism, taxonomy, title, name of the journal, BioProject, and BioSample (if
available).
iii. For the same sequence, repeat BLASTp with the PAM30 and PAM250 scoring matrices
and list 3 scientific names each of distantly and closely related species for this
organism.
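The searches in steps i and iii differ only in the scoring matrix, so they can be sketched as one parameterised command. The assignment itself uses the NCBI web interface; the sketch below is an assumed command-line alternative, not part of the original exercise. It presumes a local NCBI BLAST+ installation and a hypothetical query file `sample.fasta` holding the unknown sequence. The output file names are also hypothetical. The code only builds the argument lists and does not run a search.

```python
from typing import List

# Matrices to compare: BLOSUM62 is the usual blastp default (step i), while
# PAM30 and PAM250 are the two matrices named in step iii.
MATRICES = ["BLOSUM62", "PAM30", "PAM250"]


def build_blastp_command(query_path: str, matrix: str, db: str = "nr") -> List[str]:
    """Return a BLAST+ blastp argument list for one scoring matrix.

    Assumptions (not from the assignment): BLAST+ is installed, the search
    is sent to NCBI with -remote, and the output file name is hypothetical.
    """
    return [
        "blastp",
        "-query", query_path,
        "-db", db,
        "-remote",
        "-matrix", matrix,
        "-out", f"hits_{matrix}.txt",
    ]


# Build one command per matrix and show them; nothing is executed here.
commands = [build_blastp_command("sample.fasta", m) for m in MATRICES]
for cmd in commands:
    print(" ".join(cmd))
```

Each list could be passed to `subprocess.run` once BLAST+ is available. Keeping the matrix as the only varying argument makes it easier to attribute differences in the hit lists to the matrix itself.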
Answer 1:
Sequence 1:
Sequence 3:
Sequence 5:
ii.
Accession number: XP_009309175.1
Locus: XP_009309175
Definition: protein kinase [Trypanosoma grayi]
Sequence length: 1232 aa
Name of the organism: Trypanosoma grayi
The taxonomy: Eukaryota; Discoba; Euglenozoa; Kinetoplastea; Metakinetoplastina;
Trypanosomatida; Trypanosomatidae; Trypanosoma.
Title: protein kinase [Trypanosoma grayi]
Name of the journal: National Center for Biotechnology Information, NIH, Bethesda,
MD 20894, USA
BioProject: PRJNA258390
BioSample: SAMN02726834
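The sequence length reported in the record can be cross-checked against the query by counting residues in the FASTA text. The following is a minimal sketch using only the Python standard library. The function name `fasta_lengths` and the two-record toy input are illustrative assumptions; the full `>Sample` sequence would be pasted in place of the toy data.

```python
from typing import Dict, Optional


def fasta_lengths(fasta_text: str) -> Dict[str, int]:
    """Map each FASTA header (without the leading '>') to its residue count."""
    lengths: Dict[str, int] = {}
    current: Optional[str] = None
    for raw in fasta_text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            current = line[1:].strip()
            lengths[current] = 0
        elif current is not None:
            # Sequence lines may wrap; add each wrapped line to the running total.
            lengths[current] += len(line)
    return lengths


# Toy input for illustration only; not one of the assignment sequences.
toy = """>toy-1
MKTAYIAK
QRQISFVK
>toy-2
MSTNPKPQ
"""
print(fasta_lengths(toy))
```

The count obtained for the query can then be compared with the 1232 aa reported above.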
iii. PAM30
CLOSELY:
1. Trypanosoma theileri
2. Trypanosoma cruzi
3. Trypanosoma conorhini
DISTANT:
1. Mortierella sp. AM989
2. Mortierella sp. AD032
3. Bodo saltans
PAM250
CLOSELY:
1. Trypanosoma theileri
2. Trypanosoma cruzi
3. Trypanosoma melophagium
DISTANT:
1. Micromonas pusilla CCMP1545
2. Amborella trichopoda
3. Mortierella antarctica