GENSCAN 1.0    Date run: 19-Feb-121    Time: 20:41:38

Sequence gi568815596r:156225747_156430186 : 204440 bp : 39.19% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +   4782   5216  435  2  0   64   38   299 0.619  18.52
 1.02 PlyA +   5338   5343    6                               1.05

 2.00 Prom +  28479  28518   40                              -1.05
 2.01 Init +  29678  29978  301  1  1   88   -8   339 0.938  21.76
 2.02 Term +  30024  30691  668  1  2  -61   43   389 0.938  12.50
 2.03 PlyA +  30919  30924    6                               1.05

 3.00 Prom +  31088  31127   40                              -6.15
 3.01 Sngl +  31181  32248 1068  1  0   44   42   411 0.892  28.98
 3.02 PlyA +  32384  32389    6                               1.05

 4.00 Prom +  32414  32453   40                             -13.78
 4.01 Init +  32536  34139 1604  0  2   42   53   404 0.701  24.20
 4.02 Intr +  70849  70967  119  1  2   82   18    81 0.000  -0.31
 4.03 Intr +  90860  90941   82  0  1   60  108    43 0.033   1.28
 4.04 Intr +  91697  91756   60  2  0   86   94    37 0.562   1.03
 4.05 Intr +  93702  93850  149  0  2   23   28   193 0.290   5.76
 4.06 Term +  96866  97116  251  0  2  -44   47   263 0.024   3.88
 4.07 PlyA +  98128  98133    6                               1.05

 5.07 PlyA -  98709  98704    6                               1.05
 5.06 Term - 100254  99998  257  1  2   78   47   147 0.996   4.36
 5.05 Intr - 100582 100404  179  0  2   96   82   113 0.946  10.14
 5.04 Intr - 101174 100972  203  2  2   13   90   212 0.524  11.06
 5.03 Intr - 102268 102105  164  2  2   44   90   131 0.731   7.67
 5.02 Intr - 102787 102658  130  1  1   85  102    86 0.991   9.05
 5.01 Init - 104440 103577  864  1  0   89  105   827 0.741  79.12
 5.00 Prom - 105985 105946   40                              -4.05

 6.06 PlyA - 107190 107185    6                              -0.45
 6.05 Term - 108744 108380  365  1  2   81   43   288 0.143  17.34
 6.04 Intr - 114052 114012   41  0  2   86   89    30 0.082  -0.15
 6.03 Intr - 115661 115567   95  2  2   56   87    89 0.043   3.44
 6.02 Intr - 124571 124417  155  0  2   67   43   178 0.357  10.07
 6.01 Init - 147259 147118  142  1  1   70   72   111 0.931   8.05
 6.00 Prom - 148214 148175   40                              -6.15

 7.02 PlyA - 148383 148378    6                               1.05
 7.01 Sngl - 149635 149297  339  1  0   88   37   363 0.811  26.98
 7.00 Prom - 151912 151873   40                              -3.35

 8.02 PlyA - 152066 152061    6                               1.05
 8.01 Sngl - 174814 174359  456  1  0   56   42   247 0.563  10.90

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init +  90877  90941   65  0  2   64  108    39 0.838   4.17
S.002 Init +  96812  96887   76  1  1   75   60    96 0.822   6.80
S.003 Term +  96995  97116  122  0  2   78   47   108 0.808   3.36

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:156225747_156430186|GENSCAN_predicted_peptide_1|144_aa
MGNITGLFEDIPSTSSEPSSPTDWLDRTVNTITAVQLSGSPIPRGRGRAPHQGIATWNKK
IWTAALQSQIFPLKQSTQRGRNQKRKYDNVTKQDSITFTKDHTRAMNPNQEEISELPDKQ
FRRLIIKLFKEVPEKGENQLNTIF

>gi568815596r:156225747_156430186|GENSCAN_predicted_CDS_1|435_bp
atgggtaacatcacaggactctttgaagacattcccagcaccagctcagagcccagtagc
cccactgactggctagacagaacggtgaacacaatcactgctgttcagctctcaggaagc
cccatccccaggggaagaggaagagcaccacatcaagggattgccacatggaacaaaaaa
atctggacagcagcccttcagtcccagatctttccactgaaacagtctacccaaagggga
aggaaccagaaacggaagtatgataatgtgacaaaacaagattctataacattcacaaaa
gatcacactagagcaatgaatccaaaccaagaagaaatctctgaattgccagataaacaa
ttcagaaggttgattattaagctattcaaggaggtaccagagaaaggtgaaaaccaactt
aatacaattttttaa

>gi568815596r:156225747_156430186|GENSCAN_predicted_peptide_2|322_aa
MGKKQNRKTGNSKTQSTSPPPKERSSSPATEQSRMENDFDELREEGFRRSNYSELREDIQ
TKGKELENFEKNLEECITRITNTEKCLKELMELKTKARDLQRVSAMEDEMNEMKREGKFR
EKRIKRNEQSLQEIWDYVKRPNLRLIGVPESDGENGTKLENTLQDIIQENFPYLARQANV
QIQEIQRTPQRYSSRRATPRHIVRFTKVEMKEKMLRAAREKGRVTLKGKPIRLTVNVSAE
TLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDKQMLRDFVTSRPALKEL
LKEALNMERNNRYQPLQNHAKM

>gi568815596r:156225747_156430186|GENSCAN_predicted_CDS_2|969_bp
atggggaaaaaacagaacagaaaaactggaaactctaaaacgcagagcacctctcctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagccggatggagaatgactttgat
gagctgagagaagaaggcttcagacgatcaaattactctgagctacgggaggacattcaa
accaaaggcaaagaacttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagtgcttaaaggagctgatggagctgaaaaccaaggctcgagatcta
caaagggtatcagcaatggaagatgaaatgaatgaaatgaagcgagaagggaagtttaga
gaaaaaagaataaaaagaaatgagcaaagcctccaagaaatatgggactatgtgaaaaga
ccaaatctacgtctgattggtgtacctgaaagtgatggggagaatggaaccaagttggaa
aacactctgcaggatattatccaggagaacttcccctatctagcaaggcaggccaacgtt
cagattcaggaaatacagagaacgccacaaagatactcctcgagaagagcaactccaaga
cacattgtcagattcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagag
aaaggtcgggttaccctcaaagggaagcccatcagactaacagtgaatgtctcggcagaa
accctacaagccagaagagagtgggggccaatattcaacattcttaaagaaaagaatttt
caacccagaatttcatatccagccaaactaagcttcataagtgaaggagaaataaaatac
tttacagacaagcaaatgctgagagattttgtcaccagcaggcctgccctaaaagagctc
ctgaaggaagcgctaaacatggaaaggaacaaccggtaccagccactgcaaaatcatgcc
aaaatgtaa

>gi568815596r:156225747_156430186|GENSCAN_predicted_peptide_3|355_aa
MGDFNTPLSTLDRSTRQKVNKDTQDLNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHILGSKALLSKCKRTEIITNYLSDHSAIKLELSIKNLTQNRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDTFKAVCRRKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSCFFERINRIDRPLARLIK
KKREKNQIDTIKNDRGDITTDPTEIQTTIIEYYKHLYANKLENLEEMDKFLDTYTLPRLN
QGEVESLNRPITGSEIVAIINSLSIKKSPGPDGFTAEFYQRYKEELVPFLLKLFQ

>gi568815596r:156225747_156430186|GENSCAN_predicted_CDS_3|1068_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagacagaaagtcaac
aaggatacccaggacttgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaacagagatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcagcattaagaatctc
actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca
acataccagaatctctgggacacattcaaagcagtgtgtagacggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagccgaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc
aggagctgcttttttgaaaggatcaacagaattgatagaccactagcaagactaataaag
aaaaaaagagagaagaatcaaatagacacaataaaaaatgatagaggggatatcaccacc
gatcccacagaaatacaaactaccatcatagaatactacaaacacctctacgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaac
cagggagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc
aatagtttatcaatcaaaaagagtccaggaccagatggattcacagccgaattctaccag
aggtacaaggaggaactggtaccattccttctgaaactattccaatga

>gi568815596r:156225747_156430186|GENSCAN_predicted_peptide_4|754_aa
MIISIEAEKAFDKIQQPFMLKTLNKLGVDGTYLKIIRAIYDKPTANIILNGQKLEAFPLK
TGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMILYLENPIVS
AQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLT
RDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRFNAIPIKLPI
TFFTELEKNTLKFIWNQKRARIAKSILSQKNKAGGIMLPDFKLYYKAVVTKTAWYWYQNR
DIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLT
PYTKINSRWIKDLNIRPKTIKTLEENLGVTIQDIGMGKDFMSKTPKAMATKAKIDKWDLI
KLKSFCTAKETTIRVNRKPTKWEKIFATHSSDKGLISRIYNELKQIYKKKTNNPIKKWAK
DMNRHLSKEDIYAAKKHMKKCSPSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNSIYALF
RPSVLVFIEKKSKIKNMIGTYSQKSLPQEAPASLKRHFENMSQGFLVPKSLIMNYFRITA
DKSMLPSRSLNMPEATSSTLSSFQSQEIVVDSIPLEVPLVPDERLLTLAAAALYCTRDVS
SRNLGIVSARTEDAGIRGRYEALITKFNLFLKKVMQRLAFGNAACLLFFLTLLLETLGQE
KSSAASGKMQGQCMRRSSPEQLGARPAGSLTATL

>gi568815596r:156225747_156430186|GENSCAN_predicted_CDS_4|2265_bp
atgattatctcaatagaggcagaaaaagcctttgacaaaattcaacaacccttcatgcta
aaaactctcaataaattaggtgttgatgggacgtatctcaaaataataagagctatctat
gacaaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaa
actggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagtt
ctggccagggcaattaggcaagagaaggaaataaagggtattcaattaggaaaagaggaa
gtcaaattgtccctgtttgcagatgacatgattttatatctagaaaaccccattgtctca
gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat
gtacaaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatg
agtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttaca
agggatgtgaaggacctcttcaaggagaactacaaaccactgctcaaggaaataaaagag
gatacaaacaaatggaagaacattccatgctcatgggtaggaagaatcaatatagtgaaa
atggccatactgcccaaggtaatttacagattcaatgccatccccatcaagctaccaatc
actttcttcacagaattggaaaaaaatactttaaagttcatatggaaccaaaaaagagcc
cgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcatgctacctgac
ttcaaactatactacaaggctgtggtaaccaaaacagcatggtactggtaccaaaacaga
gatattgatcaatggaacagaacagagccctcagaaataacgccacatatctacaactat
ctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttacacaaaaatcaattcaagatggattaaagacttaaacattagacctaaaaccata
aaaaccctagaagaaaacctaggcgttaccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagaaaacctaca
aaatgggagaaaattttcgcaacccactcatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaacaaacaaccccatcaaaaagtgggcgaag
gacatgaacagacacttgtcaaaagaagacatttatgcagccaaaaaacacatgaaaaaa
tgctcaccatcactggccatcagagaaatgcaaatcaaaaccacaatgagataccatctc
acaccagttagaatggcaatcattaaaaagtcaggaaacaacagcatttatgccttattt
cggccttctgtgcttgtgttcatagagaagaaaagcaaaataaagaatatgattggcact
tactcacagaaatcacttcctcaagaagcacctgctagcctaaaaagacactttgagaac
atgtcacaaggcttccttgtccctaaatcactgattatgaattatttccggataactgca
gacaagtcaatgctcccctccagaagtctgaacatgcctgaagcaacatcctcaactctc
agcagcttccagagccaagaaattgtggtagactcaatccctctggaagtccctctagta
cccgatgaaagacttctcacactcgcagcagccgccctctactgtaccagggacgtctct
agcagaaacttgggcattgtatctgctagaacggaggacgcgggaatacgagggcggtat
gaagccctcattacaaagtttaatctctttctgaagaaggtaatgcaacgtctggctttt
ggcaatgcagcgtgtctgctgtttttcctcactttactcctagaaacattaggacaagaa
aagtcctcagctgcatctggaaagatgcaggggcaatgcatgcgacgctcttcacccgag
caactcggtgcgcggccggccgggagtttgacagcgactctgtga

>gi568815596r:156225747_156430186|GENSCAN_predicted_peptide_5|598_aa
MPCVQAQYGSSPQGASPASQSYSYHSSGEYSSDFLTPEFVKFSMDLTNTEITATTSLPSF
STFMDNYSTGYDVKPPCLYQMPLSGQQSSIKVEDIQMHNYQQHSHLPPQSEEMMPHSGSV
YYKPSSPPTPTTPGFQVQHSPMWDDPGSLHNFHQNYVATTHMIEQRKTPVSRLSLFSFKQ
SPPGTPVSSCQMRFDGPLHVPMNPEPAGSHHVVDGQTFAVPNPIRKPASMGFPGLQIGHA
SQLLDTQVPSPPSRGSPSNEGLCAVCGDNAACQHYGVRTCEGCKGFFKRTVQKNAKYVCL
ANKNCPVDKRRRNRCQYCRFQKCLAVGMVKEVVRTDSLKGRRGRLPSKPKSPQEPSPPSP
PVSLISALVRAHVDSNPAMTSLDYSRFQANPDYQMSGDDTQHIQQFYDLLTGSMEIIRGW
AEKIPGFADLPKADQDLLFESAFLELFVLRLAYRSNPVEGKLIFCNGVVLHRLQCVRGFG
EWIDSIVEFSSNLQNMNIDISAFSCIAALAMVTERHGLKEPKRVEELQNKIVNCLKDHVT
FNNGGLNRPNYLSKLLGKLPELRTLCTQGLQRIFYLKLEDLVPPPAIIDKLFLDTLPF

>gi568815596r:156225747_156430186|GENSCAN_predicted_CDS_5|1797_bp
atgccttgtgttcaggcgcagtatgggtcctcgcctcaaggagccagccccgcttctcag
agctacagttaccactcttcgggagaatacagctccgatttcttaactccagagtttgtc
aagtttagcatggacctcaccaacactgaaatcactgccaccacttctctccccagcttc
agtacctttatggacaactacagcacaggctacgacgtcaagccaccttgcttgtaccaa
atgcccctgtccggacagcagtcctccattaaggtagaagacattcagatgcacaactac
cagcaacacagccacctgcccccccagtctgaggagatgatgccgcactccgggtcggtt
tactacaagccctcctcgcccccgacgcccaccaccccgggcttccaggtgcagcacagc
cccatgtgggacgacccgggatctctccacaacttccaccagaactacgtggccactacg
cacatgatcgagcagaggaaaacgccagtctcccgcctctccctcttctcctttaagcaa
tcgccccctggcaccccggtgtctagttgccagatgcgcttcgacgggcccctgcacgtc
cccatgaacccggagcccgccggcagccaccacgtggtggacgggcagaccttcgctgtg
cccaaccccattcgcaagcccgcgtccatgggcttcccgggcctgcagatcggccacgcg
tctcagctgctcgacacgcaggtgccctcaccgccgtcgcggggctccccctccaacgag
gggctgtgcgctgtgtgtggggacaacgcggcctgccaacactacggcgtgcgcacctgt
gagggctgcaaaggcttctttaagcgcacagtgcaaaaaaatgcaaaatacgtgtgttta
gcaaataaaaactgcccagtggacaagcgtcgccggaatcgctgtcagtactgccgattt
cagaagtgcctggctgttgggatggtcaaagaagtggttcgcacagacagtttaaaaggc
cggagaggtcgtttgccctcgaaaccgaagagcccacaggagccctctcccccttcgccc
ccggtgagtctgatcagtgccctcgtcagggcccatgtcgactccaacccggctatgacc
agcctggactattccaggttccaggcgaaccctgactatcaaatgagtggagatgacacc
cagcatatccagcaattctatgatctcctgactggctccatggagatcatccggggctgg
gcagagaagatccctggcttcgcagacctgcccaaagccgaccaagacctgctttttgaa
tcagctttcttagaactgtttgtccttcgattagcatacaggtccaacccagtggagggt
aaactcatcttttgcaatggggtggtcttgcacaggttgcaatgcgttcgtggctttggg
gaatggattgattccattgttgaattctcctccaacttgcagaatatgaacatcgacatt
tctgccttctcctgcattgctgccctggctatggtcacagagagacacgggctcaaggaa
cccaagagagtggaagaactgcaaaacaagattgtaaattgtctcaaagaccacgtgact
ttcaacaatggggggttgaaccgccccaattatttgtccaaactgttggggaagctccca
gaacttcgtaccctttgcacacaggggctacagcgcattttctacctgaaattggaagac
ttggtgccaccgccagcaataattgacaaacttttcctggacactttacctttctaa

>gi568815596r:156225747_156430186|GENSCAN_predicted_peptide_6|265_aa
MDKFLDTYIFPTLNQEEVESLNRPITGSETEAIKVQDQTDSQPNSTREFSPVGHWGHNIV
QKITLPYVGKNCPLSQQSPTFLAPGTGFMDNFSTDLRSGIDSKSQVNGAKLDFPPKYNSV
SRRRRSWISVSTSTRCVLNVISSDGHRRAAAAAAAAAAAATSGGNLRWSRRFPRGCGAKE
PAASWGSRRASDGNASPAAGELGAAGSRQRRGCRLGLSSPGSRIVPRSHRPVLRTRVGGQ
VAKGSRLPGNEPHRERRGGWDALRE

>gi568815596r:156225747_156430186|GENSCAN_predicted_CDS_6|798_bp
atggataaattcctggacacatacatcttcccaacactaaaccaggaagaagttgaatct
ctgaatagaccaataacaggctctgaaactgaggcaataaaagtccaggatcagacagat
tcacagccgaattctaccagagaattctccccagtaggacactggggtcacaacattgtg
cagaagattacacttccctatgttggtaaaaactgtccactaagtcaacagtcccccacg
tttttggcaccagggactggtttcatggacaatttttccacggacctgcggtcggggata
gactctaagtctcaagttaatggcgctaagctagattttccccccaagtacaattccgtc
agccgtcgtcggcgcagctggatttccgtaagcaccagcactcgatgtgtcttaaatgtg
atcagtagtgatgggcatcgccgagcagcggcggcggcggcagcagcagcggcagccgca
acatctgggggaaacttaaggtggtcacgtaggtttccccgaggctgcggcgcaaaagag
cccgcggcttcctggggcagccgcagggcatctgatggcaacgcctccccagccgccggc
gagctgggggcagcagggagtaggcaaaggcggggctgcagattagggttgagcagcccg
ggatcccgaatagttccacggagccatcggccggtactgagaacacgagtaggggggcaa
gttgcaaaagggtcccggctgcctggaaacgagccacaccgagaaaggaggggcggctgg
gacgctctgcgcgagtag

>gi568815596r:156225747_156430186|GENSCAN_predicted_peptide_7|112_aa
MGRNQSRKAENSKNQTASSPPKECSSLPAMEQSWMENDFDELREEGFRRSVTTNFSKLKE
DVRTHRKEAKNLEKRLDEWLTIINSVEKTLNDLMELKTMAQELCDACTSFSS

>gi568815596r:156225747_156430186|GENSCAN_predicted_CDS_7|339_bp
atggggagaaaccagagcagaaaagctgaaaattctaaaaatcagactgcctcttctcct
ccaaaggaatgcagctccttgccagcaatggaacaaagctggatggagaatgactttgat
gagttgagagaagaaggcttcagacgatcggtaacaacaaacttctccaagctaaaggag
gatgttcgaacccatcgcaaagaagctaaaaaccttgaaaaaagattagacgaatggcta
actataataaacagtgtagagaagaccttaaatgacctgatggagctgaaaaccatggca
caagaactatgtgacgcatgcacaagcttcagtagctga

>gi568815596r:156225747_156430186|GENSCAN_predicted_peptide_8|151_aa
MKPHSRALLLDVGSLGSWRDPVPCAATAAGLPGAQCLPELAPPQAYCAGSADPGGRGVSG
LWLCCRVEPRSKPRARAQRSGRAASVASGAHGDWFMEQKGGDLLFAHENVMDLPEHSSPP
KINAHLRKTTCSHQIPIPCLCKGYSQITPCR

>gi568815596r:156225747_156430186|GENSCAN_predicted_CDS_8|456_bp
atgaagcctcactccagagcacttctgctggacgtcgggagcctgggatcctggagggat
ccggtgccctgcgcggccacagcggcgggactcccgggtgcccaatgcctgccggagctg
gcacctccccaggcttattgcgcaggcagcgcagatccaggaggccgtggtgtgtccggc
ctctggctttgctgccgggtagaaccccggtccaaacctcgagcccgggcccagcgctcg
ggccgcgccgcgagcgtcgcctcgggtgctcatggagactggttcatggagcagaaaggt
ggagaccttctctttgcacacgaaaatgttatggatttacctgaacactcttcacccccg
aaaataaatgcacatctcaggaaaaccacgtgttcacaccaaatccccataccttgctta
tgtaagggttacagccagataaccccttgcagataa