GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:06:07 Sequence gi568815590r:80876547_81093227 : 216681 bp : 41.05% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 PlyA - 1017 1012 6 -0.45 1.03 Term - 1534 1388 147 1 0 96 43 98 0.310 3.02 1.02 Intr - 3766 3625 142 0 1 55 36 70 0.338 -2.07 1.01 Init - 4482 4373 110 0 2 67 107 69 0.603 6.44 1.00 Prom - 4701 4662 40 -6.75 2.00 Prom + 7481 7520 40 -3.65 2.01 Sngl + 10520 10735 216 1 0 37 54 210 0.509 7.42 2.02 PlyA + 13104 13109 6 1.05 3.00 Prom + 14335 14374 40 -8.45 3.01 Init + 17193 17451 259 2 1 63 36 220 0.974 10.42 3.02 Term + 18885 19111 227 1 2 85 46 235 0.997 14.96 3.03 PlyA + 19319 19324 6 1.05 4.06 PlyA - 21385 21380 6 1.05 4.05 Term - 29987 29904 84 2 0 60 49 127 0.264 2.67 4.04 Intr - 34659 34303 357 0 0 69 97 125 0.124 6.03 4.03 Intr - 40591 40455 137 2 2 74 77 64 0.029 3.27 4.02 Intr - 55869 55786 84 1 0 79 93 71 0.149 5.57 4.01 Init - 69949 69898 52 1 1 80 90 27 0.504 3.47 4.00 Prom - 77626 77587 40 -5.25 5.13 PlyA - 79907 79902 6 1.05 5.12 Term - 81594 81409 186 0 0 37 43 147 0.331 1.61 5.11 Intr - 82498 82446 53 2 2 102 88 30 0.621 2.11 5.10 Intr - 93494 93404 91 2 1 88 75 77 0.344 5.05 5.09 Intr - 94361 94158 204 2 0 119 51 68 0.085 4.67 5.08 Intr - 99434 99280 155 0 2 62 26 84 0.007 -1.53 5.07 Intr - 100360 100036 325 1 1 103 -31 315 0.046 15.52 5.06 Intr - 103948 103889 60 1 0 57 107 64 0.344 3.21 5.05 Intr - 108831 108230 602 1 2 46 86 610 0.861 47.90 5.04 Intr - 110920 110824 97 1 1 107 98 109 0.996 12.46 5.03 Intr - 113344 113111 234 1 0 94 45 107 0.642 3.96 5.02 Intr - 114984 114933 52 2 1 90 77 18 0.607 -1.11 5.01 Init - 116681 116557 125 2 2 87 94 152 0.946 13.41 5.00 Prom - 117692 117653 40 -6.55 6.04 PlyA - 118355 118350 6 -0.45 6.03 Term - 118494 118430 65 1 2 80 48 65 0.539 -1.23 6.02 Intr - 119720 119485 236 1 2 41 61 188 0.524 7.81 6.01 Init - 134904 134888 17 0 2 91 95 29 0.045 3.61 6.00 Prom - 142077 142038 40 -4.15 7.00 Prom + 142204 142243 40 -7.45 7.01 Sngl + 142386 142757 372 2 0 44 42 244 0.832 11.27 7.02 PlyA + 144295 144300 6 1.05 8.11 PlyA - 144889 144884 6 1.05 8.10 Term - 150252 150089 164 1 2 80 39 141 0.258 5.52 8.09 Intr - 157261 156939 323 0 2 82 86 122 0.713 5.98 8.08 Intr - 167564 167332 233 2 2 29 74 111 0.167 -0.55 8.07 Intr - 174280 174191 90 1 0 39 20 135 0.356 1.07 8.06 Intr - 177545 177415 131 0 2 36 105 115 0.605 7.49 8.05 Intr - 179162 178995 168 0 0 15 88 116 0.374 3.30 8.04 Intr - 197353 197213 141 2 0 140 64 49 0.973 7.10 8.03 Intr - 197541 197416 126 1 0 83 86 56 0.948 4.63 8.02 Intr - 197802 197705 98 2 2 90 105 54 0.680 6.03 8.01 Intr - 213326 213137 190 0 1 97 73 46 0.453 1.72 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 100360 99998 363 1 0 103 43 320 0.911 22.58 S.002 Term + 188390 188574 185 0 2 76 36 179 0.879 8.22 S.003 Term - 196970 196840 131 0 2 46 54 100 0.942 -0.24 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:80876547_81093227|GENSCAN_predicted_peptide_1|132_aa MTDTAAEYSHVMDTYNSILILSVTLNSIKALEFLQAWPSITAIHPWPQVSAHDSDWAIQK SFTGIFPADAHSKDFLCTCSQGGQQCFSNFDQSTRTVGENGFYLQEEDRVQGCEFSFKVH TKLHTILMSVLI >gi568815590r:80876547_81093227|GENSCAN_predicted_CDS_1|399_bp atgacagatactgcagcagaatacagccatgtcatggatacatataacagcatactcata ctctcagtaaccctgaatagtatcaaagctttagaatttctacaagcctggccatccatc acagccatccacccttggccacaagtgtccgctcatgattcggactgggccatccagaaa tcattcactgggatttttccagctgatgcacatagcaaagacttcctctgcacctgttca caaggtggacaacagtgcttctcaaactttgaccaaagcacgcggacagtaggggagaat ggattctatctgcaggaagaagacagagtccagggctgcgagttttcattcaaagttcac actaaacttcacactattcttatgtctgttttaatataa >gi568815590r:80876547_81093227|GENSCAN_predicted_peptide_2|71_aa MKEQLKLAPTNGVSAAGLCDSLQCLSGVKKVNSVAINCGSQSMVHGPAAAAAAPENLQEM QNLRPHPRPTE >gi568815590r:80876547_81093227|GENSCAN_predicted_CDS_2|216_bp atgaaagagcagctgaaattggctcccacaaatggtgtttctgctgcaggactgtgtgat agtctccagtgtctctcaggagtgaagaaagtgaactcagtggctataaactgtggttct caaagtatggtccatggaccagcagcagctgctgcagcccctgaaaacctgcaagaaatg caaaacctcaggccacaccccaggccaactgaatga >gi568815590r:80876547_81093227|GENSCAN_predicted_peptide_3|161_aa MRIPPGRRAPVGFHFSARAHAHSERTPRPGFRPRLLPAGSRLPVNTRGFAERGGCYWRSP SRPPLRLAPELSVTRTVQRDFGATAGRKPAAMSSAALWSGPAGKELMSLANSQQGSEALR PKTIKELNAANNHGRELRSGFSTSPSLEMTAALADTLIAAL >gi568815590r:80876547_81093227|GENSCAN_predicted_CDS_3|486_bp atgcggattcctcctgggcggcgggccccagtaggctttcacttttctgcaagggcgcac gcccactccgagcgaactccacgtcctggttttcgtcctcgcctccttcccgccggcagc cggctgcctgtaaacacgcgggggttcgcagaaagaggcggctgctattggaggagccca tccaggcctcccctacgcctagctccggagctgtcagttactcgaacagtgcaacgagat tttggcgccacagctgggaggaagccagctgccatgtcatcagccgccctgtggagtggc cctgctggcaaggaactgatgtctttagccaacagccagcaaggatctgaggccctcaga ccaaaaaccatcaaagaactaaatgctgccaacaaccacgggagggagctgagaagtgga ttctccaccagtcccagcctagagatgactgcagccctggctgacaccttgattgcagct ttgtga >gi568815590r:80876547_81093227|GENSCAN_predicted_peptide_4|237_aa MTSFDSKFSHPGHADARAVEAHPGWDRCQTVSDEGEMAMKTGAEAERAPLLHAEDYLFTV CKLIPPIKLSFLLFSNSRDLLEGKMQIPLEESRHSLSFVKQEMKPCPNPLSPVNISLSSQ HNCLAGTDVFICQLDPHKNLQPKWTGPYTVILSKPTAARVQGLPHWIHHTRLKLTPKATP SSKTLTAGNTLRVPVYNNLNNNNNNKRSLKSTSGSLEEQSLLLLRSPHIAPSESEKE >gi568815590r:80876547_81093227|GENSCAN_predicted_CDS_4|714_bp atgacctcctttgactccaagttctcacatccaggtcatgctgatgcaagagcagtagaa gctcaccctggctgggataggtgccaaacagtcagtgatgaaggagagatggctatgaaa actggagccgaagctgagagagcacctttgctccatgctgaagactatctcttcacagtt tgcaaactgataccgccaataaagctctcctttctactatttagcaattctagggatctt ttggaaggcaaaatgcaaataccactggaagagtccagacacagtctttcatttgtgaag caggaaatgaagccatgccccaaccccctctcccccgtcaacatctccttgtcctctcaa cataactgtcttgcaggcacagacgtgtttatctgccaactcgaccctcacaaaaaccta caaccaaagtggacaggcccctacactgtgatactcagcaagccaactgcagcgagagtc caaggactcccccactggatccatcacaccaggctcaagctcacccccaaggctactcct tcctccaaaactttaacagcgggcaacaccctcagagtccctgtatataataacctaaac aacaacaacaacaacaaaagatccttaaagtccaccagcgggtccctggaagagcagagc ctgctgcttctgcgaagccctcatattgctccatctgaaagtgagaaagaataa >gi568815590r:80876547_81093227|GENSCAN_predicted_peptide_5|727_aa MGPAGSLLGSGQMQITLWGSLAAVAIFFVITFLIFLCSSCDREKKPRQHSGDHENLMNVS RGSSAVWPGPPPTAAAAPGSSLEMRIFLGVRPSKRLASTPGDPVVCSSVRATGLELGFPL FYSRSAAALGEIGREWQPSDKEMFSRSVTSLATDAPASSEQNGALTNGDILSEDSTLTCM QHYEEVQTSASDLLDSQDSTGKPKCHQSRELPRIPPESAVDTMLTARSVDGDQGLGMEGP YEVLKDSSSQENMVEDCLYETVKEIKEVAAAAHLEKGHSGKAKSTSASKELPGPQTEGKA EFAEYASVDRNKKCRQSVNVESILGNSCDPEEEAPPPVPVKLLDENENLQEKEGGEAEES ATDTTSETNKRFSSLSYKSREEDPTLTEEEISAMYSSVNKPGQLVNKSGQSLTVPESTYT SIQGDPQRSPSSCNDLYATVKDFEKTPNSTLPPAGRPSEEPEPDYEAIQTLNREEEKATL GTNGHHGLVPKENDYESITRQSSPKSCASAPGDLDPAGEGMGRDVPGVSSVFVPSYFGIR YQQPLPKVLEDMPLCLSSTDHSAAISQTLPAGTEPCLVTALPNSQAAKMRPQVHLGRQML VLFVTTFAKTCTLLLTSFETVPKDDVKLQPVCSDPPGPKKTVSASACKGPESAYKVRCHT GHFLPGPLLAASESTGKEDEKKRSKNTFNLAIRSFCHEWHPNAGSLKWRSSGTFRDSQRG TFVLHQQ >gi568815590r:80876547_81093227|GENSCAN_predicted_CDS_5|2184_bp atggggcccgcggggagcctgctgggcagcggacagatgcagatcaccctgtggggaagt ctggctgctgtcgccattttcttcgtcatcaccttcctcatcttcctgtgctctagttgt gacagggaaaagaagccgcgacagcatagtggggaccatgagaacctgatgaacgtgagc aggggttcttctgcagtgtggcctggacctccacccacagcagcagcagcacctgggagc tccttagaaatgcgaattttcctgggggtgaggcccagtaaaaggctcgcaagcacccca ggtgatccagttgtatgctcgagtgtgagagccactggcctagagcttgggttcccgctg ttctactcaaggtcagcagcagccctgggcgagataggaagagagtggcagccttcagac aaggagatgttcagccgttcagttactagcctggcaacagatgctcctgccagcagtgag cagaatggggcactcaccaatggggacattctttcagaggacagtactctgacctgcatg cagcattacgaggaagtccagacatcggcctcggatctgctggattcccaggacagcaca gggaaaccaaaatgtcatcagagtcgggagctgcccagaatccctcccgagagcgcagtg gataccatgctcacggcgagaagtgtggacggggaccaggggctggggatggaagggccc tatgaagtgctcaaggacagctcctcccaagaaaacatggtggaggactgcttgtatgaa actgtgaaagagatcaaggaggtggctgcagctgcacacctggagaaaggccacagtggc aaggcaaaatctacttctgcctcgaaagagctcccagggccccagactgaaggcaaagct gagtttgctgaatatgcctcggtggacagaaacaaaaaatgtcgtcaaagtgttaatgta gagagtatccttggaaattcatgtgatccagaagaggaggccccaccacctgtccctgtt aagcttctggacgagaatgaaaaccttcaggagaaggaagggggagaggcggaagagagt gccacagacacgaccagtgaaactaacaagagatttagctcattgtcatacaagtctcgg gaagaagaccccactctcacagaagaagagatctcagctatgtactcatcagtaaataaa cctggacagttagtgaataaatcggggcagtcgcttacagttccggagtccacctacacc tccattcaaggggacccacagaggtcaccctcctcctgtaatgatctctatgctactgtt aaagacttcgaaaaaactccaaacagcacacttccaccagcagggaggcccagcgaggag ccagagcctgattatgaagcgatacagactctcaacagagaggaagaaaaggccaccctg gggaccaatggccaccacggtctcgtcccaaaggagaacgactacgagagcataactcgc cagtcctccccaaagagttgcgcatcagcacctggggatctggaccctgcgggtgaaggg atggggagggacgtccctggagtctcttctgtctttgttccttcttattttggcattcga tatcagcagcctctccccaaagtacttgaagacatgccattatgtctttccagtacggat cacagtgctgccatatcacagaccctgccagccggcacagaaccatgcctcgtcacagct ctgcccaactcacaggctgccaagatgaggccacaggtccacctagggcggcagatgctg gtgctatttgtcacgacttttgccaaaacatgtacactgttgctcacatctttcgaaaca gtgcctaaggatgatgtcaagttacagccagtttgctctgatcccccaggacctaagaaa actgtgagtgcttcagcatgcaaagggcctgagagcgcatacaaagtcagatgccacact ggacacttcctgcctggccccctccttgcagcgagtgaaagcacaggaaaagaggatgaa aagaaaagaagtaaaaacacctttaacctagctatccggtccttctgtcatgaatggcat ccaaatgctggctctttgaagtggagaagcagtgggaccttcagagacagccaaaggggg acatttgtgctgcaccagcagtga >gi568815590r:80876547_81093227|GENSCAN_predicted_peptide_6|105_aa MEEGERPLKKSRWQLLVQPLNPPALPTTTGTGVPQSSCRAALRGSKRRKLPPQLGPLSLR TAVHLTQKRAPDDDHSVRCENFVHRGTLVCCEATPAVLMSVLRDG >gi568815590r:80876547_81093227|GENSCAN_predicted_CDS_6|318_bp atggaggaaggtgaaaggcccctaaagaagagtaggtggcagttgctggtgcagccactc aacccgccagcactgcccacgaccacggggactggtgtcccacagtcctcctgcagagca gcacttagagggtccaagaggagaaagctgcccccgcaactggggcctttgtccctgagg actgcagtgcatctgactcagaaacgtgcacctgatgatgatcattctgttcgctgtgag aattttgtccacagaggcaccctggtttgctgcgaagccacccctgccgtcttgatgtct gtgctcagggatggctga >gi568815590r:80876547_81093227|GENSCAN_predicted_peptide_7|123_aa MWQSLELPRHLLNGFDKNADSDMDNKVPAKVVSDGNGKLVGNWSKDHSCYAKRLAALCPC PRGLWNFEFETNDLGYLVEEISKQQSIQEEAENKSLENLQPDDAIKKKNPFSGRNSSQLQ KFA >gi568815590r:80876547_81093227|GENSCAN_predicted_CDS_7|372_bp atgtggcaaagtttggaacttcctagacacttattgaatggctttgataaaaatgctgat agtgatatggacaataaagtccctgctaaagtggtctcagatggaaatggcaaacttgtt gggaactggagtaaagatcactcttgctatgcaaagagactggcagcattgtgcccttgc cctagaggtctgtggaactttgaatttgagacaaatgatttaggatatctggttgaagaa atttctaagcagcaaagcattcaagaggaagcagagaataaaagtttggaaaatttgcag cctgatgatgcaattaaaaagaaaaacccattttctgggagaaattcaagccagctgcag aaatttgcctaa >gi568815590r:80876547_81093227|GENSCAN_predicted_peptide_8|554_aa XLFCFLNEHAGRPFFMDSHVSACHASRGTNNPFVLNCLFKDVCNVAVEDKAVEAKDSVSL QSKGFFAAPSSLMILTGKTPTLRNTALSPRHIFTYTANNLAAFPGSAHMSARVPTPGQCS PQWVWKHQACCSSLRGAQGLCSANIFSVLPFLTPQLKAAPTPQAVPTSTVPLWFSSVIVT AMLLDENKIPRNPTYKGCEGPLQGELQTTAQRNKRGHKQMEEHSMLMDRKNQYRENGHTA QGLANYDLQAEYILAPIFVNKVLLEHRHTRPLATVDGCFRAIVSELMQQECEDAIAELTH QLMDSSTYFSPFIASKVTWRVVWRMNAKGARVVGNLLALTVQEALWAVYFILHTPDSSSL KRRRKRNLHIHSKTRKSGRTTISGEGYLRIPRRVSVSQNGALFQFCWVALARTRVQVSLV KTHPELEFTCLGLLMNVIISIKKSPTVLMKTVREPDVTETAGRGIPWTGYVGFRERSCLT KSSVSEVLKFHSVEKEINIEGTPWRCCLQQQNWQVPGRHLRVQGNCWPICDSAAMPPILP ACSFPAEFPVTTET >gi568815590r:80876547_81093227|GENSCAN_predicted_CDS_8|1665_bp natttgttttgttttttaaatgagcatgctgggagaccattctttatggactctcacgtt tctgcatgtcatgcaagcagaggcactaacaacccttttgttctaaactgtcttttcaag gatgtttgtaatgtagctgtagaagataaagctgtagaagctaaagatagtgtttccctc cagagcaaagggttctttgctgctccctcttccctcatgatcctcactggcaaaacccca accctgcgtaacacagctctctccccacgccacatcttcacctacactgctaataatctt gctgcctttcctggatctgcccacatgtctgcacgtgtgcccacacctggccagtgctcc cctcagtgggtgtggaagcaccaggcatgctgtagctctctgaggggtgcccaaggtctg tgctcagctaacattttctcagtgctgcctttcctgactccccaactcaaagcagcgccc actccccaggcagtccctacctccacagtgcctctgtggttttcttcagtgattgtcaca gctatgctgctagatgagaataaaatacctaggaatccaacttacaagggatgtgaagga cctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggacacaaacaaatg gaagaacattccatgctcatggataggaagaatcaatatcgtgagaatggccatactgcc caagggttggcaaactatgacctgcaagctgaatacattctggcacctatttttgtaaat aaagttttattggaacacaggcacactcgtcctttagctactgttgatggctgctttcgt gcaatagtgtcagagctcatgcagcaggaatgtgaggatgcgattgcggagttgactcac cagctgatggactcaagcacctacttctctcctttcattgcctcaaaagtcacttggaga gttgtgtggaggatgaatgcaaaaggggcaagagtagttgggaacctcttggccttgacc gttcaagaagccttgtgggctgtttactttatccttcacactccagattcttcaagttta aaaagaagaagaaaaagaaatctgcacatccacagcaaaactagaaagtcagggagaact accatcagtggagaaggatatttaagaattcccagaagggtttcagtgtcccagaatgga gctctgtttcagttctgctgggtggctttagccagaacaagagttcaggttagtttggta aaaacacatcctgaactggagttcacttgcttgggccttcttatgaatgtcatcattagt attaagaagtcaccaaccgtcctcatgaaaactgtcagagagccagatgtaacagaaact gctgggcgtggcattccatggacaggatatgtgggtttcagggagaggtcatgtttgact aaatcgtcagtatcagaagtcctaaaattccattcggtggaaaaagaaatcaatattgaa gggactccttggaggtgctgcctgcagcagcagaactggcaggtccctggaagacacctt cgtgtgcagggtaattgctggcccatctgtgactcagcagccatgcctcctatcctgcct gcttgttcctttcctgcggaatttccggtaaccactgagacctga