GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:24:43 Sequence gi568815596f:138450217_138669077 : 218861 bp : 38.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 6466 6571 106 0 1 44 53 122 0.085 4.73 1.02 Intr + 11177 11283 107 0 2 48 82 53 0.146 -0.29 1.03 Term + 12928 13053 126 0 0 87 42 102 0.387 2.80 1.04 PlyA + 13382 13387 6 1.05 2.08 PlyA - 15337 15332 6 1.05 2.07 Term - 19402 19151 252 1 0 29 37 181 0.046 1.75 2.06 Intr - 44329 44161 169 0 1 61 36 60 0.034 -2.87 2.05 Intr - 46267 46083 185 1 2 126 71 94 0.067 9.16 2.04 Intr - 54264 54105 160 2 1 36 75 51 0.001 -2.33 2.03 Intr - 55841 55628 214 0 1 68 50 91 0.003 0.15 2.02 Intr - 61653 61558 96 1 0 7 105 76 0.025 0.26 2.01 Init - 74614 74509 106 1 1 63 98 87 0.570 7.63 2.00 Prom - 85558 85519 40 -5.05 3.04 PlyA - 85750 85745 6 1.05 3.03 Term - 86650 86534 117 1 0 71 48 84 0.365 0.26 3.02 Intr - 90281 90114 168 2 0 18 86 105 0.324 2.52 3.01 Init - 91285 91106 180 1 0 70 41 134 0.502 6.13 3.00 Prom - 92227 92188 40 -6.15 4.02 PlyA - 92396 92391 6 1.05 4.01 Sngl - 93639 93250 390 0 0 88 54 401 0.967 32.67 4.00 Prom - 99547 99508 40 -5.55 5.00 Prom + 99728 99767 40 -4.35 5.01 Init + 100330 100388 59 0 2 67 95 13 0.456 0.73 5.02 Intr + 100687 100838 152 1 2 81 106 71 0.994 7.09 5.03 Intr + 102338 102465 128 0 2 81 90 41 0.984 3.08 5.04 Intr + 108806 108983 178 1 1 83 80 91 0.906 6.37 5.05 Intr + 109066 109121 56 2 2 66 92 46 0.894 0.58 5.06 Intr + 110589 110711 123 2 0 61 106 115 0.996 10.46 5.07 Intr + 114492 114634 143 2 2 32 87 137 0.890 6.23 5.08 Term + 118720 118864 145 1 1 65 44 145 0.980 4.20 5.09 PlyA + 119948 119953 6 1.05 6.02 PlyA - 120193 120188 6 1.05 6.01 Sngl - 125814 125161 654 0 0 54 49 260 0.682 14.62 6.00 Prom - 131757 131718 40 -2.85 7.00 Prom + 149311 149350 40 -4.75 7.01 Init + 154249 154343 95 0 2 78 89 35 0.224 2.57 7.02 Intr + 159266 159363 98 2 2 73 96 12 0.189 -0.77 7.03 Intr + 162551 162715 165 0 0 73 44 184 0.546 11.51 7.04 Intr + 166121 166257 137 0 2 35 15 151 0.485 1.87 7.05 Intr + 169396 169481 86 0 2 16 23 154 0.112 -0.60 7.06 Intr + 171967 172094 128 1 2 50 80 53 0.128 0.10 7.07 Intr + 187559 187704 146 0 2 104 33 75 0.149 2.68 7.08 Intr + 200468 200554 87 1 0 79 84 62 0.290 4.15 7.09 Intr + 212371 212509 139 0 1 148 72 -6 0.056 2.92 7.10 Term + 215880 216040 161 1 2 77 42 59 0.019 -2.68 7.11 PlyA + 216141 216146 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:138450217_138669077|GENSCAN_predicted_peptide_1|112_aa MASKTSTQENQCTKHNYNQGPSQSPRHSLTTSTGAGLHMPWWFAAPINPSSTLGISSDAI PPLTPIPHQAPVMLMEEVGSHGLTQYLTKELKDLYNKNYKTLLKEIIDDTNK >gi568815596f:138450217_138669077|GENSCAN_predicted_CDS_1|339_bp atggctagcaaaactagcactcaagaaaaccagtgcactaaacacaactacaaccaagga ccttcacagagtccacgacactcccttactacctccactggagcaggtctacacatgcca tggtggtttgctgcacccatcaacccgtcatctacattaggtatttcttctgatgctatc cctcccctaactcccattcctcatcaggccccggtcatgctgatggaagaggtgggctcc catggccttacacaatacctaaccaaggagttgaaagacctctacaacaaaaactacaaa acactgctgaaagaaatcatagatgacacaaacaaataa >gi568815596f:138450217_138669077|GENSCAN_predicted_peptide_2|393_aa MLNGFDQNANNDMDNEIQAEVVSDGDEECVRNYSKGWLYAPSLYALMPGELWSKADLRLS RLDFILGGLSTKCLAWILAMVICSSLFPINQPHVLKFSSYWTFQVQKVEIANITWVTTVH PLGFCLKVISSEKHSLMDCSISFKSLILGELFFFNLPCLYKFMAKNADIDIRKAKKLNVG RNYVEAVRDENQVTEKEELWLALEPGYRPRGSLKLSTLFSQRPPASAPALIGSRGATDRS PPLLSLLSHHHVSRCLLTFPCALSVSFSKGAFYESVTCGIPATLLCGFSDQLYRLRTCTR KPKYPQQLPQSLSQEPLGAFLMATAAWLTTVFKQPGCAPELHWASFHNYGSVSITLISEC GRHLNKNHESHFTNQDTQDVRLSDLSYQGHKAS >gi568815596f:138450217_138669077|GENSCAN_predicted_CDS_2|1182_bp atgttgaatggctttgaccaaaatgctaataatgatatggacaatgaaatccaggctgag gtggtctcagatggagatgaggaatgtgttaggaactacagcaaagggtggctgtatgct cctagtctctatgctctgatgcctggagagctctggagtaaagcagacctacgtttgtcc agacttgacttcattttgggaggattaagtacaaaatgcttagcgtggattttggccatg gtgatctgctccagtctattccccatcaaccaacctcacgtgctgaagttcagttcttac tggaccttccaggtccagaaagttgaaattgccaacatcacttgggtcactacagttcat cctttaggtttctgtctaaaagtcatttcttcagagaagcattccctaatggattgttct ataagctttaaaagtctgattctaggggagttatttttcttcaacctgccctgtttatat aaattcatggccaaaaatgcagatatagacattaggaaagccaagaaactaaatgttgga aggaactatgtggaagcagtgagagatgaaaatcaggtaacagagaaagaggagttgtgg ttggccctggaacctggatacagacccaggggaagtctaaagctctccactctgttcagc cagaggccaccagcctctgcaccagccctaattggctctagaggtgcaactgatcggtct cctcctcttttatctctgctcagtcatcaccatgtctccaggtgcctgcttacatttccc tgtgctttgagtgtctccttctctaaaggagccttttatgagtcagttacatgtggcatc cctgctacacttctctgtggcttttctgatcagttatacagattaagaacatgtacaaga aagccaaaatatccccagcaattaccccagtctttatcacaggaaccactgggtgccttt ctcatggccacagcagcttggcttacaaccgtcttcaaacagccaggctgtgccccagaa cttcactgggcttccttccataactatggatctgtgagcatcactttaatttcagagtgt ggaagacaccttaataagaatcatgaatcacattttacaaatcaggatacacaggatgta aggttaagtgacctgtcctatcagggccacaaagccagttaa >gi568815596f:138450217_138669077|GENSCAN_predicted_peptide_3|154_aa MDEFLNTYTLPRLNQEEVESLNRPITGSEIEALINSLPTKKSAGPDGFTAEFYQRYKEEL RIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRISIVKMAILPKSLGL EMFACGKKWTIKQTAFNLFPEKRSSFARKQASMI >gi568815596f:138450217_138669077|GENSCAN_predicted_CDS_3|465_bp atggatgaattcctcaacacatacaccctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggctctgaaattgaggcattaattaatagcttaccaaccaaa aaaagtgcaggaccagatggattcacagctgaattctaccagaggtacaaggaggagctg agaataaaatacctaggaatccaacttacaagagatgtgaaggacctcttcaaggagaac tacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgc tcatgggtaggaagaatcagtatcgtgaaaatggccatactgccaaagagcctgggcttg gagatgtttgcctgtggaaagaagtggaccataaagcagactgcttttaatctcttccca gagaaaagaagttcatttgcaaggaaacaggcaagcatgatctga >gi568815596f:138450217_138669077|GENSCAN_predicted_peptide_4|129_aa MGKKQSRKTGNSKNQSASPPPKERSSSLATEQSWMENDFDELREEGFRRSNYSKLKEEVQ THGKEVKNLEKKLDEWLTRITNAEKSLKDLMELKTKTRELRDECTSLSSQCDQLKERVSV MEDEMNEVK >gi568815596f:138450217_138669077|GENSCAN_predicted_CDS_4|390_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaatcagagcgcctctcctcct ccaaaggaacgcagctcctcactagcaacagaacaaagctggatggagaatgactttgac gagttgagagaagaaggcttcagacgatcaaactactccaagctaaaggaggaagttcaa acccatggcaaagaagttaaaaaccttgaaaaaaaattagacgaatggctaactagaata accaatgcagagaagtccttaaaggacctgatggagctgaaaaccaagacacgagaactc cgtgatgaatgcacaagcctcagtagccaatgcgaccaactgaaagaaagggtatcagtg atggaagatgaaatgaatgaagtgaagtga >gi568815596f:138450217_138669077|GENSCAN_predicted_peptide_5|327_aa MGEVLKSSTFSSGPSDKMKWCLRVNPKGLDDESKDYLSLYLLLVSCPKSEVRAKFKFSLL NAKREETKAMESQRAYRFVQGKDWGFKKFIRRDFLLDEANGLLPDDKLTLFCEVSVVQDS VNISGHTNTNTLKVPECRLAEDLGNLWENTRFTDCSFFVRGQEFKAHKSVLAARSPVFNA MFEHEMEESKKNRVEINDLDPEVFKEMMRFIYTGRAPNLDKMADNLLAAADKYALERLKV MCEEALCSNLSVENVADTLVLADLHSAEQLKAQAIDFINSQATDIMETSGWKSMIQSHPH LVAEAFRALASAQCPQFGIPRKRLKQS >gi568815596f:138450217_138669077|GENSCAN_predicted_CDS_5|984_bp atgggtgaagtgttaaaaagttcaacattttcatctggcccaagtgacaaaatgaaatgg tgcctgagggtaaacccaaagggattagatgatgaaagtaaagactacttgtccttatat ttgcttttagtcagctgccccaaaagtgaagttcgagcaaaattcaaattttcccttctg aatgctaaaagggaagaaacaaaagcaatggaaagccaaagagcatatcgatttgtgcaa gggaaggactggggttttaaaaaattcattagaagggactttttgcttgatgaagctaat ggtcttttaccagatgacaagcttacattattttgtgaggtgagtgtggtccaagattca gtaaacatatcaggacatactaatacaaatactttgaaggtgcctgagtgtcgtctagca gaagatttaggtaatctctgggaaaacacaagatttacagactgcagttttttcgtgaga ggacaagaatttaaagctcataaatctgtgcttgcagctcgatctccagtttttaacgcc atgtttgaacatgaaatggaagaaagcaaaaagaatcgagtggaaataaatgatttagac cctgaagtttttaaagaaatgatgagattcatttacacagggagagcaccaaaccttgac aaaatggctgacaacttgttggcagctgcagacaaatatgcactggaacggctgaaggtc atgtgcgaagaagctttgtgtagtaacctctcagtagagaatgttgcagatacccttgtc cttgcagatttgcacagtgcagaacagttgaaagcacaagccatagactttattaatagc caagcaaccgacataatggaaacatcagggtggaagtccatgattcagtctcaccctcat ttagtagcagaagcctttcgagcactagcatctgcacagtgtccacagtttggcattcca cgcaaacggctaaaacagtcctga >gi568815596f:138450217_138669077|GENSCAN_predicted_peptide_6|217_aa MTVYLENPIISAPNLLKLISKFSKVSGYIINVQKSQAFLYTNNRQIESQIMSELPFTIAS KRIKYPGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGRINIVKMAILPKVIY RFNAIPIKPPMTFFTELEKTTFKFIWNQKRACIAKTILSQKNKAGGIMLPDFKLYYKATV TKTAWYWYQNRDIDQWNRTEPSEIIPHIYNHLIFESP >gi568815596f:138450217_138669077|GENSCAN_predicted_CDS_6|654_bp atgactgtatatttagaaaaccccatcatctcagccccaaatctccttaagctgataagc aagttcagcaaagtctcaggatacataatcaatgtgcagaaatcacaagcattcttatac accaataacagacaaatagagagccaaatcatgagtgaactcccattcacaattgcttca aagagaataaaatacccaggaatccaacttacaagggatgtgaaggaccttttcaaggag aactacaaaccactgctcaacgaaataaaagaggacacaaacaaatggaagaacattcca tgctcatggataggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttat agattcaatgccatccccatcaagccaccaatgactttcttcacagaattggaaaaaact actttcaaattcatatggaaccaaaaaagagcctgcattgccaagacaatcctaagccaa aagaacaaagctggaggcatcatgctacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagag ccctcagaaataataccacacatctacaaccatctgatctttgaatcaccatag >gi568815596f:138450217_138669077|GENSCAN_predicted_peptide_7|413_aa MKTSANLLGVCSGAIFALLMKTKAVDPLNPPSVLSLIPLVTNVGKAVVFLKHPSCFLTFF STCTAAAAALSEFTQEQHDGAQPSPKCLAEELGDAWTIQIEASWKYRAVNTNQRGKLLAR VTQIHANEDSGESRHQRWLDGIVRFPICQTPVLALHLMITFGTKQTDDRCCFAVQGGIIS NAGLLPEENLPLTGNYNSTRDVAGDTQPNHINGCGISEVVVWVCDQWESMKECSNVDQMA ASGKYDHYYHLVNCDYEASHSTISALPALESSGLIINMQIPGAPRKFNPSHGEAVMESGN NRPRFSHPSAANPRDSFTCFSYLDFGKHYVQHLVSVMFQDELSVGLHSKFTEKASDWPSM GHLSQSAEGKRTQRSMGILLGESTPKNAHTLSSHTPLAVAQQQGHTILHEMQG >gi568815596f:138450217_138669077|GENSCAN_predicted_CDS_7|1242_bp atgaaaacaagtgcaaatctgttaggagtatgttctggggcaatttttgctctcctgatg aagacaaaggctgttgatccactgaacccacccagtgttctttccttgatacccttagtg actaatgtgggaaaggcagtggtgttcctcaaacatcccagttgcttcctcacgtttttc agcacctgcacagcggctgctgcagctctgtctgaattcacacaggagcaacatgatggt gctcagccctcgccgaagtgtcttgctgaagagttgggagatgcttggactattcagata gaagccagctggaagtacagggcagtcaacacaaaccagagaggcaaacttttggccaga gtaacccagattcatgctaatgaggacagtggggaaagccgtcatcaaaggtggctcgat gggattgttcgctttccaatctgtcagaccccagttctggctctgcatttgatgattaca tttggtaccaaacagactgacgaccgctgctgtttcgctgtacaaggtggcatcatcagc aatgcaggactgctcccggaggaaaatctgcctctgacggggaattataactcaacacga gatgtggctggggacacacagccaaaccatatcaatgggtgtggcatcagtgaggtagtg gtgtgggtttgtgatcagtgggaaagcatgaaagaatgctcaaatgtggaccaaatggct gcctctggcaaatatgatcattattatcatctagtgaattgtgactatgaagcttctcac agcacaatctcggcactaccagcattagaatcatcagggttaataattaatatgcagatt cctggggccccacggaaattcaatccttctcatggtgaagcagtcatggaatctgggaat aacagaccccgcttctcccatccctcagctgccaacccgagggacagtttcacttgcttc tcttacttagattttggaaaacattatgtccaacatctcgtaagtgtaatgttccaagat gaactgtctgttggtcttcattctaaattcacagagaaggcttctgactggcccagtatg gggcatctatcccagtcagccgaagggaaaagaacacagagaagtatgggaatactcttg ggtgagtctactcctaaaaatgcacacaccctttcctctcacactccactagctgtggct cagcaacaaggccacaccatactacatgagatgcagggctag