GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:47:07 Sequence gi568815588f:5997066_6215553 : 218488 bp : 46.99% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.13 Intr - 2876 2728 149 0 2 -47 81 173 0.034 3.05 1.12 Intr - 3641 3549 93 0 0 55 81 69 0.183 2.84 1.11 Intr - 11433 11283 151 0 1 84 52 106 0.143 6.34 1.10 Intr - 21054 20988 67 2 1 91 98 12 0.082 1.41 1.09 Intr - 22434 22299 136 1 1 81 53 91 0.088 4.53 1.08 Intr - 22876 22805 72 2 0 104 80 25 0.844 2.68 1.07 Intr - 24628 24413 216 2 0 82 91 62 0.597 4.38 1.06 Intr - 27289 27179 111 2 0 61 94 77 0.981 5.95 1.05 Intr - 28960 28769 192 2 0 106 119 136 0.973 17.96 1.04 Intr - 38936 38809 128 1 2 20 78 84 0.087 0.92 1.03 Intr - 56855 56724 132 1 0 86 61 115 0.734 8.36 1.02 Intr - 63389 63335 55 0 1 96 41 40 0.583 -1.86 1.01 Init - 65086 65023 64 1 1 76 110 85 0.911 9.08 1.00 Prom - 67999 67960 40 -4.56 2.06 PlyA - 69806 69801 6 1.05 2.05 Term - 74670 74446 225 0 0 33 38 306 0.932 16.88 2.04 Intr - 74860 74714 147 1 0 101 7 114 0.899 5.03 2.03 Intr - 80086 80004 83 2 2 102 13 27 0.254 -4.04 2.02 Intr - 80489 80395 95 1 2 112 34 78 0.423 4.41 2.01 Init - 80622 80582 41 0 2 95 94 52 0.420 4.24 2.00 Prom - 85339 85300 40 -1.76 3.00 Prom + 94549 94588 40 -6.06 3.01 Init + 100001 100123 123 1 0 66 98 88 0.957 7.77 3.02 Intr + 103303 103431 129 0 0 117 68 16 0.912 3.39 3.03 Intr + 104206 104322 117 0 0 55 121 112 0.997 11.86 3.04 Intr + 107866 108032 167 0 2 79 78 140 0.940 10.86 3.05 Intr + 109076 109173 98 2 2 73 72 100 0.995 6.55 3.06 Intr + 111621 111677 57 1 0 104 84 22 0.824 2.26 3.07 Intr + 112921 113062 142 2 1 60 47 134 0.994 5.91 3.08 Intr + 115145 115296 152 2 2 85 93 264 0.996 26.41 3.09 Intr + 116443 116516 74 2 2 43 90 39 0.934 -1.27 3.10 Intr + 116984 117082 99 1 0 65 63 102 0.977 5.71 3.11 Intr + 118174 118246 73 0 1 101 96 92 0.990 10.28 3.12 Intr + 122951 123095 145 0 1 87 101 54 0.241 5.94 3.13 Intr + 138846 138933 88 0 1 77 115 44 0.054 5.97 3.14 Intr + 142792 142867 76 0 1 70 45 133 0.097 6.29 3.15 Intr + 143521 143646 126 2 0 8 77 105 0.146 2.05 3.16 Term + 144442 144626 185 2 2 59 41 98 0.265 -0.09 3.17 PlyA + 145797 145802 6 -0.45 4.14 PlyA - 146782 146777 6 1.05 4.13 Term - 147644 147417 228 2 0 91 41 107 0.315 2.93 4.12 Intr - 148469 148300 170 0 2 82 59 52 0.245 1.37 4.11 Intr - 154281 154092 190 0 1 5 127 148 0.469 9.56 4.10 Intr - 154666 154444 223 0 1 43 54 84 0.195 -1.37 4.09 Intr - 166977 166860 118 1 1 48 30 166 0.564 6.32 4.08 Intr - 167271 167116 156 1 0 109 23 104 0.845 5.98 4.07 Intr - 176360 176140 221 1 2 89 72 66 0.347 3.05 4.06 Intr - 177543 177432 112 1 1 59 76 65 0.419 1.84 4.05 Intr - 179946 179827 120 1 0 24 110 73 0.458 3.57 4.04 Intr - 181363 181211 153 2 0 74 92 44 0.608 3.44 4.03 Intr - 206281 205993 289 1 1 62 36 255 0.101 14.52 4.02 Intr - 215041 214837 205 0 1 52 59 159 0.078 8.60 4.01 Init - 215547 215312 236 0 2 56 52 131 0.160 3.91 4.00 Prom - 215773 215734 40 -5.36 5.00 Prom + 216289 216328 40 -9.55 5.01 Init + 216403 216436 34 0 1 87 33 26 0.939 -4.28 5.02 Intr + 216558 216683 126 1 0 101 100 244 0.990 27.55 5.03 Intr + 216987 217106 120 1 0 58 105 56 0.846 4.77 5.04 Intr + 218156 218252 97 0 1 102 72 218 0.727 20.67 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 146044 145920 125 2 2 57 39 109 0.882 1.45 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:5997066_6215553|GENSCAN_predicted_peptide_1|522_aa MDSYLLMWGLLTFIMVPGCQAGFGGQVVFGYMNKFFSGDLLFPPTLYAVQEQAGAIGLAA IESFYPKEEERAMFQKNKKEAAARRQPDKAKAPKGCYPSAAYRTPRGNKSTRKDTPMIVY FSHNDGELCDDDPPEIPHATFKAMAYKEGTMLNCECKRGFRRIKSGSLYMLCTGNSSHSS WDNQCQCTSSATRNTTKQVTPQPEEQKERKTTEMQSPMQPVDQASLPGHCREPPPWENEA TERIYHFVVGQMVYYQCVQGYRALHRGPAESVCKMTHGKTRWTQPQLICTGEMETSQFPG EEKPQASPEGRPESETSCLVTTTDFQIQTEMAATMETSIFTTEYQVAGEWGTGFVDKMYT RLRYGQVDCGRLCFPADQRPPPEWAHLAAETELNEVFTVKMGEKLPCAFDGGKRKGTVTR IRVKRVHQQALCEQQGCLFHLESFVNPASPASATSFAFHKVLTQTHGRISETVRLRFGSS SQEYKSCPREGGKAGESFRQPSEDPSGKRGLNEIQMSPHFQL >gi568815588f:5997066_6215553|GENSCAN_predicted_CDS_1|1566_bp atggattcatacctgctgatgtggggactgctcacgttcatcatggtgcctggctgccag gcaggttttggaggacaggtggtgtttggttacatgaataagttctttagtggtgatttg ctttttcctcccacgctgtatgcagtgcaagaacaagctggtgcaattggactagcagca attgagtccttttacccaaaagaagaagaaagagcaatgtttcaaaagaataaaaaggaa gctgcagccaggcgtcagccagacaaggcaaaagctcccaagggatgttacccttcagct gcatacagaactcctagaggcaacaaatccacaaggaaagatacacccatgatcgtgtat ttcagccacaatgatggagagctctgtgacgatgacccgccagagatcccacacgccaca ttcaaagccatggcctacaaggaaggaaccatgttgaactgtgaatgcaagagaggtttc cgcagaataaaaagcgggtcactctatatgctctgtacaggaaactctagccactcgtcc tgggacaaccaatgtcaatgcacaagctctgccactcggaacacaacgaaacaagtgaca cctcaacctgaagaacagaaagaaaggaaaaccacagaaatgcaaagtccaatgcagcca gtggaccaagcgagccttccaggtcactgcagggaacctccaccatgggaaaatgaagcc acagagagaatttatcatttcgtggtggggcagatggtttattatcagtgcgtccaggga tacagggctctacacagaggtcctgctgagagcgtctgcaaaatgacccacgggaagaca aggtggacccagccccagctcatatgcacaggtgaaatggagaccagtcagtttccaggt gaagagaagcctcaggcaagccccgaaggccgtcctgagagtgagacttcctgcctcgtc acaacaacagattttcaaatacagacagaaatggctgcaaccatggagacgtccatattt acaacagagtaccaggtagcaggtgagtggggcactggctttgtggacaaaatgtacacc aggctgagatatggacaggttgactgtggccggctgtgttttcctgctgatcagcgtcct cctcctgagtgggctcacctggcagcggagacagaactcaatgaggtcttcacagtgaag atgggagaaaaacttccttgtgcttttgacgggggaaagagaaaaggtactgtcacacgc atccgtgtgaagagagtccaccaacaggctttgtgtgagcaacaaggctgtttatttcac ctagaatcctttgtgaatcctgcatccccggcttctgccaccagctttgctttccacaag gttttgacacagacccatggaagaatctctgagacagtccggctgcgttttggaagcagc agccaggagtacaagtcctgcccacgggaaggaggaaaagctggagaaagcttcagacag ccctctgaggatccctcgggcaaacgtggcctgaacgagatccaaatgtccccgcacttc caactg >gi568815588f:5997066_6215553|GENSCAN_predicted_peptide_2|196_aa MVAGPAALRDTGCRREVSSKLEGLFLGVHGEVYPSSAEMDEREHREVDPNQYPAIPNPSP CLPSECLDSTDDYHCGCTRTPHDDQDPHKEDKKFIQYQSGQYPKISITGRNAEASTTPLD NRVMGAAEYMLPSGFQKLLVHPVKKPEVLLRCNTSQCAETALNVAPKNCKAILETVAQLP IGVTNPSAKLCSKEKD >gi568815588f:5997066_6215553|GENSCAN_predicted_CDS_2|591_bp atggtggctgggcctgcggctctccgggacacagggtgcagaagagaggtcagcagcaag ctggagggcctgtttcttggtgtccatggagaagtctatccgagcagtgctgagatggat gaaagggaacacagagaggttgatcccaatcagtatcctgcaatcccaaatccatctcct tgtctgccttctgagtgtctagacagcactgatgactatcactgtggctgcactaggacc cctcatgatgaccaagatcctcataaagaggacaagaagttcatccagtaccagtcaggc caatatcccaaaattagcataactggcagaaatgcagaggcatcaacaactcctcttgac aacagggttatgggagcagcagagtacatgctgcccagcggcttccagaagctcctggtc caccccgtcaagaagccggaggtcctgctgaggtgcaacacatctcagtgtgctgagact gccctcaatgttgcccccaagaactgcaaagccatcctggaaacagtggcccagctgccc attggagtcaccaatcccagtgccaagctgtgcagcaaagaaaaggattag >gi568815588f:5997066_6215553|GENSCAN_predicted_peptide_3|616_aa MSLYDDLGVETSDSKTEGWSKNFKLLQSQLQVKKAALTQAKEGAHCLTWEVFLPKSFTLN SIRPLTSRFQELQGIEEQAQLNNQSQRTKQSTVLAPVIDLKRGGSSDDRQIVDTPPHVAA GLKDPVPSGFSAGEVLIPLADEYDPMFPNDYEKVVKRQREERQRQRELERQKEIEEREKR RKDRHEASGFARRPDPDSDEDEDYERERRKRSMGGAAIAPPTSLVEKDKELPRDFPYEED SRPRSQSSKAAIPPPVYEEQDRPRSPTGPSNSFLANMGGTVAHKIMQKYGFREGQGLGKH EQGLSTALSVEKTSKRGGKIIVGDATEKDASKKSDSNPLTEILKCPTKVVLLRNMVGAGE VDEDLEVETKEECEKYGKVGKCVIFEIPGAPDDEAVRIFLEFERVESAIKENPNIPPPPP CFLLPLPLSPHLQNEDCVDYKLDLCFHQASLIVQLRAVRPNVKYEYAHFTDGQTKVEKLH ALPKATQMLNNNDDNGNDHNDDDNGNDHNDDDNDSMGAVSLRTGENSRKEPFSSCNEGNR KLWPRHLLSPEPGHRPRLWRVRCSIPDGSANPTPSALFYRRMDLQRQPFLPGGFAFREGQ HRTTLSYLTLTSILVG >gi568815588f:5997066_6215553|GENSCAN_predicted_CDS_3|1851_bp atgtccctgtacgatgacctaggagtggagaccagtgactcaaaaacagaaggctggtcc aaaaacttcaaacttctgcagtctcagcttcaggtgaagaaggcagctctcactcaggca aaggaaggggcacactgcctcacctgggaagtgttcctgcctaaaagctttaccctgaat tcgatcaggcctttaacctccaggtttcaggaattacaaggcatagaggaacaagctcaa ctaaataaccagagccaaaggacgaaacaaagtacagtcctcgccccagtcattgacctg aagcgaggtggctcctcagatgaccggcaaattgtggacactccaccgcatgtagcagct gggctgaaggatcctgttcccagtgggttttctgcaggggaagttctgattcccttagct gacgaatatgaccctatgtttcctaatgattatgagaaagtagtgaagcgccaaagagag gaacgacagagacagcgggagctggaaagacaaaaggaaatagaagaaagggaaaaaagg cgtaaagacagacatgaagcaagtgggtttgcaaggagaccagatccagattctgatgaa gatgaagattatgagcgagagaggaggaaaagaagtatgggcggagctgccattgcccca cccacttctctggtagagaaagacaaagagttaccccgagattttccttatgaagaggac tcaagacctcgatcacagtcttccaaagcagccattcctcccccagtgtacgaggaacaa gacagaccgagatctccaaccggacctagcaactccttcctcgctaacatggggggcacg gtggcgcacaagatcatgcagaagtacggcttccgggagggccagggtctggggaagcat gagcagggcctgagcactgccttgtcagtggagaagaccagcaagcgtggcggcaagatc atcgtgggcgacgccacagagaaagatgcatccaagaagtcagattcaaatccgctgact gaaatacttaagtgtcctactaaagtggtcttactaaggaacatggttggtgcgggagag gtggatgaagacttggaagttgaaaccaaggaagaatgtgaaaaatatggcaaagttgga aaatgtgtgatatttgaaattcctggtgcccctgatgatgaagcagtacggatattttta gaatttgagagagttgaatcagcaattaaagagaatcccaacatcccgcctccacctcct tgcttcttactgcctcttccactcagcccacacttgcagaatgaggactgcgtcgactac aaattagacctctgcttccaccaggcctccttgatagttcagctaagagccgtgcgacct aatgtgaagtatgaatatgcccactttacagacggacaaactaaggtggaaaagttacat gccttacccaaagccacacaaatgctaaacaataatgatgataatggcaatgatcataat gatgatgataatggcaatgatcataatgatgatgataatgacagcatgggagccgtcagc ctacgaactggagagaactccaggaaggagcccttttcttcctgcaatgagggcaaccgg aagctgtggccacggcacctgctgtccccggagcctgggcaccgaccacggctgtggcga gttcgttgttcaatcccggatggctcagcaaatccaaccccctccgccctgttctatcgc cgcatggatctgcagaggcagccgttcctgccgggtggatttgccttccgcgaaggacaa cacagaaccactctgagttatttgacactgacatcgatcctagttggctga >gi568815588f:5997066_6215553|GENSCAN_predicted_peptide_4|806_aa MLKLPQWTGKSSTRASHALSDPLFWCQARMNAGALPREGPVTKPQPGAGSTTVSRKPGDA LACQLPPGFRCYMTVLASQKTSESRTQNQADVRLDRGPDVRAEQAYAAGATTPQTGRSSG KKPLCKPEPPAGLQTEVNVQDLPLGLELHSHLGNEGLWSTGTQIFCTRLCVSSNGIFAAA PDGPPPGAEIPTLAGERPPTPGTKEERGACAGARPGVSCAARGNVLWAKFRTLACAGLDV CCEAHISRMSYASFAVLLLLVVSPQEDRAFLFLLLMDPGTSLAHSRGAPHSGQVGRSGKS GTATHILEPSDSRLLSCPELTRESHQVAFYVQIAVQGQQAQCHRNRNTPDKRAPAPGFLG GGRRQLQASKPRKRCCNHMHRTHKSCTQVDNLFLSAPSRLKAHHCPAPAAEDYRLRLDLL WAAGKHKHDHQQRLWWDSFCCGNAQGPVHEDSDGATDPTHTRSSGSLCPTLGARSGPRVL TGGASTADLPQPRIHARATPNAGTRKNTCRAYFLSGIIYSRSQAFSENILCRKHNPRERG GYPPDFPFAPACLIGSFSTAVAGQASGTTSAASGEDFQNLIARGQYLKQTVTTADQLHPK KAQLSVTPQTESTFRSYSWKGCLRKCQDDQTPLLFRHLVAAPTRTLDAPGATPPSSDPHT RYHQDSYTLPVGSYGPRAPPPSGSLRVRSPGISAARGWSAAGSGNSLQVPPPARARGGGR SRQRGRAGLEEGRPAAPQTRNLPALKLPPVLWTPRDRGKLVREETNSLFSFGLFRAILPE GRDLREFHLIFTFMSYGHRGHARITS >gi568815588f:5997066_6215553|GENSCAN_predicted_CDS_4|2421_bp atgctaaaattaccccagtggacaggtaaatcctccacccgagcctctcacgccctgtct gaccccctcttctggtgccaggctcggatgaatgctggggccctgccgagggaagggccg gtcacgaagcctcagccaggggccggctccaccactgtctccaggaaacctggagatgct cttgcctgccagctccctcctggcttccgctgctacatgacggtcttggcatctcagaaa acctctgaaagccgcacacagaaccaggcagatgtcagactggacagaggccctgatgtc cgagctgagcaggcttatgctgcgggagccaccacccctcagacgggccgcagctcagga aagaagcccctctgtaagccggagcccccagcagggctccaaactgaagtcaatgtccag gatctgccgctaggcctggagctgcactcacatctgggcaacgagggcctgtggtccacg ggcacccagatcttctgcactcggctctgcgtcagttccaacggcatcttcgcggctgcg cccgacggcccgcctcccggggccgagatcccgacgctggcaggagagcggccgccgacc cccggaacaaaggaggagaggggggcgtgcgccggcgcgcggccgggcgtttcctgcgct gccaggggaaacgtgctttgggctaagttccggaccctcgcctgtgcggggctcgacgtg tgctgcgaggcccatatcagccgcatgtcctacgcctccttcgctgttctcttgttgctg gttgtgtctccccaagaggacagggcatttttgtttctgttgctgatggaccctggaacc agcctggcccacagcaggggcgccccacacagtggtcaggtgggcaggagcgggaagtca ggcactgccacccacatcctggagcccagtgacagccgtctcctgtcctgtccggaactc actcgtgagtcccaccaggtagcattttatgttcagattgcagtccaggggcagcaggcc cagtgccacagaaacagaaacacgcctgacaagcgagccccagctcctggctttctcggc gggggccgcaggcagctccaggcctccaagcccaggaagcggtgctgtaaccatatgcac aggacacataaaagctgcacacaagtggacaacctcttcctgagcgccccctccaggctg aaagcacaccactgcccggccccagcagctgaggactacaggttgcgcctcgacctcttg tgggctgctggcaaacacaaacatgaccaccagcaacggctgtggtgggacagtttctgc tgtggaaatgcccagggccctgttcatgaagattcggatggagccacagacccaacgcat acgaggtcctcagggagtctgtgtcccacactgggggctaggagcggtccccgggtgctc actggcggcgcctccaccgcggacctgccccagccccgcatccacgcgcgggccacaccc aacgccgggacgcgcaaaaacacctgccgggcctatttcctctcgggtattatttacagc cgctcccaggccttctcggaaaacatcctttgccggaaacacaaccccagggagcgggga gggtaccctcccgacttcccgtttgcccctgcctgcctgattggatctttctccactgct gtggctggacaagcctctgggacgacttcagcagcctcaggagaggacttccagaatctg atcgccaggggtcagtatctcaagcagacagtcacgacagcagaccaactccaccctaag aaagcacagttgtcagtcacccctcaaacagagagcaccttccggagctacagctggaaa ggctgcttgcggaagtgccaggacgaccaaaccccgctgctcttcagacaccttgtggct gccccaaccagaaccttggatgctcctggggccacgccccctagctcagacccgcacaca agataccaccaagactcatacacactgccagtggggagctacggaccccgggccccgccg ccctccgggtctctccgggtccggagccctggtatatctgcggcccggggctggagcgca gcgggaagcgggaattcgctgcaagtgccgcccccggctcgggcacgtggtggcggccgg agtcgccagaggggacgcgcgggtctggaggaggggcggcccgcagccccccagacgcgg aacttgccggccttgaagctgccgcctgtcttgtggactccacgggacaggggaaaactg gtccgggaagaaaccaactcccttttctctttcgggctgttccgagccatccttcccgaa ggacgagacctgagggaattccatcttatttttacttttatgtcctacgggcacagggga cacgctcgaatcaccagctga >gi568815588f:5997066_6215553|GENSCAN_predicted_peptide_5|126_aa MVATVLPPGQQTCGPKLTNSPTVIVMVGLPARGKTYISKKLTRYLNWIGVPTKVLANIPL LIALSTAYRRSRVFNPQAVDCYVGWPVRNLAAGVFNVGEYRREAVKQYSSYNFFRPDNEE AMKVRN >gi568815588f:5997066_6215553|GENSCAN_predicted_CDS_5|378_bp atggtggccaccgtgctcccgcctgggcaacagacctgtgggccaaagctgaccaactcc cccaccgtcatcgtcatggtgggcctccccgcccggggcaagacctacatctccaagaag ctgactcgctacctcaactggattggcgtccccacaaaagtcctagctaacattcctctg ctgatagccctaagcactgcctaccgtagatcaagggtcttcaacccccaggctgtggac tgctatgttgggtggcctgttaggaacctggctgcaggagtgttcaacgtcggggagtat cgccgggaggctgtgaagcagtacagctcctacaacttcttccgccccgacaatgaggaa gccatgaaagtccggaan