GENSCAN 1.0 Date run: 2-Nov-116 Time: 21:37:16 Sequence gi568815588r:5918052_6162151 : 244100 bp : 45.78% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 291 427 137 1 2 97 111 134 0.879 16.89 1.02 Intr + 3207 3306 100 2 1 129 61 67 0.997 7.78 1.03 Intr + 3397 3518 122 2 2 51 78 100 0.907 5.51 1.04 Intr + 5624 5789 166 1 1 43 3 133 0.726 -0.17 1.05 Intr + 6260 6457 198 0 0 80 110 212 0.997 21.92 1.06 Intr + 7316 7441 126 0 0 79 91 91 0.814 9.15 1.07 Intr + 9384 9490 107 1 2 69 86 66 0.964 4.43 1.08 Intr + 18405 18536 132 2 0 124 54 228 0.998 23.84 1.09 Term + 19059 19229 171 2 0 104 55 280 0.997 24.13 1.10 PlyA + 20044 20049 6 1.05 2.07 PlyA - 25196 25191 6 1.05 2.06 Term - 35155 35044 112 0 1 94 47 93 0.913 3.83 2.05 Intr - 38403 38328 76 1 1 91 113 39 0.778 5.27 2.04 Intr - 41735 41703 33 0 0 131 119 38 0.981 9.39 2.03 Intr - 42516 42316 201 1 0 79 71 98 0.724 6.46 2.02 Intr - 48288 48094 195 1 0 97 64 134 0.917 11.29 2.01 Init - 59441 59354 88 2 1 122 37 261 0.560 23.20 2.00 Prom - 65818 65779 40 0.54 3.00 Prom + 70793 70832 40 -6.26 3.01 Init + 74128 74213 86 0 2 82 78 89 0.427 7.59 3.02 Intr + 83327 83435 109 2 1 68 81 52 0.062 2.79 3.03 Term + 86046 86150 105 2 0 68 41 89 0.126 0.61 3.04 PlyA + 86307 86312 6 1.05 4.11 PlyA - 86979 86974 6 1.05 4.10 Term - 94845 94556 290 1 2 72 42 160 0.699 5.34 4.09 Intr - 101448 101377 72 1 0 81 109 54 0.531 6.18 4.08 Intr - 101890 101819 72 2 0 104 80 25 0.844 2.68 4.07 Intr - 103642 103427 216 2 0 82 91 62 0.597 4.38 4.06 Intr - 106303 106193 111 2 0 61 94 77 0.981 5.95 4.05 Intr - 107974 107783 192 2 0 106 119 136 0.973 17.96 4.04 Intr - 117950 117823 128 1 2 20 78 84 0.087 0.92 4.03 Intr - 135869 135738 132 1 0 86 61 115 0.734 8.36 4.02 Intr - 142403 142349 55 0 1 96 41 40 0.583 -1.86 4.01 Init - 144100 144037 64 1 1 76 110 85 0.911 9.08 4.00 Prom - 147013 146974 40 -4.56 5.06 PlyA - 148820 148815 6 1.05 5.05 Term - 153684 153460 225 0 0 33 38 306 0.932 16.88 5.04 Intr - 153874 153728 147 1 0 101 7 114 0.899 5.03 5.03 Intr - 159100 159018 83 2 2 102 13 27 0.254 -4.04 5.02 Intr - 159503 159409 95 1 2 112 34 78 0.423 4.41 5.01 Init - 159636 159596 41 0 2 95 94 52 0.420 4.24 5.00 Prom - 164353 164314 40 -1.76 6.00 Prom + 173563 173602 40 -6.06 6.01 Init + 179015 179137 123 1 0 66 98 88 0.957 7.77 6.02 Intr + 182317 182445 129 0 0 117 68 16 0.912 3.39 6.03 Intr + 183220 183336 117 0 0 55 121 112 0.997 11.86 6.04 Intr + 186880 187046 167 0 2 79 78 140 0.940 10.86 6.05 Intr + 188090 188187 98 2 2 73 72 100 0.995 6.55 6.06 Intr + 190635 190691 57 1 0 104 84 22 0.824 2.26 6.07 Intr + 191935 192076 142 2 1 60 47 134 0.994 5.91 6.08 Intr + 194159 194310 152 2 2 85 93 264 0.996 26.41 6.09 Intr + 195457 195530 74 2 2 43 90 39 0.934 -1.27 6.10 Intr + 195998 196096 99 1 0 65 63 102 0.977 5.71 6.11 Intr + 197188 197260 73 0 1 101 96 92 0.990 10.28 6.12 Intr + 201965 202109 145 0 1 87 101 54 0.241 5.94 6.13 Intr + 217860 217947 88 0 1 77 115 44 0.054 5.97 6.14 Intr + 221806 221881 76 0 1 70 45 133 0.097 6.29 6.15 Intr + 222535 222660 126 2 0 8 77 105 0.146 2.05 6.16 Term + 223456 223640 185 2 2 59 41 98 0.265 -0.09 6.17 PlyA + 224811 224816 6 -0.45 7.06 PlyA - 225796 225791 6 1.05 7.05 Term - 226658 226431 228 2 0 91 41 107 0.350 2.93 7.04 Intr - 227483 227314 170 0 2 82 59 52 0.273 1.37 7.03 Intr - 233295 233106 190 0 1 5 127 148 0.525 9.56 7.02 Intr - 233680 233458 223 0 1 43 54 84 0.212 -1.37 7.01 Init - 235425 235358 68 0 2 106 77 15 0.421 2.75 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 225058 224934 125 2 2 57 39 109 0.882 1.45 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588r:5918052_6162151|GENSCAN_predicted_peptide_1|419_aa XIMNIVLSQPCGKIFVGDPHQQIYTFRGAVNALFTVPHTHVFYLTQSFRFGVEIAYVGAT ILDVCKRVRKKTLVGGNHQSGIRGDAKGQVALLSRTNANVFDEAVRVTEGEFPSRIHLIG PEEERRKREYPPGLGALEGRTQVTGTRKKQAQSESGTRFPPEKGELVLLSSHDEGENLVI KDKFIRRWVHKEGFSGFKRYVTAAEDKELEAKIAVVEKYNIRIPELVQRIEKCHIEDLDF AEYILGTVHKAKGLEFDTVHVLDDFVKVPCARHNLPQLPHFRVESFSEDEWNLLYVAVTR AKKRLIMTKSLENILTLAGEYFLQAELTSNVLKTGVVRCCVGQCNNAIPVDTVLTMKKLP ITYSNRKENKGGYLCHSCAEQRIGPLAFLTASPEQVRAMERTVENIVLPRHEALLFLVF >gi568815588r:5918052_6162151|GENSCAN_predicted_CDS_1|1260_bp nctatcatgaacatagttctgtctcagccatgtgggaaaatctttgtaggggacccgcac cagcagatctataccttccggggtgcggtcaacgccctgttcacagtgccccacacccac gtcttctatctcacgcagagttttcggtttggtgtggaaatagcttatgtgggagctact atcttggatgtttgcaagagagtcaggaaaaagactttggttggaggaaaccatcagagt ggcattagaggtgacgcaaaggggcaagtggccttgttgtcccggaccaacgccaacgtg tttgatgaggccgtacgggtgacggaaggggaattcccttcaaggatacatttgattggg ccagaggaagaacggaggaaacgtgagtacccacctggccttggtgcattggaaggacgc acccaagtgacagggacgagaaagaagcaggcccagtctgagtcagggacccgtttccct ccagagaagggcgagctagtgttgctgtcttcccatgacgagggggaaaacctcgtcatt aaagacaaatttatcagaagatgggtgcacaaagaaggctttagtggcttcaagaggtat gtgaccgctgccgaggacaaggagcttgaagccaagatcgcagttgttgaaaagtataac atcaggattccagagctggtgcaaaggatagaaaaatgccatatagaagatttggacttt gcagagtacattctgggcactgtgcacaaagccaaaggcctggagtttgacactgtgcat gttttggatgattttgtgaaagtgccttgtgcccggcataacctgccccagcttccgcac ttcagagttgagtcattttctgaggatgaatggaatttactgtatgttgcagtaactcga gccaagaagcgtctcatcatgaccaaatcattggaaaacattttgactttggctggggag tacttcttgcaagcagagctgacaagcaacgtcttaaaaacaggcgtggtgcgctgctgc gtgggacagtgcaacaatgccatccctgttgacaccgtccttaccatgaagaagctgccc atcacctatagcaacaggaaggaaaacaaggggggctacctctgccactcctgtgcggag cagcgcatcgggcccctggcgttcctgacagcctccccggagcaggtgcgcgccatggag cgcactgtggagaacatcgtactgccccggcatgaggccctgctcttcctcgtcttctga >gi568815588r:5918052_6162151|GENSCAN_predicted_peptide_2|234_aa MAPRRARGCRTLGLPALLLLLLLRPPATRGITCPPPMSVEHADIWVKSYSLYSRERYICN SGFKRKAGTSSLTECVLNKATNVAHWTTPSLKCIKPAASSPSSNNTAATTAAIVPGSQLM PSKSPSTGTTEISSHESSHGTPSQTTAKNWELTASASHQPPGVYPQGHSDTTVAISTSTV LLCGLSAVSLLACYLKSRQTPPLASVEMEAMEALPVTWGTSSRDEDLENCSHHL >gi568815588r:5918052_6162151|GENSCAN_predicted_CDS_2|705_bp atggccccgcggcgggcgcgcggctgccggaccctcggtctcccggcgctgctactgctg ctgctgctccggccgccggcgacgcggggcatcacgtgccctccccccatgtccgtggaa cacgcagacatctgggtcaagagctacagcttgtactccagggagcggtacatttgtaac tctggtttcaagcgtaaagccggcacgtccagcctgacggagtgcgtgttgaacaaggcc acgaatgtcgcccactggacaacccccagtctcaaatgcattaagcccgcagcttcatct cccagctcaaacaacacagcggccacaacagcagctattgtcccgggctcccagctgatg ccttcaaaatcaccttccacaggaaccacagagataagcagtcatgagtcctcccacggc accccctctcagacaacagccaagaactgggaactcacagcatccgcctcccaccagccg ccaggtgtgtatccacagggccacagcgacaccactgtggctatctccacgtccactgtc ctgctgtgtgggctgagcgctgtgtctctcctggcatgctacctcaagtcaaggcaaact cccccgctggccagcgttgaaatggaagccatggaggctctgccggtgacttgggggacc agcagcagagatgaagacttggaaaactgctctcaccacctatga >gi568815588r:5918052_6162151|GENSCAN_predicted_peptide_3|99_aa MVSALKDSFNRLIHEMNVAEERIGEREERLLPRTRRHLSLQALKEIVCVCTGYYQEHSFI WTCRLASSEGMFINIVEEEDTCFALHSLRGETEITGKSF >gi568815588r:5918052_6162151|GENSCAN_predicted_CDS_3|300_bp atggtctcagctttaaaggattccttcaacaggcttattcatgaaatgaatgtagcagaa gaaagaattggtgaacgtgaagagaggttattaccaagaacacggcgtcaccttagtctg caggctctgaaggagatcgtgtgtgtatgtacgggttattaccaagagcacagtttcatc tggacctgcaggctggctagctctgaaggtatgttcatcaacatcgtagaagaagaggac acgtgtttcgcactgcactctttaagaggagaaacggaaataacagggaaaagtttttaa >gi568815588r:5918052_6162151|GENSCAN_predicted_peptide_4|443_aa MDSYLLMWGLLTFIMVPGCQAGFGGQVVFGYMNKFFSGDLLFPPTLYAVQEQAGAIGLAA IESFYPKEEERAMFQKNKKEAAARRQPDKAKAPKGCYPSAAYRTPRGNKSTRKDTPMIVY FSHNDGELCDDDPPEIPHATFKAMAYKEGTMLNCECKRGFRRIKSGSLYMLCTGNSSHSS WDNQCQCTSSATRNTTKQVTPQPEEQKERKTTEMQSPMQPVDQASLPGHCREPPPWENEA TERIYHFVVGQMVYYQCVQGYRALHRGPAESVCKMTHGKTRWTQPQLICTGEMETSQFPG EEKPQASPEGRPESETSCLVTTTDFQIQTEMAATMETSIFTTEYQVAGGRVEEQSRKPKE QEFLGKKPGTDNRSHEAQVKSKVLNGRPGDIRCACLRFGSSEVTSQDTGQWQPCLYASSV PSESERYPLLNSNFAVEEEGQNH >gi568815588r:5918052_6162151|GENSCAN_predicted_CDS_4|1332_bp atggattcatacctgctgatgtggggactgctcacgttcatcatggtgcctggctgccag gcaggttttggaggacaggtggtgtttggttacatgaataagttctttagtggtgatttg ctttttcctcccacgctgtatgcagtgcaagaacaagctggtgcaattggactagcagca attgagtccttttacccaaaagaagaagaaagagcaatgtttcaaaagaataaaaaggaa gctgcagccaggcgtcagccagacaaggcaaaagctcccaagggatgttacccttcagct gcatacagaactcctagaggcaacaaatccacaaggaaagatacacccatgatcgtgtat ttcagccacaatgatggagagctctgtgacgatgacccgccagagatcccacacgccaca ttcaaagccatggcctacaaggaaggaaccatgttgaactgtgaatgcaagagaggtttc cgcagaataaaaagcgggtcactctatatgctctgtacaggaaactctagccactcgtcc tgggacaaccaatgtcaatgcacaagctctgccactcggaacacaacgaaacaagtgaca cctcaacctgaagaacagaaagaaaggaaaaccacagaaatgcaaagtccaatgcagcca gtggaccaagcgagccttccaggtcactgcagggaacctccaccatgggaaaatgaagcc acagagagaatttatcatttcgtggtggggcagatggtttattatcagtgcgtccaggga tacagggctctacacagaggtcctgctgagagcgtctgcaaaatgacccacgggaagaca aggtggacccagccccagctcatatgcacaggtgaaatggagaccagtcagtttccaggt gaagagaagcctcaggcaagccccgaaggccgtcctgagagtgagacttcctgcctcgtc acaacaacagattttcaaatacagacagaaatggctgcaaccatggagacgtccatattt acaacagagtaccaggtagcaggaggaagagtagaagaacaatctagaaaaccaaaagaa caagaatttcttggtaagaagccgggaacagacaacagaagtcatgaagcccaagtgaaa tcaaaggtgctaaatggtcgcccaggagacatccgttgtgcttgcctgcgttttggaagc tctgaagtcacatcacaggacacggggcagtggcaaccttgtctctatgccagctcagtc ccatcagagagcgagcgctacccacttctaaatagcaatttcgccgttgaagaggaaggg caaaaccactag >gi568815588r:5918052_6162151|GENSCAN_predicted_peptide_5|196_aa MVAGPAALRDTGCRREVSSKLEGLFLGVHGEVYPSSAEMDEREHREVDPNQYPAIPNPSP CLPSECLDSTDDYHCGCTRTPHDDQDPHKEDKKFIQYQSGQYPKISITGRNAEASTTPLD NRVMGAAEYMLPSGFQKLLVHPVKKPEVLLRCNTSQCAETALNVAPKNCKAILETVAQLP IGVTNPSAKLCSKEKD >gi568815588r:5918052_6162151|GENSCAN_predicted_CDS_5|591_bp atggtggctgggcctgcggctctccgggacacagggtgcagaagagaggtcagcagcaag ctggagggcctgtttcttggtgtccatggagaagtctatccgagcagtgctgagatggat gaaagggaacacagagaggttgatcccaatcagtatcctgcaatcccaaatccatctcct tgtctgccttctgagtgtctagacagcactgatgactatcactgtggctgcactaggacc cctcatgatgaccaagatcctcataaagaggacaagaagttcatccagtaccagtcaggc caatatcccaaaattagcataactggcagaaatgcagaggcatcaacaactcctcttgac aacagggttatgggagcagcagagtacatgctgcccagcggcttccagaagctcctggtc caccccgtcaagaagccggaggtcctgctgaggtgcaacacatctcagtgtgctgagact gccctcaatgttgcccccaagaactgcaaagccatcctggaaacagtggcccagctgccc attggagtcaccaatcccagtgccaagctgtgcagcaaagaaaaggattag >gi568815588r:5918052_6162151|GENSCAN_predicted_peptide_6|616_aa MSLYDDLGVETSDSKTEGWSKNFKLLQSQLQVKKAALTQAKEGAHCLTWEVFLPKSFTLN SIRPLTSRFQELQGIEEQAQLNNQSQRTKQSTVLAPVIDLKRGGSSDDRQIVDTPPHVAA GLKDPVPSGFSAGEVLIPLADEYDPMFPNDYEKVVKRQREERQRQRELERQKEIEEREKR RKDRHEASGFARRPDPDSDEDEDYERERRKRSMGGAAIAPPTSLVEKDKELPRDFPYEED SRPRSQSSKAAIPPPVYEEQDRPRSPTGPSNSFLANMGGTVAHKIMQKYGFREGQGLGKH EQGLSTALSVEKTSKRGGKIIVGDATEKDASKKSDSNPLTEILKCPTKVVLLRNMVGAGE VDEDLEVETKEECEKYGKVGKCVIFEIPGAPDDEAVRIFLEFERVESAIKENPNIPPPPP CFLLPLPLSPHLQNEDCVDYKLDLCFHQASLIVQLRAVRPNVKYEYAHFTDGQTKVEKLH ALPKATQMLNNNDDNGNDHNDDDNGNDHNDDDNDSMGAVSLRTGENSRKEPFSSCNEGNR KLWPRHLLSPEPGHRPRLWRVRCSIPDGSANPTPSALFYRRMDLQRQPFLPGGFAFREGQ HRTTLSYLTLTSILVG >gi568815588r:5918052_6162151|GENSCAN_predicted_CDS_6|1851_bp atgtccctgtacgatgacctaggagtggagaccagtgactcaaaaacagaaggctggtcc aaaaacttcaaacttctgcagtctcagcttcaggtgaagaaggcagctctcactcaggca aaggaaggggcacactgcctcacctgggaagtgttcctgcctaaaagctttaccctgaat tcgatcaggcctttaacctccaggtttcaggaattacaaggcatagaggaacaagctcaa ctaaataaccagagccaaaggacgaaacaaagtacagtcctcgccccagtcattgacctg aagcgaggtggctcctcagatgaccggcaaattgtggacactccaccgcatgtagcagct gggctgaaggatcctgttcccagtgggttttctgcaggggaagttctgattcccttagct gacgaatatgaccctatgtttcctaatgattatgagaaagtagtgaagcgccaaagagag gaacgacagagacagcgggagctggaaagacaaaaggaaatagaagaaagggaaaaaagg cgtaaagacagacatgaagcaagtgggtttgcaaggagaccagatccagattctgatgaa gatgaagattatgagcgagagaggaggaaaagaagtatgggcggagctgccattgcccca cccacttctctggtagagaaagacaaagagttaccccgagattttccttatgaagaggac tcaagacctcgatcacagtcttccaaagcagccattcctcccccagtgtacgaggaacaa gacagaccgagatctccaaccggacctagcaactccttcctcgctaacatggggggcacg gtggcgcacaagatcatgcagaagtacggcttccgggagggccagggtctggggaagcat gagcagggcctgagcactgccttgtcagtggagaagaccagcaagcgtggcggcaagatc atcgtgggcgacgccacagagaaagatgcatccaagaagtcagattcaaatccgctgact gaaatacttaagtgtcctactaaagtggtcttactaaggaacatggttggtgcgggagag gtggatgaagacttggaagttgaaaccaaggaagaatgtgaaaaatatggcaaagttgga aaatgtgtgatatttgaaattcctggtgcccctgatgatgaagcagtacggatattttta gaatttgagagagttgaatcagcaattaaagagaatcccaacatcccgcctccacctcct tgcttcttactgcctcttccactcagcccacacttgcagaatgaggactgcgtcgactac aaattagacctctgcttccaccaggcctccttgatagttcagctaagagccgtgcgacct aatgtgaagtatgaatatgcccactttacagacggacaaactaaggtggaaaagttacat gccttacccaaagccacacaaatgctaaacaataatgatgataatggcaatgatcataat gatgatgataatggcaatgatcataatgatgatgataatgacagcatgggagccgtcagc ctacgaactggagagaactccaggaaggagcccttttcttcctgcaatgagggcaaccgg aagctgtggccacggcacctgctgtccccggagcctgggcaccgaccacggctgtggcga gttcgttgttcaatcccggatggctcagcaaatccaaccccctccgccctgttctatcgc cgcatggatctgcagaggcagccgttcctgccgggtggatttgccttccgcgaaggacaa cacagaaccactctgagttatttgacactgacatcgatcctagttggctga >gi568815588r:5918052_6162151|GENSCAN_predicted_peptide_7|292_aa MTSYQPLKASTKDQATQAYPIHMERGGYPPDFPFAPACLIGSFSTAVAGQASGTTSAASG EDFQNLIARGQYLKQTVTTADQLHPKKAQLSVTPQTESTFRSYSWKGCLRKCQDDQTPLL FRHLVAAPTRTLDAPGATPPSSDPHTRYHQDSYTLPVGSYGPRAPPPSGSLRVRSPGISA ARGWSAAGSGNSLQVPPPARARGGGRSRQRGRAGLEEGRPAAPQTRNLPALKLPPVLWTP RDRGKLVREETNSLFSFGLFRAILPEGRDLREFHLIFTFMSYGHRGHARITS >gi568815588r:5918052_6162151|GENSCAN_predicted_CDS_7|879_bp atgacctcctaccagccactcaaagcatctaccaaggaccaagccacacaggcttacccc attcacatggagcggggagggtaccctcccgacttcccgtttgcccctgcctgcctgatt ggatctttctccactgctgtggctggacaagcctctgggacgacttcagcagcctcagga gaggacttccagaatctgatcgccaggggtcagtatctcaagcagacagtcacgacagca gaccaactccaccctaagaaagcacagttgtcagtcacccctcaaacagagagcaccttc cggagctacagctggaaaggctgcttgcggaagtgccaggacgaccaaaccccgctgctc ttcagacaccttgtggctgccccaaccagaaccttggatgctcctggggccacgccccct agctcagacccgcacacaagataccaccaagactcatacacactgccagtggggagctac ggaccccgggccccgccgccctccgggtctctccgggtccggagccctggtatatctgcg gcccggggctggagcgcagcgggaagcgggaattcgctgcaagtgccgcccccggctcgg gcacgtggtggcggccggagtcgccagaggggacgcgcgggtctggaggaggggcggccc gcagccccccagacgcggaacttgccggccttgaagctgccgcctgtcttgtggactcca cgggacaggggaaaactggtccgggaagaaaccaactcccttttctctttcgggctgttc cgagccatccttcccgaaggacgagacctgagggaattccatcttatttttacttttatg tcctacgggcacaggggacacgctcgaatcaccagctga