GENSCAN 1.0 Date run: 3-Nov-116 Time: 19:42:39 Sequence gi568815581f:70075040_70276320 : 201281 bp : 38.04% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 13338 13522 185 0 2 54 47 103 0.030 1.31 1.02 Intr + 16792 16877 86 2 2 94 72 38 0.072 1.42 1.03 Term + 29806 29991 186 0 0 2 48 193 0.138 3.21 1.04 PlyA + 30495 30500 6 1.05 2.03 PlyA - 31870 31865 6 1.05 2.02 Term - 32368 32285 84 1 0 89 39 97 0.362 1.57 2.01 Init - 38541 38536 6 0 0 86 91 8 0.146 1.40 2.00 Prom - 40887 40848 40 -2.15 3.00 Prom + 43052 43091 40 -3.25 3.01 Init + 53828 53950 123 1 0 76 87 148 0.690 13.72 3.02 Term + 56956 58305 1350 0 0 95 37 932 0.989 79.27 3.03 PlyA + 58631 58636 6 1.05 4.00 Prom + 62493 62532 40 -5.35 4.01 Init + 76193 76566 374 1 2 44 71 282 0.079 18.58 4.02 Intr + 82292 82519 228 2 0 53 48 120 0.001 0.56 4.03 Intr + 88259 88323 65 2 2 79 72 3 0.002 -4.76 4.04 Intr + 93724 94662 939 2 0 60 110 286 0.068 18.21 4.05 Term + 99975 101284 1310 1 2 10 42 1518 0.014 130.15 4.06 PlyA + 101992 101997 6 1.05 5.04 PlyA - 102220 102215 6 1.05 5.03 Term - 107251 107183 69 1 0 85 43 58 0.134 -2.04 5.02 Intr - 123134 123002 133 1 1 113 46 102 0.622 8.23 5.01 Init - 133572 133562 11 0 2 82 116 9 0.804 3.03 5.00 Prom - 137562 137523 40 -4.15 6.00 Prom + 148834 148873 40 -5.85 6.01 Init + 166429 166680 252 0 0 75 97 305 0.678 27.49 6.02 Intr + 169729 169789 61 0 1 100 66 15 0.163 -2.11 6.03 Intr + 179438 179523 86 0 2 62 32 119 0.007 2.32 6.04 Term + 190220 190357 138 1 0 57 46 108 0.221 0.48 6.05 PlyA + 190841 190846 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 100001 101284 1284 1 0 82 42 1493 0.970 139.79 S.002 Init + 176375 176518 144 1 0 62 32 167 0.853 8.67 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:70075040_70276320|GENSCAN_predicted_peptide_1|152_aa XRRSSDGNAGCLLLLFTACCAARFLTGHGLVPVPGLGVEDPCLRDIGAMKLKQPTHNRHF RNGQYLHLIGECGVHQSMQVNVNIPCPVTAWSEAENSVAFVCGECIRKGKEERDSPVGFL ISMAVGADSFRRVLTGPREKGPEMLSYGALTR >gi568815581f:70075040_70276320|GENSCAN_predicted_CDS_1|459_bp nngcggaggagctcagatggtaatgctggatgtctgctgttgctgttcaccgcctgctgt gcggcccggttcctaacaggccatggactggtcccggtccctggcctgggggttgaggac ccctgtcttagggatattggggcaatgaaattgaaacaaccaactcataacaggcacttt agaaatgggcaatatttacatttaatcggggagtgtggagttcatcagtctatgcaagta aatgttaacattccatgtcctgtgacagcttggagcgaagctgagaactcagtggccttt gtctgtggtgaatgtatccggaaaggcaaagaagagagagattctcctgtggggtttctc atctccatggcggtaggtgctgacagcttcagaagggtactaacagggccaagagaaaag ggccctgagatgctcagctacggtgctcttacaagatga >gi568815581f:70075040_70276320|GENSCAN_predicted_peptide_2|29_aa MMGAPESTVMLGVLHLEGDPSSGFDDMAY >gi568815581f:70075040_70276320|GENSCAN_predicted_CDS_2|90_bp atgatgggtgcccctgaatctactgtgatgttgggtgtgttgcatcttgaaggtgaccca agctcaggttttgatgatatggcttattag >gi568815581f:70075040_70276320|GENSCAN_predicted_peptide_3|490_aa MASAPNEYAAGVYKVSCKQEVSQSDVTATQYADGLSFRLQRVLTENPNQEIATSLEFLLL QNSPGSLRAQQRMSYYGSSYHIINADAKYPGYPPEHIIAEKRRARRRLLHKDGSCNVYFK HIFGEWGSYVVDIFTTLVDTKWRHMFVIFSLSYILSWLIFGSVFWLIAFHHGDLLNDPDI TPCVDNVHSFTGAFLFSLETQTTIGYGYRCVTEECSVAVLMVILQSILSCIINTFIIGAA LAKMATARKRAQTIRFSYFALIGMRDGKLCLMWRIGDFRPNHVVEGTVRAQLLRYTEDSE GRMTMAFKDLKLVNDQIILVTPVTIVHEIDHESPLYALDRKAVAKDNFEILVTFIYTGDS TGTSHQSRSSYVPREILWGHRFNDVLEVKRKYYKVNCLQFEGSVEVYAPFCSAKQLDWKD QQLHIEKAPPVRESCTSDTKARRRSFSAVAIVSSCENPEETTTSATHEYRETPYQKALLT LNRISVESQM >gi568815581f:70075040_70276320|GENSCAN_predicted_CDS_3|1473_bp atggcgtcagcaccaaatgagtatgcggcaggggtttataaagtctcctgtaaacaggaa gtgtctcagtctgatgtaactgctacgcagtacgcggacggcctctctttccgtcttcag cgggttctaactgaaaacccaaaccaagaaatagcaacaagtctagaattcttactacta caaaactcacctggatccctaagggcacagcaaagaatgagctattacggcagcagctat catattatcaatgcggacgcaaaatacccaggctacccgccagagcacattatagctgag aagagaagagcaagaagacgattacttcacaaagatggcagctgtaatgtctacttcaag cacatttttggagaatggggaagctatgtggttgacatcttcaccactcttgtggacacc aagtggcgccatatgtttgtgatattttctttatcttatattctctcgtggttgatattt ggctctgtcttttggctcatagcctttcatcatggcgatctattaaatgatccagacatc acaccttgtgttgacaacgtccattctttcacaggggcctttttgttctccctagagacc caaaccaccataggatatggttatcgctgtgttactgaagaatgttctgtggccgtgctc atggtgatcctccagtccatcttaagttgcatcataaatacctttatcattggagctgcc ttggccaaaatggcaactgctcgaaagagagcccaaaccattcgtttcagctactttgca cttataggtatgagagatgggaagctttgcctcatgtggcgcattggtgattttcggcca aaccacgtggtagaaggaacagttagagcccaacttctccgctatacagaagacagtgaa gggaggatgacgatggcatttaaagacctcaaattagtcaacgaccaaatcatcctggtc accccggtaactattgtccatgaaattgaccatgagagccctctgtatgcccttgaccgc aaagcagtagccaaagataactttgagattttggtgacatttatctatactggtgattcc actggaacatctcaccaatctagaagctcctatgttccccgagaaattctctggggccat aggtttaatgatgtcttggaagttaagaggaagtattacaaagtgaactgcttacagttt gaaggaagtgtggaagtatatgcccccttttgcagtgccaagcaattggactggaaagac cagcagctccacatagaaaaagcaccaccagttcgagaatcctgcacgtcggacaccaag gcgagacgaaggtcatttagtgcagttgccattgtcagcagctgtgaaaaccctgaggag accaccacttccgccacacatgaatatagggaaacaccttatcagaaagctctcctgact ttaaacagaatctctgtagaatcccaaatgtag >gi568815581f:70075040_70276320|GENSCAN_predicted_peptide_4|971_aa MLNVIGQGSPTPSTGRWPVACYELGHKAECEWQVSEYYHLSSTSCQIGAAFDSRRSVNPI VNCAGKGSRLRVPYENLGIGVMCLNHPETITHNCPSRVRGKMVFHETSPWCQNVGDHCYM TKNTRRIHSHPDTSMAKGKRATSPLCGSRCPCSSVDEKILAEEKERQRKGDCWVVKRYEQ EGEVATHFSARFLLAPENPILPQSVIFPFLCPCDLIVQFPPMDQQSVPGGGGETEGGQGE RERAVHPSEKPRRLVSNSRTVGRTNSQAALQWPHLRASQSWGLCAPAPVPRPIDSVASPG AAASAGGFEGPRGSGREGVERETRRDRRTILPLRQRSLPGAGTSQAPRGARRASGPRAGE PGSVWGSRRSRKLCPRPGCNQAAATAGRGASHLLLSQLGRNFLLPGPGPTPRPGSEGGGG RGGGRFPPLATPRSILPSPPRFPGPAARPGALRLAGSGRGRGQESRCRTDRLKRAQDIAE RTGALASAQPSRRRRAGSWEFWFALAHSLFTNHWILHASVPPTSTPCPHAPAPATEALES PAEAMGSVRTNRYSIVSSEEDGMKLATMAVANGFGNGKSKVHTRQQCRSRFVKKDGHCNV QFINVGEKGQRYLADIFTTCVDIRWRWMLVIFCLAFVLSWLFFGCVFWLIALLHGDLDAS KEGKACVSEVNSFTAAFLFSIETQTTIGYGFRCVTDECPIAVFMVVFQSIVGCIIDAFII GAVMAKMAKPKKRNETLVFSHNAVIAMRDGKLCLMWRVGNLRKSHLVEAHVRAQLLKSRI TSEGEYIPLDQIDINVGFDSGIDRIFLVSPITIVHEIDEDSPLYDLSKQDIDNADFEIVV ILEGMVEATAMTTQCRSSYLANEILWGHRYEPVLFEEKHYYKVDYSRFHKTYEVPNTPLC SARDLAEKKYILSNANSFCYENEVALTSKEEDDSENGVPESTSTDTPPDIDLHNQASVPL EPRPLRRESEI >gi568815581f:70075040_70276320|GENSCAN_predicted_CDS_4|2916_bp atgttgaatgttataggacagggatccccaacccccagtacaggtcgatggcctgtggcc tgttacgaactgggccacaaagcagaatgtgagtggcaggtgagcgaatattaccacctg agctccacttcctgtcagatcggggcagcatttgattctcgtaggagtgtaaaccccatt gtgaactgtgcaggcaagggatctaggttacgtgttccttatgagaatttagggataggt gtaatgtgcttgaatcatcctgaaaccatcacccacaactgcccctctcgtgtccgtgga aaaatggtcttccatgaaaccagtccctggtgccaaaatgttggggaccattgttatatg actaaaaacaccaggcgaatacacagccaccctgatacatccatggccaaaggcaaaaga gcaacttctcctctctgtggcagcagatgcccctgttcttcagttgatgaaaaaatccta gcagaggaaaaggaaagacagaggaaaggagattgttgggttgttaagagatatgagcag gagggagaggttgcaacccacttctccgcccgctttctgctggccccagagaaccctata cttccccagagtgtgatattccccttcctgtgtccatgtgatctcattgttcagttccca cctatggatcagcaaagcgtgccgggcggtggtggagagactgagggcggacaaggcgag agggaacgagccgtccacccttcggagaagcctaggcgccttgtaagtaattcgcgaaca gtcgggagaacaaacagccaagcggcgctgcagtggccgcacttgcgcgcgtctcaatcc tgggggctctgcgcgcccgccccagtccctcgccccattgactcagtggcttctccgggc gctgcagcctccgcggggggcttcgaagggccgaggggctccggcagagagggagtggag agggagacgcgccgggaccgacgaacaatcctgcccctgcggcaaaggtctctacccggc gctggcacctcgcaggcccctcgaggagcacgcagggcaagcggcccaagagcgggggaa ccgggaagtgtgtggggctccagacggagtaggaagctttgcccaaggccaggctgcaat caggcagccgcaacagccgggcgcggagcttcccacctgctgctgtcccagctgggccgc aacttcctcctccccggcccgggcccgactccccggccgggctccgagggtggaggggga cgaggcggcggcaggttccctccgctggcaacgcctcgcagcatcctcccctccccgccg cggttcccggggccggccgcgcggccaggcgcgctgcgattggccggcagcggccggggg cggggccaggagagccggtgtcgcacggaccgcctcaaaagagcccaggatattgcagag cgcactggagccctggccagcgcgcagccttcccggcgccggcgggctgggtcttgggaa ttctggtttgctttggctcactcgctttttacaaaccactggatcttacatgcctctgta ccccccacttccactccatgtccccatgctcctgcgccagcaacagaagcactggagtcc ccagcagaagcgatgggcagtgtgcgaaccaaccgctacagcatcgtctcttcagaagaa gacggtatgaagttggccaccatggcagttgcaaatggctttgggaacgggaagagtaaa gtccacacccgacaacagtgcaggagccgctttgtgaagaaagatggccactgtaatgtt cagttcatcaatgtgggtgagaaggggcaacggtacctcgcagacatcttcaccacgtgt gtggacattcgctggcggtggatgctggttatcttctgcctggctttcgtcctgtcatgg ctgttttttggctgtgtgttttggttgatagctctgctccatggggacctggatgcatcc aaagagggcaaagcttgtgtgtccgaggtcaacagcttcacggctgccttcctcttctcc attgagacccagacaaccataggctatggtttcagatgtgtcacggatgaatgcccaatt gctgttttcatggtggtgttccagtcaatcgtgggctgcatcatcgatgctttcatcatt ggcgcagtcatggccaagatggcaaagccaaagaagagaaacgagactcttgtcttcagt cacaatgccgtgattgccatgagagacggcaagctgtgtttgatgtggcgagtgggcaat cttcggaaaagccacttggtggaagctcatgttcgagcacagctcctcaaatccagaatt acttctgaaggggagtatatccctctggatcaaatagacatcaatgttgggtttgacagt ggaatcgatcgtatatttctggtgtccccaatcactatagtccatgaaatagatgaagac agtcctttatatgatttgagtaaacaggacattgacaacgcagactttgaaatcgtggtc atactggaaggcatggtggaagccactgccatgacgacacagtgccgtagctcttatcta gcaaatgaaatcctgtggggccaccgctatgagcctgtgctctttgaagagaagcactac tacaaagtggactattccaggttccacaaaacttacgaagtccccaacactcccctttgt agtgccagagacttagcagaaaagaaatatatcctctcaaatgcaaattcattttgctat gaaaatgaagttgccctcacaagcaaagaggaagacgacagtgaaaatggagttccagaa agcactagtacggacacgccccctgacatagaccttcacaaccaggcaagtgtacctcta gagcccaggcccttacggcgagagtcggagatatga >gi568815581f:70075040_70276320|GENSCAN_predicted_peptide_5|70_aa MNERTFAVGYCKPLSMEIMQFVPKMKVSKERLMVWKPFKNIEEIDSSMEDDQLPWFAQSF SSFSTESLAR >gi568815581f:70075040_70276320|GENSCAN_predicted_CDS_5|213_bp atgaatgaaagaacatttgctgttggttactgtaaacctctgagtatggaaataatgcag ttcgtgccaaaaatgaaagtgtcaaaggagcgacttatggtctggaagccattcaaaaat atcgaagagatagattcttcaatggaggatgaccaactcccctggtttgcccagagcttt tccagtttcagcactgaaagtcttgcacgctag >gi568815581f:70075040_70276320|GENSCAN_predicted_peptide_6|178_aa MINEVDADGNRTDSPEFLTMMARKMKDTQSEEEIREAFLVFDKDGNGYISAAELCHVMTN PGEKLTDDKVDEMIREAGIDGDGQPKSTRNQRQGSPLIQYSYGLERNVTVGEDGRDTTKL ALKVEEGDRQPRNDLLFGEQVEILFDENLEQTLSPVLTWILFLAMLREKKRIDTSLRC >gi568815581f:70075040_70276320|GENSCAN_predicted_CDS_6|537_bp atgattaatgaagtagatgctgatggtaatagaacggactctcctgaatttctgacaatg atggcaagaaaaatgaaagacacacaaagtgaagaagaaattagagaagcattccttgtg tttgataaggatggcaatggctatatcagtgcagcagaactttgccatgtgatgacaaac cctggagagaagctaacagatgacaaggttgatgaaatgatcagggaagcaggtattgat ggtgatggtcagccaaaatcaactagaaatcagaggcaaggaagccctttgattcagtat tcatatggattagagagaaatgtgactgtgggagaggatgggagagatacaactaaattg gctttgaaggtggaggaaggggaccgtcagccaaggaatgatttattgtttggagaacaa gtggaaattctgtttgatgagaacttggagcaaacgctttctccagtgctgacctggatc ctattcttggcaatgctcagagagaagaagaggatagatacatcactgagatgttga