GENSCAN 1.0 Date run: 6-Nov-116 Time: 01:09:40 Sequence gi568815583r:89984269_90202390 : 218122 bp : 49.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 PlyA - 397 392 6 1.05 1.04 Term - 18236 17916 321 2 0 100 39 134 0.871 4.52 1.03 Intr - 19707 19514 194 1 2 19 80 149 0.393 6.41 1.02 Intr - 22232 22114 119 1 2 70 95 51 0.648 4.11 1.01 Init - 25667 25648 20 2 2 79 110 33 0.635 3.14 1.00 Prom - 43282 43243 40 -4.56 2.07 PlyA - 43847 43842 6 1.05 2.06 Term - 49006 48862 145 0 1 73 41 137 0.723 4.88 2.05 Intr - 50791 50369 423 0 0 70 76 131 0.386 3.08 2.04 Intr - 59017 58735 283 2 1 73 98 119 0.157 7.88 2.03 Intr - 60273 60069 205 0 1 53 -1 87 0.066 -4.93 2.02 Intr - 62820 62692 129 0 0 114 75 145 0.889 16.69 2.01 Init - 67977 67912 66 0 0 51 94 55 0.267 2.47 2.00 Prom - 68297 68258 40 -7.96 3.00 Prom + 69832 69871 40 -6.46 3.01 Init + 71070 71092 23 2 2 54 94 39 0.031 0.26 3.02 Intr + 82842 84327 1486 0 1 111 116 2490 0.770 243.84 3.03 Intr + 88803 88994 192 2 0 110 75 263 0.998 26.89 3.04 Intr + 89848 90022 175 0 1 67 98 359 0.984 34.31 3.05 Term + 95392 95561 170 2 2 91 43 375 0.998 31.34 3.06 PlyA + 97905 97910 6 1.05 4.14 PlyA - 98811 98806 6 1.05 4.13 Term - 100085 99998 88 1 1 79 50 154 0.994 7.83 4.12 Intr - 100299 100239 61 1 1 71 54 45 0.673 -2.81 4.11 Intr - 100640 100531 110 1 2 86 54 185 0.647 14.83 4.10 Intr - 100830 100733 98 0 2 107 99 166 0.999 18.41 4.09 Intr - 101119 101007 113 2 2 50 113 113 0.998 10.10 4.08 Intr - 102995 102844 152 1 2 92 71 268 0.999 25.31 4.07 Intr - 103307 103171 137 2 2 86 42 259 0.999 20.47 4.06 Intr - 104234 104091 144 2 0 87 78 216 0.999 20.98 4.05 Intr - 104479 104319 161 2 2 88 96 197 0.981 20.21 4.04 Intr - 105899 105816 84 0 0 46 90 63 0.508 2.09 4.03 Intr - 106376 106211 166 2 1 96 86 248 0.990 24.93 4.02 Intr - 107376 107285 92 1 2 95 67 179 0.210 16.21 4.01 Init - 118122 118008 115 0 1 57 74 179 0.435 11.77 4.00 Prom - 127961 127922 40 -5.36 5.00 Prom + 138218 138257 40 -4.26 5.01 Init + 144335 144419 85 1 1 52 44 140 0.338 6.98 5.02 Intr + 149293 149364 72 2 0 82 102 57 0.277 5.88 5.03 Intr + 153949 154080 132 2 0 58 99 46 0.614 3.32 5.04 Term + 154497 154528 32 1 2 86 48 17 0.193 -4.48 5.05 PlyA + 154668 154673 6 1.05 6.03 PlyA - 155282 155277 6 1.05 6.02 Term - 157567 157395 173 2 2 109 53 62 0.067 2.79 6.01 Init - 181722 181683 40 0 1 63 119 53 0.310 6.25 6.00 Prom - 190526 190487 40 -5.96 7.06 PlyA - 191964 191959 6 1.05 7.05 Term - 200848 200347 502 0 1 65 37 213 0.229 7.85 7.04 Intr - 201242 200917 326 2 2 100 64 136 0.252 6.97 7.03 Intr - 206252 206126 127 1 1 112 85 11 0.402 3.88 7.02 Intr - 207698 207534 165 1 0 99 48 61 0.237 2.28 7.01 Init - 213526 213477 50 1 2 71 80 18 0.027 -0.28 7.00 Prom - 215140 215101 40 -5.66 8.00 Prom + 215757 215796 40 -5.16 8.01 Init + 217326 217467 142 2 1 93 92 287 0.336 27.90 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:89984269_90202390|GENSCAN_predicted_peptide_1|217_aa MGHRAVRETASFTRTGPAQCLAHHRLPMNEELLWSGTGWASILEQADTGNQARLEEENVT QTSNSPYPKESSSIPTALMQESQLQKVTGAPGGHLSGATDDRSEVLLRDWQLHPIRKMTA RRRREGDEEEGGLSRGRLKTGRRDPARHPHPHAPRQRAPVLGPRPAAPRRCPGGDPAAPS VRRPVALGDAPSPASGALSSPPPPAGWDAQQPHPLRE >gi568815583r:89984269_90202390|GENSCAN_predicted_CDS_1|654_bp atgggccaccgggcagtcagggagactgcaagcttcacgaggacgggacctgcacagtgt ctagcacatcacagactgccaatgaatgaggagctgctgtggtctggaacaggctgggcc agcatcctggagcaggcagacacaggcaaccaggccaggctggaggaagaaaatgtgacc caaacatccaattccccctacccaaaagaaagcagctctatccccacggccctcatgcag gaaagccagctccagaaggtcactggggcaccaggtgggcacctctctggagctacagat gaccgatcagaggttctgctccgagactggcagctccatccaattaggaaaatgaccgcg agaagacggagagagggcgacgaggaggaaggggggctgagccgcgggaggctcaagacg ggccgtagggaccccgcccgccacccccacccccacgcccccagacagcgagcgccggtc ctgggtccccggcccgcggcgccccgacgctgtcccggcggagatcccgcggcgcccagc gtgcggcggccagtggccctcggggacgcccccagccccgccagcggcgcgctcagctct cccccacccccggccggctgggacgcgcaacaaccccacccgctccgagaataa >gi568815583r:89984269_90202390|GENSCAN_predicted_peptide_2|416_aa MGMKKGALGLQEPSSCVSVWLQASVAVRGHLHGRKETFLPKLEKPDGLAGLPWQLPGCQQ YAEKQDVSTQAAKPKVAGPLFGEGYPAQLSVQLVSCGPAPFLTLSPCVMFTRTPPSPALA GNGDTTLRQSGEARLLYESPRALNQVAGQMRGFSVRQLPWLEVKAVATHPAWAVSTQFSG CLHTGGARSPVGCGVKGILSQPWDGLSPRGTAPKKPAACSGWALTPGGDPKSQYYTGLFT LLEPPSDDGRQRPSRHPFYQQGSPGAPAIKPLSLSEQRVLTASFLGACNSEDPNLFSPTL GRCGRGLSPQQAPGVQRKCTCRQNLRQVEPGAPPTARTKKAQKACKGVLGSRVAGEQAGL SEGVRRAGGSNINKQFSPESCHVGSTVLTLAHTLFEMKPPIAHLDVILPMSNYRSF >gi568815583r:89984269_90202390|GENSCAN_predicted_CDS_2|1251_bp atggggatgaagaaaggggccctaggtcttcaggagccctcgagctgtgtgagtgtctgg ctgcaggccagcgtggctgtgcgtgggcatcttcatgggcgtaaggagacgttcttgccc aaactggaaaaacctgatggccttgcaggcctgccctggcagcttccgggttgccagcag tatgctgagaagcaggatgtgtcaacacaggctgccaaacccaaggtggcgggcccctta tttggtgagggttatccagctcagctgtcggtccagcttgtgtcgtgtggaccggccccc ttcctcacactcagcccctgtgtcatgttcacacggacgccacctagccccgccttggca gggaatggcgacaccacgctccgccagagtggggaggcccgtctcctctatgagagccct cgggcccttaaccaggtagcggggcagatgagggggttctcagtgaggcagctgccctgg ctggaggtgaaagccgtggcaactcaccctgcctgggccgtgtccacccagttctccggc tgcctgcacacaggaggtgctcggtccccggtggggtgtggggtgaagggcattctgagt cagccctgggatgggctgtcccctcgagggactgctccaaagaagcccgccgcctgctca ggctgggccctcactccaggaggggatccaaaatcccagtactatacgggcctctttacc ttgctagaacctccttctgatgacggaaggcagaggccctccagacaccccttctaccag cagggaagccctggggctccggccatcaagcctctttctctgagtgaacagagggtccta acagcatcgtttctgggagcttgcaactcagaagatccaaatctcttcagcccgaccttg gggcgatgtggaagaggtctaagtccccaacaggcgccaggtgtgcaaaggaaatgcacc tgcaggcaaaacctgaggcaggtggagccgggtgccccacccacggccagaacaaagaag gctcagaaggcctgcaagggtgtcctgggctctagggtagcaggagagcaggcgggcctc tcagaaggggtgagaagagctggcggatctaacatcaacaagcagtttagtcctgagtcc tgccacgtggggtcaacagtgctcaccctggcccacacgctgttcgagatgaagccacca atcgcccatttggatgtcatcctgccaatgtcaaactacagaagtttctaa >gi568815583r:89984269_90202390|GENSCAN_predicted_peptide_3|681_aa MCEEAVCSDALLASRHGMEGFMDSGTQTDAVVVLSLAQAAVLGLVSENELFGATISAEAF YPDLGPELSGAAMGEPEPPGPDVYQLACNGRALEEPAEEEVLEVEAACEKHTRRKTRPPV RLVPKVKFEKVEEEEQEVYEVSVPGDDKDAGPAEAPAEAASGGCDALVQSSAVKMIDLSA FSRKPRTLRHLPRTPRPELNVAPYDPHFPAPARDGFPEPSMALPGPEALPTECGFEPPHL APLSDPEAPSMESPEPVKPEQGFVWQEASEFEADTAGSTVERHKKAQLDRLDINVQIDDS YLVEAGDRQKRWQCRMCEKSYTSKYNLVTHILGHNGIKPHSCPHCSKLFKQPSHLQTHLL THQGTRPHKCQVCHKAFTQTSHLKRHMLLHSEVKPYSCHFCGRGFAYPSELKAHEVKHES GRCHVCVECGLDFSTLTQLKRHLASHQGPTLYQCLECDKSFHYRSQLQNHMLKHQNVRPF VCTECGMEFSQIHHLKQHSLTHKGVKEFKCEVCGREFTLQANMKRHMLIHTSVRPYQCHI CFKTFVQKQTLKTHMIVHSPVKPFKCKVCGKSFNRMYNLLGHMHLHAGSKPFKCPYCSSK FNLKGNLSRHMKVKHGVMDIGLDSQDPMMELTGTDPSELDGQQEMEDFEENAYSYASVDS SAEASVLTEQAMKEMAYYNVL >gi568815583r:89984269_90202390|GENSCAN_predicted_CDS_3|2046_bp atgtgtgaagaggccgtgtgcagcgatgccctcctagctagccgtcacgggatggagggc ttcatggactcagggacacagacggacgccgtggtggtgctgtccttggctcaggccgcc gtgcttggcctggtctccgaaaatgagctctttggagctaccataagcgccgaggccttc tacccggacctggggcccgagctttcaggggcagccatgggagagcccgagccaccaggc cccgacgtctaccagctggcctgcaacgggagggccttggaggagccggcggaggaggag gtgctggaggtggaggcagcctgtgagaagcacacccggcggaagacgcggccacctgtg cggttggtgcccaaggtcaagttcgagaaggtggaggaggaggaacaggaggtctatgag gtttctgtgccaggtgacgacaaggacgcagggccagcagaagcccccgccgaggcggcc agtggcggctgcgacgccctggtgcagagcagcgccgtcaagatgatcgacctcagcgcc ttcagccgcaagccccggacgctccggcatctgccccgaaccccgaggccggagctgaac gtggccccatatgaccctcacttcccggccccggcccgggatggcttccccgagcccagc atggcgctgcctgggccagaggccttgcccacagagtgtgggttcgagccaccccacctg gcccccctgagtgaccccgaggcccccagcatggagtccccggagcctgtcaagccggaa cagggcttcgtgtggcaggaggccagtgagttcgaggctgacacggcgggttcgaccgtg gaacgccacaagaaggcccagctggatcggctggacatcaacgtgcagattgacgactcc tatctggtggaggcgggcgaccgccagaagcgctggcagtgccgcatgtgcgagaagtcc tacacgtccaagtacaacctggtgacgcacatcctgggccacaacggcatcaagccacac tcgtgcccacactgcagcaagctcttcaagcagcccagccacctgcagacgcacctgctg acgcaccagggcacccggccccacaagtgccaggtatgccacaaggccttcacgcagacc agccacctcaagcgccacatgctgctgcactcggaggtcaagccctacagctgccacttc tgcggccgcggcttcgcctaccccagcgagctcaaggcccacgaagtgaagcatgagagt ggccgctgccatgtctgcgtcgagtgcggcctggacttctccaccctgacccagctcaag cgccacctggcctcccaccagggccccaccctctaccagtgcctcgagtgtgacaagtcc ttccactaccgcagccagttgcagaaccacatgctcaagcaccagaacgtgcgacccttc gtgtgcactgaatgcggcatggagttcagccagattcaccacctcaagcagcactccctc acccacaagggcgtgaaggagttcaagtgcgaggtgtgtggccgggagttcaccctacag gcgaacatgaagcggcacatgctgatccacaccagcgtccggccctaccagtgccacatc tgcttcaagacctttgtacagaagcagactctcaagacccacatgattgtacactcgccc gtgaagccattcaaatgcaaggtgtgcgggaagtccttcaaccgcatgtacaacctgctg ggccacatgcacctgcacgccggcagcaagcccttcaagtgcccctactgctccagcaag tttaatctcaagggcaacctgagccggcacatgaaggtcaagcatggcgtcatggacatc ggcctggacagccaagaccccatgatggagctgacaggcactgacccttcagagctcgac ggccagcaggagatggaggacttcgaggagaacgcctacagctatgcgagcgtggacagc agcgccgaggccagtgtcctcactgaacaggccatgaaagagatggcctactacaatgtg ctatag >gi568815583r:89984269_90202390|GENSCAN_predicted_peptide_4|506_aa MAGYLRVVRSLCRASGSRPAWAPAALTAPTSQEQPRRHYADKRIKVAKPVVEMDGDEMTR IIWQFIKEKLILPHVDIQLKYFDLGLPNRDQTDDQVTIDSALATQKYSVAVKCATITPDE ARVEGQWCFCFTLALTVVGVIGDRAEEEEEEEEFKLKKMWKSPNGTIRNILGGTVFREPI ICKNIPRLVPGWTKPITIGRHAHGDQYKATDFVADRAGTFKMVFTPKDGSGVKEWEVYNF PAGGVGMGMYNTDESISGFAHSCFQYAIQKKWPLYMSTKNTILKAYDGRFKDIFQEIFDK HYKTDFDKNKIWYEHRLIDDMVAQVLKSSGGFVWACKNYDGDVQSDILAQGFGSLGLMTS VLVCPDGKTIEAEAAHGTVTRHYREHQKGRPTSTNPIASIFAWTRGLEHRGKLDGNQDLI RFAQMLEKVCVETVESGAMTKDLAGCIHGLSKCVAWGGEQLPGKTDPAGACRISVASCVK LNEHFLNTTDFLDTIKSNLDRALGRQ >gi568815583r:89984269_90202390|GENSCAN_predicted_CDS_4|1521_bp atggccggctacctgcgggtcgtgcgctcgctctgcagagcctcaggctcgcggccggcc tgggcgccggcggccctgacagcccccacctcgcaagagcagccgcggcgccactatgcc gacaaaaggatcaaggtggcgaagcccgtggtggagatggatggtgatgagatgacccgt attatctggcagttcatcaaggagaagctcatcctgccccacgtggacatccagctaaag tattttgacctcgggctcccaaaccgtgaccagactgatgaccaggtcaccattgactct gcactggccacccagaagtacagtgtggctgtcaagtgtgccaccatcacccctgatgag gcccgtgtggaaggccagtggtgtttctgcttcactctggctctgacagtggtaggtgtc atcggtgacagggctgaggaggaggaggaggaggaagagttcaagctgaagaagatgtgg aaaagtcccaatggaactatccggaacatcctgggggggactgtcttccgggagcccatc atctgcaaaaacatcccacgcctagtccctggctggaccaagcccatcaccattggcagg cacgcccatggcgaccagtacaaggccacagactttgtggcagaccgggccggcactttc aaaatggtcttcaccccaaaagatggcagtggtgtcaaggagtgggaagtgtacaacttc cccgcaggcggcgtgggcatgggcatgtacaacaccgacgagtccatctcaggttttgcg cacagctgcttccagtatgccatccagaagaaatggccgctgtacatgagcaccaagaac accatactgaaagcctacgatgggcgtttcaaggacatcttccaggagatctttgacaag cactataagaccgacttcgacaagaataagatctggtatgagcaccggctcattgatgac atggtggctcaggtcctcaagtcttcgggtggctttgtgtgggcctgcaagaactatgac ggagatgtgcagtcagacatcctggcccagggctttggctcccttggcctgatgacgtcc gtcctggtctgccctgatgggaagacgattgaggctgaggccgctcatgggaccgtcacc cgccactatcgggagcaccagaagggccggcccaccagcaccaaccccatcgccagcatc tttgcctggacacgtggcctggagcaccgggggaagctggatgggaaccaagacctcatc aggtttgcccagatgctggagaaggtgtgcgtggagacggtggagagtggagccatgacc aaggacctggcgggctgcattcacggcctcagcaagtgcgtggcctggggtggagagcag ttacctggcaagacagaccctgcgggtgcctgccgaatctcggtggcctcatgtgtgaag ctgaacgagcacttcctgaacaccacggacttcctcgacaccatcaagagcaacctggac agagccctgggcaggcagtag >gi568815583r:89984269_90202390|GENSCAN_predicted_peptide_5|106_aa MTVRGKLRSGQVNEHRKVSQGSGNQEAIRERSTTTLGLRTNQPKEHLTNFKSARLHAGEI NALLLTQSLLVDSLHTDTHDRYSTHRYRGPTVFSQAGTNGFRSYGV >gi568815583r:89984269_90202390|GENSCAN_predicted_CDS_5|321_bp atgactgtgcgggggaagctccgaagtggacaagtcaacgagcaccggaaggtgtcccag ggatctggcaatcaagaggccatcagagaaagatccaccacgaccttgggtcttcggacc aaccagcccaaggaacatctcaccaattttaaatcggcccgcctgcacgcaggtgaaata aacgccttgttgctcacacaaagcctgttggtggactctcttcacacggacacgcatgac agatacagcacccacagatacagagggccaactgtattttcccaggctggtacaaatgga tttagatcttatggagtctga >gi568815583r:89984269_90202390|GENSCAN_predicted_peptide_6|70_aa MLAPFPGYTDEEAELADSLGTSHSPVPSSPCRSCHKLAAGVWRTSNISKDVRLWEAEDFT ARGESKPSGR >gi568815583r:89984269_90202390|GENSCAN_predicted_CDS_6|213_bp atgcttgcaccctttccaggatatacggatgaggaggctgagttggctgactcattaggc acttcccacagtccagttccttcatccccctgccgctcctgccacaagcttgctgctgga gtctggaggacctcaaatatttccaaggacgtcagactttgggaggctgaggacttcacg gcccgaggggagagcaagccgtccggccgctga >gi568815583r:89984269_90202390|GENSCAN_predicted_peptide_7|389_aa MITIPYAFIQAIDKNIGGWAAAQLQHQYMLKRKRHQLAHPVYSFKSANGCQVLRAGSFLG AAPSAAEGIHPGEEELSDPGVSPSPQVTWWLPLLLPDRALLLVLMLIPESGQEQPSPSST LVLSEKKEAMTKGLTRRSYPRPFRRGPGLFRIPNDFPPGPGPGFRSRGGALARCERCSKA PGGHGNAGPASCAAMATGSPGGGPQCPSERAGGAFWGEGPGGSGSHLARHPPRLPGVSQG QRTVIAPRGAALPGVSTDSSPLPGRPGAAPGDPESRAHSLSPARRPRKCWAPAKPRPAPP RSARVYGNCPEGASGICLMRAVPPAPPPPPAPGHLHQRWTRCHDHPGLSGPPPHDRWSAQ VSRGAQGTVRSFTLGDSASLDLGHRGRPH >gi568815583r:89984269_90202390|GENSCAN_predicted_CDS_7|1170_bp atgataaccattccttatgcctttattcaagctattgataaaaatattgggggctgggca gcagctcagctgcagcatcagtacatgttgaagaggaagcgacaccagctggcacaccct gtgtattctttcaaatcagccaatggctgccaagtcctcagggctgggagcttccttggg gcagcgccttctgcagcagagggcatccaccccggggaggaggagctctctgaccctgga gtttccccatcacctcaggtcacctggtggctgccactcttactcccagacagagccttg ctcttggttctgatgctgatcccagagagtgggcaggaacagccttctccttctagcacc ttggtgctcagtgagaagaaagaagccatgacaaagggcctaactcggaggtcatatcca cgaccattccgcaggggcccaggccttttccgaattccaaatgacttccccccaggacca ggtcccgggtttaggagtaggggtggtgcactggcgcgatgtgaacgctgctcaaaggcc cccggtggccatgggaacgcgggcccagcctcctgcgctgccatggcaacaggctccccg ggcggcgggccccagtgcccatcagagcgtgcaggtggcgccttctggggagaggggccg ggcgggagcggctcccacctcgccaggcaccccccgcgcctacctggagtgagtcagggt cagcgcacggtcatcgccccccgcggggccgctcttcctggagtgtccactgactcgtcc cccttgccaggccggcccggcgcagcccctggggacccggagtctcgggcgcacagcctc agtccggcgcggcgtccccggaaatgctgggcccccgccaagccccgcccagccccgccc cgctctgcccgtgtttacgggaattgtccagagggcgcctccgggatttgcctgatgcgg gccgtgccccccgccccaccaccgccaccggcccccggccatttgcaccaacgatggact cgatgccacgaccaccctggcctgtcgggtccgcctccccacgaccgctggtcggcccaa gttagccgtggagcccaagggactgtgcggtccttcaccttgggagactcggcctcgctg gacctgggccacagggggcggcctcactag >gi568815583r:89984269_90202390|GENSCAN_predicted_peptide_8|48_aa MGLRSWLAAPWGALPPRPPLLLLLLLLLLLQPPPPTWALSPRISLPLX >gi568815583r:89984269_90202390|GENSCAN_predicted_CDS_8|144_bp atgggcctgaggagctggctcgccgccccatggggcgcgctgccgcctcggccaccgctg ctgctgctcctgctgctgctgctcctgctgcagccgccgcctccgacctgggcgctcagc ccccggatcagcctgcctctggnn