GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:26:13

Sequence gi568815575f:147952213_148152758 : 200546 bp : 39.03% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Sngl +  16248  16550  303  2  0   88   54   244 0.981  16.58
 1.02 PlyA +  17133  17138    6                               1.05

 2.00 Prom +  17669  17708   40                              -6.15
 2.01 Sngl +  19439  19762  324  1  0   58   40   244 0.943  12.35
 2.02 PlyA +  20123  20128    6                               1.05

 3.00 Prom +  20887  20926   40                              -3.65
 3.01 Init +  21065  21225  161  1  2   83   68    61 0.676   2.94
 3.02 Intr +  24585  24729  145  0  1   68   47   131 0.947   6.36
 3.03 Intr +  29041  29467  427  0  1   -2  117   533 0.499  39.64
 3.04 Intr +  50989  51108  120  2  0   52  108   108 0.865   8.75
 3.05 Intr +  54490  54630  141  2  0   34  115    77 0.807   4.40
 3.06 Intr +  67945  68041   97  2  1   96   13    39 0.010  -4.65
 3.07 Term +  72653  72788  136  2  1   90   43   150 0.284   7.21
 3.08 PlyA +  73400  73405    6                               1.05

 4.03 PlyA -  74476  74471    6                               1.05
 4.02 Term -  92882  92681  202  1  1   82   38   194 0.994   9.68
 4.01 Init -  96243  96146   98  0  2   86   89    91 0.991   8.83
 4.00 Prom -  96911  96872   40                              -6.65

 5.00 Prom +  97289  97328   40                              -7.75
 5.01 Init +  98211  98364  154  2  1   99   61    28 0.506   1.40
 5.02 Term +  99959 100230  272  0  2   88   51   303 0.516  21.26
 5.03 PlyA + 100692 100697    6                               1.05

 6.06 PlyA - 100794 100789    6                               1.05
 6.05 Term - 107720 107396  325  1  1   58   49   223 0.016   8.65
 6.04 Intr - 118173 118082   92  0  2   64   55    75 0.003  -0.33
 6.03 Intr - 141064 140916  149  2  2   86   90    26 0.642   1.63
 6.02 Intr - 141637 141268  370  1  1   87  110   199 0.589  15.75
 6.01 Init - 141924 141919    6  0  0   96   93     0 0.824   2.21
 6.00 Prom - 142908 142869   40                              -5.55

 7.00 Prom + 147626 147665   40                              -6.45
 7.01 Sngl + 148583 149242  660  1  0   -7   54   278 0.177  10.82
 7.02 PlyA + 150653 150658    6                               1.05

 8.00 Prom + 158314 158353   40                              -2.85
 8.01 Init + 164220 164353  134  2  2   70  121   119 0.996  13.06
 8.02 Intr + 166751 166860  110  2  2   70   83   122 0.570   8.91
 8.03 Term + 168064 168185  122  2  2   98   40    10 0.228  -5.14
 8.04 PlyA + 168802 168807    6                               1.05

 9.02 PlyA - 168874 168869    6                               1.05
 9.01 Sngl - 174873 172951 1923  0  0   44   48   750 0.383  60.22
 9.00 Prom - 174966 174927   40                              -6.15

10.02 PlyA - 175135 175130    6                               1.05
10.01 Sngl - 176379 175363 1017  0  0   88   43   687 0.995  60.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  29602  29681   80  2  2  121   37     9 0.936  -3.95

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_1|100_aa
MGRNQSRKDENSKNQSAPSPPKDHSSSPAMEQSRTENDFDELTEAGFRKLVITNFSKLKE
DVRTHHKEAKNFEKRLDKWLTRINSVEKTLNDPMERKTMA

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_1|303_bp
atggggagaaaccagagcagaaaagatgaaaattccaaaaaccagagtgccccttctcct
ccaaaggatcacagctcctcaccagcaatggaacaaagcaggacggagaatgactttgac
gagttgacagaagcaggcttcagaaagttggtaataacaaacttctccaagctaaaggag
gatgttcgaacccatcacaaggaagctaaaaactttgaaaaaagattagacaaatggcta
actagaataaacagtgtagagaagaccttaaatgacccgatggagcggaaaaccatggca
tga

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_2|107_aa
MIIYLENPIISAQNLLKLISNFSKLSGYKVYVQKSQAFLYTNNRQTESQIMSELAFTIAT
KRIKYLGIQLTRDVKDLFKENYKPLVNEMKEEANGRTFHAHGKEESV

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_2|324_bp
atgattatatatttagaaaaccccatcatctcagcccaaaatctccttaaactgataagc
aacttcagcaagctctcagggtacaaagtctatgtgcaaaaatcacaagcattcttatac
accaataacagacaaacagagagccaaatcatgagtgaactcgcattcacaattgctaca
aagagaataaaatacctaggaatccaacttacaagggacgtgaaggacctcttcaaggag
aactacaaaccactggtcaatgaaatgaaagaggaggcaaatggaagaacattccatgct
catggaaaggaggaatcagtatag

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_3|408_aa
MDEAGNHHSEQTITRTENQTPHVLTHRWDLNNENTWTQRGEHHTLGPVMEWAVGKKKKAV
ARIYAESWQRLMLRLDGQGLGRSKFGVLQVWSKFGARKFEKELLGHGLPRKLLGHGLPRK
LLGHGLPRKLLGHGLPDRWAVRQRLSEAAPGAMSSHRRKAKGRNRRSHRAMRVAHLELAT
YELAATESNPESSHPGYEAAMADRPQPGWRESLKMRVSKPFGMLMLSIWILLFVCYYLSY
YLCSGSSYFVLANGHILPNSENAHGQSLEEDSALEALLNFFFPTTCNLRENQVAKPCNEL
QDLSESECLRHKCCFSSSGTTSFKCFAPFRDGRRALPCGNQHCRPTGSTANVLLKPKGSS
ISLCEPADDLQRQDNRVVTGLKKQRRKRKRKSEMLQKAARGREEHGDE

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_3|1227_bp
atggatgaagctggaaaccatcattcagaacaaactatcacaaggacagaaaaccaaaca
ccacatgttctcactcataggtgggatctgaacaatgaaaacacttggacacagcgtggg
gaacatcatacactggggcctgtcatggagtgggcggttgggaagaagaaaaaggctgta
gcaaggatctatgctgaatcatggcagagactaatgttgcggctggatggtcagggtctt
ggaagaagcaagtttggagtattgcaagtttggagcaagtttggagcaaggaaatttgag
aaagaacttctgggccacggactgccccggaagcttctgggccacggactgccccggaag
cttctgggccacggactgccccggaagcttctgggccacggactgccggaccgttgggct
gtgaggcagcgtctcagcgaggcggcacccggagccatgtcttcacataggaggaaagcg
aaggggaggaataggagaagtcaccgtgccatgcgtgtggctcacttagagctggcaact
tatgagttggcggcaactgagtcgaatcccgagagcagccatcctggatacgaggccgcc
atggctgacaggcctcagccaggatggcgggaatctctaaagatgcgggtcagcaaaccc
tttgggatgctcatgctctccatttggatcctgctgttcgtgtgctactacctgtcctac
tacctgtgctccgggtcctcatattttgtgcttgcaaatggacatatcctgcccaacagt
gaaaatgctcatggccaatctctggaagaagattccgcattggaagctttgctgaatttt
ttctttccaacaacttgcaatctgagggaaaatcaggtggcaaagccttgtaatgagctg
caagatcttagtgagagtgaatgtttgagacacaaatgctgtttttcatcatcggggacc
acgagcttcaaatgttttgctccatttagagatggcagaagagccctaccttgtggcaat
caacactgcaggcccacagggagtactgccaatgttctcttaaagcccaagggctcttct
atcagcttgtgcgaaccggccgatgatttacaaaggcaggacaacagagttgtaacgggt
ttgaagaaacaaagaaggaagcgaaagaggaagtctgaaatgttacagaaagcagcaaga
ggacgtgaggaacatggtgacgagtag

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_4|99_aa
MVEGKGGAKVCLTWQQAREHVQENRPLYNHQIFGHTAHGERIHALDGGRAQWLWDYALEF
SAALSEWKATWGRIQPVPIEGAFRPTLDKGKLSSPVVGT

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_4|300_bp
atggtggaaggcaaaggaggagcaaaggtatgtcttacatggcagcaggcaagagagcat
gtgcaggagaaccgccctttatacaaccatcagatctttggtcacacggcacatggagag
agaatccacgcacttgatggagggagagcacagtggttgtgggactatgcattggaattc
agtgctgccctgtcagagtggaaagcaacatggggcagaattcagccggtgcccatagag
ggagcatttagaccaaccttagacaaaggcaaattatccagcccagtggttggaacctaa

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_5|141_aa
MSQACNCGKLLVFCECQLASCSHLQPLPPQEFWFGSGLGFYKLQRELLGDHRPRSHRRRL
SLVAALTTASTSQVRQNYHQDSEAAINRQINLELYASYVYLSMSYYFDHDDVALKNFAKY
FLHQSHEEREHAEELMKLQNQ

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_5|426_bp
atgagccaagcttgtaattgcgggaagcttctagtcttctgtgagtgtcagcttgcctct
tgttcccaccttcaacccctgccaccccaagagttttggtttggatcaggactaggtttc
tataaactgcaacgtgagctgctaggtgatcacaggccgcgcagccaccgccgccgcctc
tccttagtcgccgccttgacgaccgcgtccacctcgcaggtgcgccagaactaccaccag
gactcagaggcggccatcaaccgccagatcaacctggagctctacgcctcctacgtttac
ctgtccatgtcttactactttgaccacgatgatgtggctttgaagaactttgccaaatac
tttcttcaccaatctcatgaggagagggaacatgctgaggaactgatgaagctgcagaac
caatga

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_6|313_aa
MTVHLLANPSQGSVAWPPHRSVHTVQPLLPKLGALLHLSAFWQPESPLDTPAHLEPNPTS
PEEEATSRSWCSRAAAHGLGMPSQDLWPALEKGRSSCSQKTETGEWHRFVAWCGTWVCLS
PQGWSEHKCKLEDTQRSHAAKNLSTARYFPPPSTGSQPKHHHQRTLLILPIVILRREVVL
QEVTLSWFKGKGNPTLFNFYLSENESKQIGGMAFCHYPRDLWNFELERDNLGHLVEEITK
QQSIQEVTEHKSLENLQPDNVIEKNNPFSGEKFKTAAEICISNEEPDVNHQDNGENVSRA
CQRPSWQLLLSQA

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_6|942_bp
atgactgtccacctgctcgccaacccttcccagggcagcgttgcctggccaccccaccgg
agtgtgcacacagtgcagcctctgctgcccaagctgggtgctttgctccacctgagtgca
ttctggcagcctgagagccctttggataccccagcacacttggaacccaaccccacgagt
ccagaagaggaagccacaagcaggtcctggtgctccagagctgcagcccatggcttggga
atgcccagccaggatctgtggccagcgcttgagaaaggaaggagctcatgctctcagaaa
actgagacgggtgagtggcaccggttcgtggcctggtgtgggacctgggtgtgcctctct
ccacagggctggtcagagcacaaatgcaaattggaagatacacaaaggagccacgcagct
aagaacctatctactgcccgctattttccaccgccatctactggatcgcagcctaaacac
caccaccaaagaacactgttaattctccccattgtgatactaaggagggaggtggtacta
caggaagtgactctgagctggtttaaaggaaaggggaatcctacactcttcaacttttat
ctgagtgaaaacgagagcaaacagattggtggcatggcattttgccactaccctagagat
ttgtggaacttcgaacttgagagagataatttagggcatctggtagaagaaattactaag
cagcaaagcattcaagaggtgacagagcataaaagtttggaaaatttgcagcctgacaat
gtgatagaaaagaataacccattttctggagagaaattcaagacagctgcagaaatttgc
ataagtaacgaggagccagatgtcaatcaccaagacaatggggaaaatgtctccagggca
tgtcagagaccttcatggcagctcctcctatcacaggcctga

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_7|219_aa
MHSQRDGLELQLMFKREAKHKSLENLQPNDVIEKKNPFSGEKFKPAAEICISNEEPNVNG
QYNGEMSPGHVRDLCGSPSHHRPGGPGQENGFLGQAQSSVPLCKSSLGTWCSASQPLWLQ
PWLKGAKVQFRPLLQRVQASSLGSFHMVLGLSVCRRQELSFGNLFLNFRGCIEMAGCVGR
SLLKGQSPCGEPLLGQCGREMWGQSAHTESSGGHCLVEL

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_7|660_bp
atgcattcacaaagagatggtttggaattgcaacttatgtttaaaagggaagccaagcat
aaaagcttggaaaatttgcagcctaatgatgtgatagaaaagaaaaacccattttctggg
gagaaattcaagccagctgcagaaatttgcataagtaatgaggagccaaatgttaatggt
caatacaatggggaaatgtctccaggtcatgtcagagacctttgtggcagcccctcccat
cacaggcctggaggcccagggcaggaaaatggtttcctgggccaggcccagagctctgtt
cctctgtgcaagagcagccttgggacctggtgctctgcatcccaacctctctggctccag
ccatggctaaaaggagccaaggtacagttcaggccattgcttcagagggtgcaagcctcg
agccttggcagctttcacatggttttgggcctgagcgtgtgcagaagacaagaactgagc
tttggaaacctcttcctaaatttcagaggatgtatagaaatggctggatgtgtaggcaga
agtctgctgaaggggcagagcccttgtggagaacctctgctagggcagtgtgggagggaa
atgtggggtcagagtgcccatacagagtcctcaggggggcactgcctagtggaactgtga

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_8|121_aa
MAESLWNTETHNGTIERRAKYPVSATLYPPLESAQSQEELLVGKRVITKEEIAADTQIRK
RKESKLGITENCHATKVNNKRGRAVCCRESVSFWEGEQSDCGTLHWNSVLPCHSSIWQKD
T

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_8|366_bp
atggctgaatccctgtggaatacagaaacccacaatggtaccatagagaggcgagcaaaa
tatcctgtgtctgccaccctgtatcctccactggaatcagctcaaagccaggaggaactt
cttgtgggaaaaagagtaatcacaaaggaagaaattgcagcagatacacaaataagaaag
agaaaagaatcaaaacttggcatcacagaaaactgccatgctacaaaggtgaacaacaag
agaggcagggcagtgtgctgcagagaatctgtgtccttctgggagggagagcaaagtgat
tgtgggactttgcactggaactcagtgctgccctgtcatagcagcatttggcagaaagat
acctga

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_9|640_aa
MGDFNTPLSTLDRSTRKKVNKDTQELNSALHQADLIDIYRTLHPKSTEYTFFSAPHHTYS
KIDHILGSKALLSKCKRPEIITNYLSDHSAIKLELRIKNLTQNRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETNENKDTTYQNLWDAFKAVCRGKFIALNAHKRKQERSKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRAELKEIGTQKTLQKINESRSWFFERINKIDRPLATLIK
KKRERNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLEEMDKFLDTYTLPRLN
QEEVESLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRYKQELVPFLLKLFQSTEKE
GILPNSFYEASIILIPKPGRDTPKKENFRPISLMNIDAKILNKILAKRIQQHIKKLIHHD
QVGFIPGMQGWFNIRKSINVIQHINRAKDKNHMIISIDAGKAFDKIQQPFMLKTLNKLGI
DGTYFKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQE
KEIKGIQLGKEEVKLSLFADDMIVYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFL
YTNNRQTESQIMSELPFTIAAKRIKYLGIQLTRDLKDLFK

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_9|1923_bp
atgggagactttaacaccccactgtcaacattagacagatcaacgagaaagaaagtcaac
aaggatacccaggaattgaactcagctctgcaccaagcagacctaatagacatctacaga
actctccaccccaagtcaacagaatatacatttttttcagcaccacaccacacctattcc
aaaattgaccacatacttggaagtaaagctctcctcagcaaatgtaaaagaccagaaatt
ataacaaactatctctcagaccacagtgcaatcaaactagaactcaggattaagaatctc
actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta
cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacgagaacaaagacaca
acataccagaatctctgggacgcattcaaagcagtgtgtagagggaaatttatagcacta
aatgcccacaagagaaagcaggaaagatccaaaattgacaccctaacatcacaattaaaa
gaactagaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa
atcagagcagaactgaaggaaatagggacacaaaaaacccttcaaaaaattaatgaatcc
aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaacactaataaag
aaaaaaagagagaggaatcaaatagatgcaataaaaaatgataaaggggatatcaccacc
gatcccacagaaatacaaactaccatcagagaatactacaaacacctctatgcaaataaa
ctagaaaatctagaagaaatggataaattcctcgacacatacactctcccaagactaaac
caggaagaagttgaatctctgaatagaccaataacaggatctgaaattgtggcaataatc
aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag
aggtacaagcaggaactggtaccattccttctgaaactattccaatcaacagaaaaagag
ggaatcctccctaactcattttatgaggccagcatcattctgataccaaagccaggcaga
gacacaccaaaaaaagagaattttagaccaatatccttgatgaacattgatgcaaaaatc
ctcaataaaatactggcaaaacgaatccagcagcacatcaaaaagcttatccaccatgat
caagtgggcttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgta
atccagcatataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagga
aaagcctttgacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtatt
gatgggacgtatttcaaaataataagagctatttatgacaaacccacagccaatatcata
ctgaatgggcaaaaactggaagcattccctttgaaaactggcacaagacagggatgccct
ctctcaccactcctattcaacatagtgttggaagttctggccagggcaattaggcaggag
aaggaaataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagat
gacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaagctgata
agcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattctta
tacaccaacaacagacaaacagagagccaaatcatgagtgaactcccattcacaattgct
gcaaagagaataaaatacctaggaatccaacttacaagggatctgaaggacctcttcaag
tag

>gi568815575f:147952213_148152758|GENSCAN_predicted_peptide_10|338_aa
MGKKQNRKAGNSKKQSASPPPKERSSSPATEQSWMENDFDELREEGFRRSNNSELREDIQ
TKGKEVENFEKNLEECITRITNTEKSLKELMELKTKAGELLEECSSLRSRCDQLEERVSA
MENEMNEMKQEGKFREKRIKRNEQSLQEIWDYVKRPDLRLIGVPESDGENGTKLENTLQD
IIQENVPNLARQANIQIQEIQRTPQRYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGWV
TLKGKPVRLTADLSAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKYFTDK
QMLRDFVTTRPALKKLLKEALNMERKNWYQPLQNHAKM

>gi568815575f:147952213_148152758|GENSCAN_predicted_CDS_10|1017_bp
atggggaaaaaacagaacagaaaagctggaaactctaaaaagcagagcgcctcccctcct
ccaaaggaacgcagttcctcaccagcaacggaacaaagctggatggagaatgactttgac
gagttgagagaagaaggcttcagacgatcaaataactctgagctacgggaggacatacaa
accaaaggcaaagaagttgaaaactttgaaaaaaatttagaagaatgtataactagaata
accaatacagagaagagcttaaaggagctgatggagctgaaaaccaaggctggagaacta
cttgaagaatgcagcagcctcaggagccgatgcgatcaactggaagaaagggtatcagca
atggaaaatgaaatgaatgaaatgaagcaagaagggaagtttagagaaaaaagaataaaa
agaaatgagcaaagcctccaagaaatatgggactatgtgaaaagaccagatctacgtctg
attggtgtacctgaaagtgatggggagaatggaaccaagttggaaaacacgctgcaggat
attatccaggagaacgtccccaatctagcaaggcaggccaacattcagattcaggaaata
cagagaactccacaaagatactcctcgagaagagcaactcccagacacataattgtcaga
ttcaccaaagttgaaatgaaggaaaaaatgttaagggcagccagagagaaaggttgggtt
accctcaaagggaagcccgtcagactaacagctgatctctcggcagaaaccctacaagcc
agaagagagtggggaccaatattcaacattcttaaagaaaagaattttcaacccagaatt
tcatatccagccaaactaagcttcataagtgaaggagaaataaaatactttacagacaag
caaatgctgagagattttgtcaccaccaggcctgccctaaaaaagctcctgaaggaagca
ctaaacatggaaaggaaaaactggtaccagccgctgcaaaatcatgccaaaatgtaa