GENSCAN 1.0 Date run: 3-Nov-116 Time: 01:39:44 Sequence gi568815593r:138773013_138974557 : 201545 bp : 43.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8911 9017 107 2 2 111 110 58 0.828 10.23 1.02 Intr + 10165 10360 196 0 1 49 100 256 0.995 21.89 1.03 Intr + 37026 37192 167 1 2 83 81 158 0.025 14.28 1.04 Intr + 39171 39290 120 2 0 90 92 128 0.935 14.09 1.05 Intr + 51518 51787 270 1 0 84 105 194 0.947 18.44 1.06 Intr + 54503 54706 204 1 0 76 82 371 0.961 34.70 1.07 Term + 81744 81842 99 2 0 87 48 81 0.165 2.13 1.08 PlyA + 83342 83347 6 1.05 2.02 PlyA - 84889 84884 6 1.05 2.01 Sngl - 101512 99998 1515 1 0 50 34 788 0.671 63.01 2.00 Prom - 103049 103010 40 -7.76 3.00 Prom + 105008 105047 40 -6.46 3.01 Init + 106831 106833 3 0 0 58 103 0 0.378 -1.50 3.02 Intr + 113200 113280 81 0 0 108 89 59 0.883 7.83 3.03 Intr + 114478 114630 153 0 0 21 107 184 0.771 13.87 3.04 Intr + 131337 131429 93 2 0 77 103 43 0.562 4.86 3.05 Intr + 144730 144886 157 0 1 64 95 164 0.664 14.28 3.06 Intr + 151498 151698 201 2 0 99 96 247 0.999 25.86 3.07 Intr + 152244 152395 152 1 2 99 96 218 0.999 23.58 3.08 Intr + 153337 153525 189 0 0 58 16 116 0.576 1.18 3.09 Intr + 156234 156344 111 2 0 81 90 132 0.996 13.28 3.10 Intr + 157461 157642 182 2 2 53 105 233 0.903 20.17 3.11 Intr + 157818 157923 106 0 1 105 103 112 0.998 14.62 3.12 Intr + 159566 159700 135 1 0 71 86 219 0.936 20.76 3.13 Term + 160790 161077 288 1 0 106 41 391 0.999 31.58 3.14 PlyA + 161302 161307 6 1.05 4.03 PlyA - 161694 161689 6 -0.45 4.02 Term - 163475 163305 171 2 0 63 42 100 0.353 0.73 4.01 Init - 165367 165299 69 1 0 90 95 80 0.816 9.95 4.00 Prom - 166764 166725 40 -6.56 5.07 PlyA - 167256 167251 6 1.05 5.06 Term - 174461 174105 357 2 0 24 54 642 0.995 48.91 5.05 Intr - 175275 175195 81 0 0 74 35 78 0.469 0.93 5.04 Intr - 178323 178159 165 0 0 113 113 324 0.999 37.56 5.03 Intr - 178904 178776 129 2 0 75 93 224 0.985 22.49 5.02 Intr - 181296 181232 65 1 2 80 77 39 0.885 0.44 5.01 Init - 186141 186075 67 0 1 85 116 46 0.923 8.34 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 37094 37192 99 1 0 100 81 168 0.972 15.83 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:138773013_138974557|GENSCAN_predicted_peptide_1|387_aa XMTAVHAGNINFKWDPKSLEIRTLAVERLLEPLVTQVTTLVNTNSKGPSNKKRGRSKKAH VLAASVEQATENFLEKGDKIAKESQFLKEELVAAVEDVRKQGDLMKAAAGEFADDPCSSV KRGNMVRAARALLSAVTRLLILADMADVYKLLVQLKVVEDGILKLRNAGNEQDLGIQYKA LKPEVDKLNIMAAKRQQELKDVGHRDQMAAARGILQKNVPILYTASQACLQHPDVAAYKA NRDLIYKQLQQAVTGISNAAQATASDDASQHQGGGGGELAYALNNFDKQIIVDPLSFSEE RFRPSLEERLESIISGAALMADSSCTRDDRRERIVAECNAVRQALQDLLSEYMGNTLFIK KGNDGLCNNVILWMHIQCTSYSTGHTL >gi568815593r:138773013_138974557|GENSCAN_predicted_CDS_1|1164_bp naaatgactgctgtccatgcaggcaacataaacttcaagtgggatcctaaaagtctagag atcaggactctggcagttgagagactgttggagcctcttgttacacaggttacaaccctt gtaaacaccaatagtaaagggccctctaataagaagagaggtcgttctaagaaggcccat gttttggctgcatctgttgaacaagcaactgagaatttcttggagaagggggataaaatt gcgaaggagagccagtttctcaaggaggagcttgtggctgctgtagaagatgttcgaaaa caaggtgatttgatgaaggctgctgcaggagagttcgcagatgatccctgctcttctgtg aagcgaggcaacatggttcgggcagctcgagctttgctctctgctgttacccggttgctg attttggctgacatggcagatgtctacaaattacttgttcagctgaaagttgtggaagat ggtatcttgaagttgaggaatgctggcaatgaacaagacttaggaatccagtataaagcc ctaaaacctgaagtggataagctgaacattatggcagccaaaagacaacaggaattgaaa gatgttggccatcgtgatcagatggctgcagctagaggaatcctgcagaagaacgttccg atcctctatactgcatcccaggcatgcctacagcaccctgatgtcgcagcctataaggcc aacagggacctgatatacaagcagctgcagcaggcggtcacaggcatttccaatgcagcc caggccactgcctcagacgatgcctcacagcaccagggtggaggaggaggagaactggca tatgcactcaataactttgacaaacaaatcattgtggaccccttgagcttcagcgaggag cgctttaggccttccctggaggagcgtctggaaagcatcattagtggggctgccttgatg gccgactcgtcctgcacgcgtgatgaccgtcgtgagcgaattgtggcagagtgtaatgct gtccgccaggccctgcaggacctgctttcggagtacatgggcaatacactgttcattaaa aagggtaatgatggcttatgtaataatgtcatcctgtggatgcacatccagtgtacttca tactccacgggacatacactttga >gi568815593r:138773013_138974557|GENSCAN_predicted_peptide_2|504_aa MLAAIYAMSMVLKMLPALGMACPPKCRCEKLLFYCDSQGFHSVPNATDKGSLGLSLRHNH ITELERDQFASFSQLTWLHLDHNQISTVKEDAFQGLYKLKELILSSNKIFYLPNTTFTQL INLQNLDLSFNQLSSLHPELFYGLRKLQTLHLRSNSLRTIPVRLFWDCRSLEFLDLSTNR LRSLARNGFAGLIKLRELHLEHNQLTKINFAHFLRLSSLHTLFLQWNKISNLTCGMEWTW GTLEKLDLTGNEIKAIDLTVFETMPNLKILLMDNNKLNSLDSKILNSLRSLTTVGLSGNL WECSARICALASWLGSFQGRWEHSILCHSPDHTQGEDILDAVHGFQLCWNLSTTVTVMAT TYRDPTTEYTKRISSSSYHVGDKEIPTTAGIAVTTEEHFPEPDNAIFTQRVITGTMALLF SFFFIIFIVFISRKCCPPTLRRIRQCSMVQNHRQLRSQTRLHMSNMSDQGPYNEYEPTHE GPFIIINGYGQCKCQQLPYKECEV >gi568815593r:138773013_138974557|GENSCAN_predicted_CDS_2|1515_bp atgctggcagcaatatatgcaatgagtatggttttaaaaatgctgcctgccctgggtatg gcgtgtccacccaaatgccgctgcgagaagctgctcttctactgcgactctcagggcttc cactcagtgccaaacgccacagacaagggctctctgggcctgtccctgaggcacaatcac atcacagagctcgaaagagatcaatttgccagcttcagtcaacttacttggctccactta gatcacaatcaaatttcaacagtaaaagaagatgcttttcaaggactatataaacttaag gaattaatcttaagttccaacaaaatattttacttgccaaacacaacttttacccaactg attaacctgcaaaatttggacctgtcttttaatcagctgtcatctctgcacccagagctc ttctatggccttcggaagctgcagaccttgcatttacgttccaactccctgcggactatc ccagtacgcctgttctgggactgtcgtagtctggagtttctggatttgagcacaaatcgt ttgcgaagtttggctcgcaatggatttgcaggattaattaaactgagagagcttcaccta gagcacaaccagctgacgaagattaattttgctcatttcctacggctaagcagtctgcac acgctcttcttacaatggaacaaaatcagcaacttgacatgtgggatggagtggacctgg ggcactttagaaaagctagacctgactggaaatgaaatcaaagccatcgacttgacagtg tttgaaacgatgcccaatcttaaaatactcctcatggataacaacaagttaaacagcctt gattccaagatcttaaactccctgagatccctcacaaccgttggtctctctggcaatctg tgggaatgcagcgcccgaatatgtgctctggcctcctggctgggcagtttccaaggtcgg tgggaacactccatcctatgccacagtcctgaccacacccaaggagaggatattctagat gcagtccatggatttcagctctgctggaatttgtcaaccactgtcactgtcatggctaca acttatagagatccaaccactgaatatacaaaaagaataagctcatcaagttaccatgtg ggagacaaagaaatcccaactactgcaggcatagcagttactaccgaggaacactttcct gaaccagacaatgccatcttcactcagcgggtaattacgggaacaatggctttattgttt tctttcttttttattatttttatagtgttcatctccaggaagtgctgccctcccacttta agaagaattaggcagtgctcaatggttcagaaccacaggcagctccgatcccaaacacga ctccatatgtcaaacatgtcagaccaaggaccgtataatgaatatgaacccacccatgaa ggacccttcatcatcattaatggttatggacagtgcaagtgtcagcagctgccatacaaa gaatgtgaagtataa >gi568815593r:138773013_138974557|GENSCAN_predicted_peptide_3|616_aa MAGRKERSDALNSAIDKMTKKTRDLRRQLRKAVMDHVSDSFLETNVPLLVLIEAAKNGNE KEVKEYAQVFREHANKLIEVANLACSISNNEEGVKLVRMSASQLEALCPQVINAALALAA KPQSKLAQENMDLFKEQWEKQVRVLTDAVDDITSIDDFLAVSENHILEDVNKCVIALQEK DVDGLDRTAGAIRGRAARVIHVVTSEMDNYEPGVYTEKVLEATKLLSNTVMPRFTEQVEA AVEALSSDPAQPMDENEFIDASRLVYDGIRDIRKAVLMIRVQSWQDPALLSTVQMATATL MEPQKPRPFVRPCCPATGVHPPTHTCAPVLFGPLADFVEDTEATPEELDDSDFETEDFDV RSRTSVQTEDDQLIAGQSARAIMAQLPQEQKAKIAEQVASFQEEKSKLDAEVSKWDDSGN DIIVLAKQMCMIMMEMTDFTRGKGPLKNTSDVISAAKKIAEAGSRMDKLGRTIADHCPDS ACKQDLLAYLQRIALYCHQLNICSKVKAEVQNLGGELVVSGVDSAMSLIQAAKNLMNAVV QTVKASYVASTKYQKSQGMASLNLPAVSWKMKAPEKKPLVKREKQDETQTKIKRASQKKH VNPVQALSEFKAMDSI >gi568815593r:138773013_138974557|GENSCAN_predicted_CDS_3|1851_bp atggctggacgtaaagaaagaagtgatgcactcaattctgcaatagataaaatgaccaag aagaccagggacttgcgtagacagctccgcaaagctgtcatggaccacgtttcagattct ttcctggaaaccaatgttccacttttggtattgattgaagctgcaaagaatggaaatgag aaagaagttaaggagtatgcccaagttttccgtgaacatgccaacaaattgattgaggtt gccaacttggcctgttccatctcaaataatgaagaaggtgtaaagcttgttcgaatgtct gcaagccagttagaagccctctgtcctcaggttattaatgctgcactggctttagcagca aaaccacagagtaaactggcccaagagaacatggatctttttaaagaacaatgggaaaaa caagtccgtgttctcacagatgctgtcgatgacattacttccattgatgacttcttggct gtctcagagaatcacattttggaagatgtgaacaaatgtgtcattgctctccaagagaag gatgtggatggcctggaccgcacagctggtgcaattcgaggccgggcagcccgggtcatt cacgtagtcacctcagagatggacaactatgagccaggagtctacacagagaaggttctg gaagccactaagctgctctccaacacagtcatgccacgttttactgagcaagtagaagca gccgtggaagccctcagctcggaccctgcccagcccatggatgagaatgagtttatcgat gcttcccgcctggtatatgatggcatccgggacatcaggaaagcagtgctgatgataagg gtgcagtcctggcaagatcctgctctcctgagcacagtgcagatggccacagccacgctg atggagccgcagaagccgcggccatttgtgcgcccttgctgccctgcaacaggcgtccac ccacccacccacacctgtgcccctgtcctgtttggcccacttgccgatttcgttgaggat actgaggcaacccctgaggagttggatgactctgactttgagacagaagattttgatgtc agaagcaggacgagcgtccagacagaagacgatcagctgatagctggccagagtgcccgg gcgatcatggctcagcttccccaggagcaaaaagcgaagattgcggaacaggtggccagc ttccaggaagaaaagagcaagctggatgctgaagtgtccaaatgggacgacagtggcaat gacatcattgtgctggccaagcagatgtgcatgattatgatggagatgacagactttacc cgaggtaaaggaccactcaaaaatacatcggatgtcatcagtgctgccaagaaaattgct gaggcaggatccaggatggacaagcttggccgcaccattgcagaccattgccccgactcg gcttgcaagcaggacctgctggcctacctgcaacgcatcgccctctactgccaccagctg aacatctgcagcaaggtcaaggccgaggtgcagaatctcggcggggagcttgttgtctct ggggtggacagcgccatgtccctgatccaggcagccaagaacttgatgaatgctgtggtg cagacagtgaaggcatcctacgtcgcctctaccaaataccaaaagtcacagggtatggct tccctcaaccttcctgctgtgtcatggaagatgaaggcaccagagaaaaagccattggtg aagagagagaaacaggatgagacacagaccaagattaaacgggcatctcagaagaagcac gtgaacccggtgcaggccctcagcgagttcaaagctatggacagcatctaa >gi568815593r:138773013_138974557|GENSCAN_predicted_peptide_4|79_aa MERRLQMGALEHLNVRGHEEKELNTDMMGNHPPYQVLREHHGFHLDAARRYRSGEASYYQ AVSCSVERAMWHGTEAFCQ >gi568815593r:138773013_138974557|GENSCAN_predicted_CDS_4|240_bp atggagaggaggctccagatgggagccctggagcacctcaatgtcaggggccacgaggag aaggagctgaatacagacatgatggggaatcaccctccatatcaggttttaagggaacac catggcttccatctggacgctgctcgcagataccgctctggggaagccagctactaccaa gctgtgagctgctctgtggaacgggccatgtggcatggaactgaggccttctgtcaatag >gi568815593r:138773013_138974557|GENSCAN_predicted_peptide_5|287_aa MADSMAGTGKEQYLVQEHPVPEDLAVGENMNLLPYEPDFSNLQKGSHAVLCFFPSNPKVQ VEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKLGGLQVLRTLVQ EKGTEVLAVRVVTLLYDLVTEKDAGLHPSSVSSASPNQPHDFLLPQEAQMFAEEEAELTQ EMSPEKLQQYRQVHLLPGLWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQ DPQLGRTLASLQAEYQVLASLELQDGEDEGYFQELLGSVNSLLKELR >gi568815593r:138773013_138974557|GENSCAN_predicted_CDS_5|864_bp atggctgattccatggctggcacgggaaaagagcaatatcttgtacaagaacatcctgtg ccagaagacttggctgtgggtgaaaacatgaatttattgccttatgaacctgatttcagc aatttacaaaagggtagtcatgctgtcctgtgcttcttccccagcaaccccaaggtccag gtggaggccatcgaagggggagccctgcagaagctgctggtcatcctggccacggagcag ccgctcactgcaaagaagaaggtcctgtttgcactgtgctccctgctgcgccacttcccc tatgcccagcggcagttcctgaagctcggggggctgcaggtcctgaggaccctggtgcag gagaagggcacggaggtgctcgccgtgcgcgtggtcacactgctctacgacctggtcacg gagaaggatgcaggattgcacccctcctcagtttcctctgccagccccaatcagccccac gacttcctgctgccgcaggaagcacagatgttcgccgaggaggaggctgagctgacccag gagatgtccccagagaagctgcagcagtatcgccaggtacacctcctgccaggcctgtgg gaacagggctggtgcgagatcacggcccacctcctggcgctgcccgagcatgatgcccgt gagaaggtgctgcagacactgggcgtcctcctgaccacctgccgggaccgctaccgtcag gacccccagctcggcaggacactggccagcctgcaggctgagtaccaggtgctggccagc ctggagctgcaggatggtgaggacgagggctacttccaggagctgctgggctctgtcaac agcttgctgaaggagctgagatga