GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:02:33 Sequence gi568815597f:204012916_204226131 : 213216 bp : 49.53% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.08 PlyA - 2924 2919 6 1.05 1.07 Term - 20952 20826 127 2 1 67 53 71 0.431 -0.74 1.06 Intr - 24169 24054 116 1 2 83 98 78 0.800 7.55 1.05 Intr - 24673 24490 184 0 1 128 59 -7 0.578 0.19 1.04 Intr - 27479 27350 130 0 1 42 46 130 0.586 3.95 1.03 Intr - 52203 52088 116 2 2 81 6 105 0.007 1.69 1.02 Intr - 52821 52789 33 2 0 113 94 28 0.898 3.24 1.01 Init - 54255 54197 59 0 2 93 64 24 0.885 0.67 1.00 Prom - 57742 57703 40 -7.26 2.04 PlyA - 59412 59407 6 1.05 2.03 Term - 60367 60189 179 2 2 87 44 146 0.025 7.95 2.02 Intr - 62057 61843 215 1 2 35 37 281 0.962 16.06 2.01 Init - 62677 62346 332 1 2 75 66 100 0.758 3.28 2.00 Prom - 72681 72642 40 -4.86 3.00 Prom + 72757 72796 40 -6.26 3.01 Init + 75893 76013 121 1 1 96 68 74 0.494 6.55 3.02 Intr + 82159 82208 50 2 2 106 98 28 0.148 4.00 3.03 Intr + 86060 86151 92 1 2 59 59 64 0.003 -0.61 3.04 Intr + 100000 100219 220 1 1 134 52 113 0.860 10.60 3.05 Intr + 101406 101517 112 2 1 52 97 102 0.956 7.45 3.06 Intr + 101604 101690 87 1 0 101 67 57 0.958 4.84 3.07 Intr + 103592 103764 173 0 2 10 96 296 0.985 22.26 3.08 Intr + 104207 104275 69 1 0 115 91 152 0.839 17.58 3.09 Intr + 104678 104792 115 1 1 74 109 -15 0.411 -0.78 3.10 Intr + 108985 109070 86 2 2 86 86 94 0.943 8.54 3.11 Intr + 109322 109484 163 1 1 97 94 148 0.998 15.85 3.12 Intr + 109939 110048 110 2 2 116 86 148 0.998 17.40 3.13 Intr + 110197 110293 97 0 1 71 77 168 0.997 13.58 3.14 Intr + 110746 110889 144 2 0 97 87 258 0.998 26.85 3.15 Intr + 111726 111942 217 1 1 22 60 477 0.997 35.86 3.16 Term + 112943 113219 277 2 1 111 42 437 0.998 36.33 3.17 PlyA + 114806 114811 6 1.05 4.11 PlyA - 116194 116189 6 -0.45 4.10 Term - 117049 116949 101 2 2 97 52 33 0.512 -1.11 4.09 Intr - 117621 117431 191 2 2 67 40 147 0.416 7.03 4.08 Intr - 119341 119273 69 0 0 151 -6 71 0.472 2.30 4.07 Intr - 121673 121600 74 2 2 114 109 113 0.978 14.20 4.06 Intr - 124334 124189 146 0 2 94 105 199 0.999 22.20 4.05 Intr - 127203 127120 84 1 0 110 94 173 0.995 19.89 4.04 Intr - 128542 128400 143 0 2 47 86 160 0.952 11.70 4.03 Intr - 133849 133727 123 0 0 97 87 182 0.978 18.70 4.02 Intr - 137047 136788 260 1 2 73 94 500 0.979 45.36 4.01 Init - 138853 138680 174 1 0 76 69 352 0.942 31.45 4.00 Prom - 140132 140093 40 -9.65 5.10 PlyA - 140377 140372 6 1.05 5.09 Term - 142262 142101 162 2 0 45 54 321 0.999 22.34 5.08 Intr - 143003 142905 99 2 0 44 94 115 0.991 8.01 5.07 Intr - 143404 143263 142 0 1 96 58 132 0.354 11.36 5.06 Intr - 146680 146484 197 1 2 126 75 291 0.992 29.81 5.05 Intr - 147594 147400 195 0 0 112 25 78 0.913 3.51 5.04 Intr - 147763 147645 119 2 2 69 103 178 0.999 17.58 5.03 Intr - 148500 148377 124 0 1 129 69 188 0.995 21.16 5.02 Intr - 149248 149098 151 0 1 43 86 188 0.950 14.26 5.01 Init - 150873 150767 107 0 2 73 100 148 0.999 14.19 5.00 Prom - 151519 151480 40 -3.36 6.11 PlyA - 153127 153122 6 1.05 6.10 Term - 177882 177569 314 1 2 112 55 355 0.998 29.76 6.09 Intr - 181769 181720 50 1 2 88 96 6 0.158 -0.28 6.08 Intr - 183857 183781 77 2 2 116 111 -29 0.164 0.51 6.07 Intr - 186343 186280 64 0 1 128 103 68 0.994 11.02 6.06 Intr - 188896 188718 179 1 2 124 78 169 0.995 18.22 6.05 Intr - 190072 189981 92 2 2 88 92 147 0.826 14.81 6.04 Intr - 200276 200264 13 2 1 83 91 10 0.026 -4.85 6.03 Intr - 201519 201262 258 0 0 73 80 140 0.772 9.36 6.02 Intr - 205448 205314 135 2 0 107 69 2 0.436 0.96 6.01 Intr - 205759 205706 54 1 0 101 60 32 0.476 0.88 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 52203 52056 148 2 1 81 33 136 0.961 4.77 S.002 Sngl - 60693 60322 372 0 0 60 49 341 0.974 21.53 S.003 Term - 61744 61692 53 2 2 110 53 35 0.939 -0.21 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:204012916_204226131|GENSCAN_predicted_peptide_1|254_aa MAQPVGLLLRGKHCLQADLREGPLGHTSKGRAYLMLRTVPKDQVSALDGHLTVKKLGGFC IPSHQDVVESVGNILEETKRTQNNTEQASRAINSPLQSPYTDSMKALAISSHWFLPQIHT LPPITKATRHRWPPACCGRRGGQTTLPPLTPIARACESMERSCPGNYPMQHERKQTLSET TMDPRDATVKETQPFNTWQGRDTSTVIASPSGQCKEATSQEMLSVPGSWKGNEMDFPQNL QGEYGTASTLIFTQ >gi568815597f:204012916_204226131|GENSCAN_predicted_CDS_1|765_bp atggcacagcccgtggggcttcttctcagaggcaagcactgccttcaggctgacctgaga gaggggccactgggccacaccagcaaaggcagggcttaccttatgctaagaactgtaccc aaagatcaagtctctgccctggatggacacctcacggtcaaaaagcttggtggcttctgc atcccttctcatcaggacgtggtggaaagtgttggcaatatccttgaagagactaaaaga actcagaacaacactgaacaagccagcagagccatcaactcacccctacagtctccatac acagactccatgaaagcattggccatcagctctcactggtttctgccccaaattcacaca ctccctcctatcaccaaagccacccgccaccgctggcctcctgcatgctgtgggaggaga ggagggcagacaacgttgcctcccctgactcccatcgcgcgggcttgtgagtccatggag cgcagttgtccagggaactatccgatgcagcatgagaggaagcaaacactcagtgagaca accatggaccctagggatgcaacagtgaaggagacgcagcctttcaacacctggcagggc agagacacttcaacagtcattgccagccccagtggacaatgtaaggaggccaccagccaa gaaatgctgtcagtgcctggaagctggaaaggcaatgaaatggattttccccagaacctc cagggagaatatggcacagcttctaccttgattttcacccagtga >gi568815597f:204012916_204226131|GENSCAN_predicted_peptide_2|241_aa MPSKPVSPKRSRHASFKVGCITLPISHAGKSKPTLLIDGELESKCAESQHLVKRFLMKHS GNRGQPQALTRNCAEAQITESGSRGARERGRGRAGGSGLAPRRTSPKAQPRPLHQRQRQR QRRGVLPGNSQRVKANFPGKLFGFLQASVTEAAAKSASGVFAAAAGPPGREPPLQRSGSS RKAPRLDSESGPTRLTAELWAQTWARRFWARVPGPVLTQFSSKPGLEKGARGSLLRGDWT A >gi568815597f:204012916_204226131|GENSCAN_predicted_CDS_2|726_bp atgccttccaaaccagtatcccccaaacgatcacgacatgccagtttcaaagttgggtgc ataactctgcccatttcgcatgcggggaagtccaaacccacgctgctcatcgatggggaa ctggaatccaaatgtgctgaatcccagcatcttgtgaagcgctttctaatgaagcacagt ggaaacaggggtcagcctcaggccctgactcgcaactgcgccgaggcgcagattacggag tcagggagccggggagccagggagcgggggagagggagagcgggaggtagcgggctggcg ccgcgtcggacgagtccgaaagcacaaccgcgaccccttcaccagcgccagcgccagcgc cagcgccgaggcgtcctgcccggaaattcccagagagtcaaagcaaacttcccaggaaaa ctcttcggcttcctccaagcctctgtcaccgaggcggctgccaagagcgccagcggagtc tttgcagctgccgcgggccctccagggcgggagccgccgctccagcgctcggggtcctcc cggaaagcgcctcggctcgattccgagtcagggcctacgaggctgaccgctgagctctgg gctcagacctgggctcgccggttctgggcccgggttccaggacccgtgctcacacagttc agttccaagccgggactggagaagggcgctcgcggctcgctgctgcgaggcgactggaca gcctag >gi568815597f:204012916_204226131|GENSCAN_predicted_peptide_3|710_aa MPGTQQVPIKCEVKAQLEFGQIHTVQAPRDLAFELMKGDAGGEGIRSKPPSTALSRTVDY YFPCIIRTRAKGLVPLTSIVGAALAQQRMSMRSPISAQLALDGVGTMVNCTIKSEEKKEP CHEAPQGSATAAEPQPGDPARASQDSADPQAPAQGNFRGSWDCSSPEGNGSPEPKRPGVS EAASGSQEKLDFNRNLKEVVPAIEKLLSSDWKERFLGRNSMEAKDVKGTQESLAEKELQL LVMIHQLSTLRDQLLTAHSEQKNMAAMLFEKQQQQMELARQQQEQIAKQQQQLIQQQHKI NLLQQQIQQVNMPYVMIPAFPPSHQPLPVTPDSQLALPIQPIPCKPVEYPLQLLHSPPAP VVKRPGAMATHHPLQEPSQPLNLTAKPKAPELPNTSSSPSLKMSSCVPRPPSHGGPTRDL QSSPPSLPLGFLGEGDAVTKAIQDARQLLHSHSGALDGSPNTPFRKDLISLDSSPAKERL EDGCVHPLEEAMLSCDMDGSRHFPESRNSSHIKRPMNAFMVWAKDERRKILQAFPDMHNS SISKILGSRWKSMTNQEKQPYYEEQARLSRQHLEKYPDYKYKPRPKRTCIVEGKRLRVGE YKALMRTRRQDARQSYVIPPQAGQVQMSSSDVLYPRAAGMPLAQPLVEHYVPRSLDPNMP VIVNTCSLREEGEGTDDRHSVADGEMYRYSEDEDSEGEEKSDGELVVLTD >gi568815597f:204012916_204226131|GENSCAN_predicted_CDS_3|2133_bp atgcctggcactcagcaggtacccattaaatgtgaagtgaaagctcaattagagttcgga caaatacatactgtgcaggctcctcgggacctggcttttgagctcatgaagggagatgct ggcggggagggcatcaggagtaaacctccttccacggccctgagccgtacagttgactat tacttcccttgcatcatccgtaccagggccaagggcctggttcccttgacttccattgtg ggagcagcactggcccagcagaggatgtccatgaggagccccatctctgcccagctggcc ctggatggcgttggcaccatggtgaactgcaccatcaagtcagaggagaagaaagagcct tgccacgaggccccccagggctcagccactgccgctgaacctcagcctggagacccagcc cgggcctcccaggatagtgctgacccccaagctccagcccaggggaatttcaggggctcc tgggactgtagctctccagagggtaatgggtccccagaacccaagagaccaggagtgtcg gaggctgcctctggaagccaggagaagctggacttcaaccgaaatttgaaagaagtggtg ccagccatagagaagctgttgtccagtgactggaaggagaggtttctaggaaggaactct atggaagccaaagatgtcaaagggacccaagagagcctagcagagaaggagctccagctt ctggtcatgattcaccagctgtccaccctgcgggaccagctcctgacagcccactcggag cagaagaacatggctgccatgctgtttgagaagcagcagcagcagatggagcttgcccgg cagcagcaggagcagattgcaaagcagcagcagcagctgattcagcagcagcataagatc aacctccttcagcagcagatccagcaggttaacatgccttatgtcatgatcccagccttc cccccaagccaccaacctctgcctgtcacccctgactcccagctggccttacccattcag cccattccctgcaaaccagtggagtatccgctgcagctgctgcacagcccccctgcccca gtggtgaagaggcctggggccatggccacccaccaccccctgcaggagccctcccagccc ctgaacctcacagccaagcccaaggcccccgagctgcccaacacctccagctccccaagc ctgaagatgagcagctgtgtgccccgcccccccagccatggaggccccacgcgggacctg cagtccagccccccgagcctgcctctgggcttccttggtgaaggggacgctgtcaccaaa gccatccaggatgctcggcagctgctgcacagccacagtggggccttggatggctccccc aacacccccttccgtaaggacctcatcagcctggactcatccccagccaaggagcggctg gaggacggctgtgtgcacccactggaggaagccatgctgagctgcgacatggatggctcc cgccacttccccgagtcccgaaacagcagccacatcaagaggcccatgaacgccttcatg gtgtgggccaaggatgagcggaggaagatcctgcaagccttcccagacatgcacaactcc agcatcagcaagatccttggatctcgctggaagtccatgaccaaccaggagaagcagccc tactatgaggaacaggcgcggctgagccggcagcacctggagaagtatcctgactacaag tacaagccgcggcccaagcgcacctgcatcgtggagggcaagcggctgcgcgtgggagag tacaaggccctgatgaggacccggcgtcaggatgcccgccagagctacgtgatccccccg caggctggccaggtgcagatgagctcctcagatgtcctgtaccctcgggcagcaggcatg ccgctggcacagccactggtggagcactatgtccctcgtagcctggaccccaacatgcct gtgatcgtcaacacctgcagcctcagagaggagggtgagggcacagatgacaggcactcg gtggctgatggcgagatgtaccggtacagcgaggacgaggactcggagggcgaagagaag agcgatggggagttggtggtgctcacagactga >gi568815597f:204012916_204226131|GENSCAN_predicted_peptide_4|454_aa MEEKAAASASCREPPGPPRAAAVAYFGISVDPDDILPGALRLIQELRPHWKPEQVRTKRF TDGITNKLVACYVEEDMQDCVLVRVYGERTELLVDRENEVRNFQLLRAHSCAPKLYCTFQ NGLCYEYMQGVALEPEHIREPRLFRLIALEMAKIHTIHANGSLPKPILWHKMHNYFTLVK NEINPSLSADVPKVEVLERELAWLKEHLSQLESPVVFCHNDLLCKNIIYDSIKGHVRFID YEYAGYNYQAFDIGNHFNEFAGVNEVDYCLYPARETQLQWLHYYLQAQKGMAVTPREVQR LYVQVNKFALASHFFWALWALIQNQYSTIDFDFLRYAVIRFNQYFKVKPQASALEMPKLC TGQQLLQDCPPASKCPVPLGAAEATAAAACQVDSGCLAFACKKHPGAMATTGAGTCSWSF QGPDHHLYGKELTADIIPLPPLRKHSPSPSPPGE >gi568815597f:204012916_204226131|GENSCAN_predicted_CDS_4|1365_bp atggaggagaaggcggcggccagcgccagctgccgggagccgccgggccccccgagggcc gccgccgtcgcgtacttcggcatttccgtggacccggacgacatccttcccggggccctg cgcctcatccaggagctgcggccgcattggaaacccgagcaagttcggaccaagcgcttc acggatggcatcaccaacaagctggtggcctgctatgtggaggaggacatgcaggactgc gtgctggtccgggtgtatggggagcggacggagctgctggtggaccgggagaatgaggtc agaaacttccagctgctgcgagcacacagctgtgcccccaaactctactgcaccttccag aatgggctgtgctatgagtacatgcagggtgtggccctggagcctgagcacatccgtgag ccccggcttttcaggttaatcgccttagaaatggcaaagattcatactatccacgccaac ggcagcctgcccaagcccatcctctggcacaagatgcacaattatttcacgcttgtgaag aacgagatcaaccccagcctttctgcagatgtccctaaggtagaggtgttggaacgggag ctggcctggctgaaggagcatctgtcccagctggagtcccctgtggtgttttgtcacaat gacctgctctgcaagaatatcatctatgacagcatcaaaggtcacgtgcggttcattgac tatgaatatgctggctacaactaccaagcttttgacattggcaaccatttcaatgagttt gcaggcgtgaatgaggtggattactgcctgtacccggcgcgggagacccagctgcagtgg ctgcactactacctgcaggcacaaaaggggatggccgtgacccccagggaggtgcaaagg ctctacgtgcaagtcaacaagtttgccctggcgtctcacttcttctgggctctctgggcc ctcatccagaaccagtactccaccatcgactttgatttcctcaggtacgcagtgatccga ttcaaccagtacttcaaggtgaagcctcaagcgtcagccttggagatgccaaagctgtgc actgggcagcagctgctccaggactgcccccctgccagtaagtgcccagtgcccttaggg gcagcagaggcaacagcagcagcagcctgccaggtggactctggttgtttggcttttgcc tgcaaaaagcaccctggtgccatggcaaccactggagctggcacctgcagctggagcttt caaggtcctgaccaccatctgtatgggaaggaattaacagcagacatcatcccactgccg cctctgagaaagcattctccttcccccagtcctccaggggagtga >gi568815597f:204012916_204226131|GENSCAN_predicted_peptide_5|431_aa MPPATTVFTTVRHQFHKLYSVGCETQGKTELRPTGRIFLKRMPSIRESLKERGVDMARLG PEWSQPMKRLTLGNTTSSVILTNYMDTQYYGEIGIGTPPQTFKVVFDTGSSNVWVPSSKC SRLYTACVYHKLFDASDSSSYKHNGTELTLRYSTGTVSGFLSQDIITASPDPPGPTPAGR YTAAHLEPGELRNTLLCHIWLLLQSSSTYGGASLGYGLTVGWFVDVFASNLHVGGITVTQ MFGEVTEMPALPFMLAEFDGVVGMGFIEQAIGRVTPIFDNIISQGVLKEDVFSFYYNRVS VGSSTLLCEDGCLALVDTGASYISGSTSSIEKLMEALGAKKRLFDYVVKCNEGPTLPDIS FHLGGKEYTLTSADYVFQESYSSKKLCTLAIHAMDIPPPTGPTWALGATFIRKFYTEFDR RNNRIGFALAR >gi568815597f:204012916_204226131|GENSCAN_predicted_CDS_5|1296_bp atgccccctgcaaccactgtcttcaccaccgtacggcaccagttccacaagctgtacagt gtgggctgtgagacccaaggaaaaacagagctgaggcccacgggaaggatcttcctcaag agaatgccctcaatccgagaaagcctgaaggaacgaggtgtggacatggccaggcttggt cccgagtggagccaacccatgaagaggctgacacttggcaacaccacctcctccgtgatc ctcaccaactacatggacacccagtactatggcgagattggcatcggcaccccaccccag accttcaaagtcgtctttgacactggttcgtccaatgtttgggtgccctcctccaagtgc agccgtctctacactgcctgtgtgtatcacaagctcttcgatgcttcggattcctccagc tacaagcacaatggaacagaactcaccctccgctattcaacagggacagtcagtggcttt ctcagccaggacatcatcaccgcctctcctgaccctccagggcccacacctgcggggagg tacactgcagcccacttggagcctggggagctgaggaacaccctactctgccacatctgg ctgttgctgcaaagcagcagtacctatgggggagcaagcctgggctacgggctcaccgtt gggtggtttgtggatgtttttgcatctaacttgcatgtgggtggaatcacggtgacacag atgtttggagaggtcacggagatgcccgccttacccttcatgctggccgagtttgatggg gttgtgggcatgggcttcattgaacaggccattggcagggtcacccctatcttcgacaac atcatctcccaaggggtgctaaaagaggacgtcttctctttctactacaacagggtgtct gtggggtcatccaccttgctctgtgaagacggctgcctggcattggtagacaccggtgca tcctacatctcaggttctaccagctccatagagaagctcatggaggccttgggagccaag aagaggctgtttgattatgtcgtgaagtgtaacgagggccctacactccccgacatctct ttccacctgggaggcaaagaatacacgctcaccagcgcggactatgtatttcaggaatcc tacagtagtaaaaagctgtgcacactggccatccacgccatggatatcccgccacccact ggacccacctgggccctgggggccaccttcatccgaaagttctacacagagtttgatcgg cgtaacaaccgcattggcttcgccttggcccgctga >gi568815597f:204012916_204226131|GENSCAN_predicted_peptide_6|411_aa ELGKRGYATNTSIIISVQVFCGYQVLTLCVKGFTSASQFGILLWGPRVKKPGKFCGVGSG REEVNNRLCAQSLIASQRLACAIAESKKLDFQDEFSTFPKGCADANSSHTIKPSKKLEHP LDSNASTYNLSFAEQTRAAEHREVRGKGQRGETEIGVGITGFGIFFILFGTLLYFDSVLL AFGNLLFLTGLSLIIGLRKTFWFFFQRHKLKGTSFLLGGVVIVLLRWPLLGMFLETYGFF SLFKGFFPVAFGFLGNVCNIPFLGAIAHFHVVWGQDGSSGWEEGVEDGKSRLGSPGLPYP LVDSHASGQQLESLGLLAPGEQSLPCTERKPAATARLSRRGTSLSPPPESSGSPQQPGLS APHSRQIPAPQGAVLVQREKDLPNYNWNSFGLRFGKREAAPGNHGRSAGRG >gi568815597f:204012916_204226131|GENSCAN_predicted_CDS_6|1236_bp gagctggggaagcgagggtatgctaccaacaccagcatcatcatttcagttcaggtgttt tgtggttatcaggtattgacactctgtgtgaagggctttaccagtgcttctcaatttgga atcttactgtggggaccaagagtcaagaagcctggtaaattttgtggggtgggtagtgga agagaagaggtcaacaacaggctttgtgctcagtcgctcatcgcctcccagcgcctagca tgtgcaatagcagagagcaaaaagcttgactttcaagatgaatttagtacattcccaaag ggatgcgcagatgccaactcttcccacactatcaaaccaagcaagaaacttgagcacccc ctagattccaatgcttccacatacaatctaagtttcgcggagcagacgagagccgcagaa caccgcgaggtgcgaggaaagggtcagagaggggagacagagattggtgtggggatcacc ggtttcggcatcttcttcatcctctttggaacactcctgtactttgattccgtgctcctg gcctttggaaacctgctgttcctgacgggcctgtccctcatcattggcctgaggaagacc ttttggttcttcttccaacggcacaaactcaagggaaccagcttcctcctggggggtgtg gttatcgtgctcctacgctggcccctcctcggcatgttcctggaaacctacggattcttc agcctctttaagggctttttccctgtcgccttcggcttcctgggcaatgtctgcaacatc cccttcctgggtgcgatagcccatttccacgttgtttgggggcaggatgggtcctctggg tgggaagagggtgtggaggatggaaagagccggttgggttcccctggactcccatatccc cttgtggattcacatgcatccggccagcagctagaatccctgggcctcctggcccccggg gagcagagcctgccgtgcaccgagaggaagccagctgctactgccaggctgagccgtcgg gggacctcgctgtccccgccccccgagagctccgggagcccccagcagccgggcctgtcc gccccccacagccgccagatccccgcaccccagggcgcggtgctggtgcagcgggagaag gacctgccgaactacaactggaactccttcggcctgcgcttcggcaagcgggaggcggca ccagggaaccacggcagaagcgctgggcggggctga