GENSCAN 1.0    Date run: 8-Nov-116    Time: 15:15:52

Sequence gi568815584f:89863115_90080632 : 217518 bp : 41.66% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +   4132   4326  195  0  0   85   43   156 0.924   7.23
 1.02 PlyA +   5057   5062    6                               1.05

 2.05 PlyA -   7996   7991    6                              -0.45
 2.04 Term -  11456  10472  985  1  1   20   32   436 0.459  21.74
 2.03 Intr -  12130  11958  173  1  2   23   62   100 0.272  -1.28
 2.02 Intr -  13708  13562  147  1  0   44   88    88 0.114   3.91
 2.01 Init -  15944  15882   63  2  0   72   98    26 0.165   3.20
 2.00 Prom -  25147  25108   40                              -5.95

 3.00 Prom +  27198  27237   40                              -8.25
 3.01 Sngl +  28680  29330  651  2  0  111   41   639 0.963  55.42
 3.02 PlyA +  30639  30644    6                               1.05

 4.00 Prom +  38904  38943   40                              -4.25
 4.01 Init +  47691  47781   91  2  1   71   60    61 0.781   2.30
 4.02 Term +  50745  50866  122  1  2  107   38    99 0.934   4.46
 4.03 PlyA +  50890  50895    6                               1.05

 5.08 PlyA -  52423  52418    6                               1.05
 5.07 Term -  55685  55641   45  2  0  112   40    28 0.102  -3.37
 5.06 Intr -  56065  55819  247  0  1   83   71    98 0.171   4.14
 5.05 Intr -  67521  67435   87  2  0   73   91    37 0.268   0.67
 5.04 Intr -  68517  68427   91  1  1   50  110    23 0.438  -1.17
 5.03 Intr -  69513  69412  102  1  0   93   93   101 0.817  10.33
 5.02 Intr -  87028  86983   46  1  1  116   50    25 0.103  -1.44
 5.01 Init -  88444  88307  138  1  0   63   38   136 0.532   6.29
 5.00 Prom -  90382  90343   40                              -5.55

 6.00 Prom +  90771  90810   40                              -9.15
 6.01 Init +  94376  94515  140  1  2   72   36   116 0.109   4.46
 6.02 Intr +  99994 100559  566  1  2  101   90   415 0.458  34.51
 6.03 Intr + 103033 103076   44  2  2   79  109    16 0.821  -0.16
 6.04 Intr + 104253 104308   56  2  2   78  109    67 0.971   4.66
 6.05 Intr + 108061 108157   97  1  1   95  115    66 0.954   9.09
 6.06 Intr + 112667 112701   35  1  2  104   87    18 0.652  -0.60
 6.07 Intr + 121402 121569  168  1  0   87   53   125 0.604   7.04
 6.08 Intr + 122018 122096   79  2  1   90   87    13 0.598   0.03
 6.09 Intr + 125791 125976  186  0  0   99   97    61 0.906   7.06
 6.10 Term + 137035 137166  132  0  0   90   44    95 0.035   2.41
 6.11 PlyA + 138972 138977    6                               1.05

 7.00 Prom + 138984 139023   40                              -7.65
 7.01 Init + 141736 141907  172  0  1   68  116   107 0.779  11.15
 7.02 Intr + 153813 153909   97  1  1   63   42    82 0.009  -0.75
 7.03 Intr + 156202 156304  103  1  1  122   78    56 0.093   7.26
 7.04 Intr + 164205 164325  121  2  1  108   48    86 0.237   5.75
 7.05 Intr + 169183 169354  172  2  1   34   91   110 0.251   3.98
 7.06 Intr + 172773 172859   87  0  0   68   80    65 0.104   1.87
 7.07 Intr + 179773 179895  123  1  0   55  109    48 0.078   2.38
 7.08 Intr + 180892 181045  154  1  1   59   61    45 0.566  -1.95
 7.09 Intr + 181676 181805  130  1  1   47   40   162 0.861   6.65
 7.10 Intr + 189398 189656  259  0  1   -3   56   190 0.463   2.30
 7.11 Term + 193921 194107  187  1  1   44   43   161 0.662   3.18
 7.12 PlyA + 194222 194227    6                               1.05

 8.00 Prom + 195639 195678   40                              -4.55
 8.01 Init + 197151 197235   85  2  1   58   78    66 0.595   3.63
 8.02 Intr + 198755 198977  223  0  1   23   40   236 0.167   8.56
 8.03 Intr + 199112 199425  314  2  2   30   97   462 0.189  36.20
 8.04 Term + 200835 200947  113  1  2   11   52   112 0.295  -2.36
 8.05 PlyA + 201899 201904    6                               1.05

 9.05 PlyA - 202080 202075    6                               1.05
 9.04 Term - 204980 204869  112  1  1  124   43   110 0.762   7.15
 9.03 Intr - 207855 207745  111  2  0   25   99    88 0.429   2.18
 9.02 Intr - 208521 208304  218  0  2   84    5   147 0.736   2.38
 9.01 Init - 210495 210331  165  0  0   64  -15   155 0.542   2.48
 9.00 Prom - 216341 216302   40                              -3.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_1|64_aa
SATLPYIGSPYIYYYLAYWQLKPKNQSRMTSANPWSSPNLTSTNGASAGANIMLMCGLFQ
NPEI
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_1|195_bp
tcagcgaccctgccttatataggcagtccctacatttattactatctggcgtattggcaa
ctcaaacctaaaaatcagtcacggatgacctcagccaatccctggtcctccccaaacctc
acctcaaccaatggagcttctgctggtgccaacatcatgttaatgtgtggccttttccag
aatccagaaatttag
>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_2|455_aa
MSRQKKGQNKAESTFWVNSNKEPGIVQVLQVTTLNKTCPAGKDLAVWESRCINGDWNSLN
FSHDRQRCQEGSSPAWQFLVDGVICLFNSDNEQDCGMLTSYANPEQSASPNFLEGQVAFS
HLRLSNNRCGLELELIFKREAEHKKIGNLQHDSAIEKKNPFSGEKFKLVAEICINNKEPN
VNHQDNGKNVFRARQRPLCQPLPSQAQRPRRKKWFCGLGPGPCCFVQSQDLVPCIPSVAK
RGQYTAETIASEGVSPKHWRLTCGVGPVGTQKSRTEVWEPLPRFQRMYGNTWISRQRCAA
GVEPSWRTSARAMQKGNVGLEPPHRVPTGTSGAVRRGPPSSRPQNDRPTDSLDRVPGKAT
NTQCHPVKAARRGAVPCKATGAELPKAVETHLLHQRVLDVRPGVKGDHFGVLRFGCPTGF
RSCMGPVAPLFWPISPIWNECVYPMPVPPLYLGSN
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_2|1368_bp
atgtctcggcaaaagaaaggtcaaaacaaggcagaaagtacattttgggttaattcaaac
aaggagcctggcattgtgcaggtgctccaagttacaacactgaacaagacatgtcctgca
ggcaaggaccttgcagtctgggagagcagatgcataaacggggattggaattcacttaac
tttagccatgacaggcagcgctgccaagagggcagttcccctgcatggcagttcttagtt
gatggagtaatttgtctgtttaattccgataacgaacaagactgtggcatgctaactagt
tacgcgaaccccgagcagtcagcatcccccaacttcttagagggacaagtggcattcagc
cacctgagattgagcaataacagatgtggtttggaattggaacttatatttaaaagggaa
gcagagcataaaaaaattggaaatctgcagcatgacagtgcaatagaaaagaaaaaccca
ttttctggggagaaattcaagctggttgcagaaatttgcataaataacaaggagccaaat
gttaatcaccaagacaacgggaaaaatgttttcagggcacgtcagagacccttatgccag
cccctcccatcgcaggcccagaggcctaggaggaaaaaatggttttgtgggctgggccca
ggaccttgctgctttgtgcagtctcaggacttggtgccctgcatcccatctgtggctaaa
aggggccaatatacagctgaaaccattgcttcagagggtgtaagccccaagcattggagg
cttacatgtggtgttggacctgtgggtacacagaagtcaagaactgaggtttgggaacct
ctgcctagatttcagaggatgtatggaaacacctggatatccaggcagaggtgtgctgca
ggggtagagccctcatggagaacctctgctagggcaatgcagaagggaaatgtggggttg
gagcctccacacagagtccccactgggactagtggagctgtgagaagagggccaccatcc
tccagaccccagaatgatagaccaactgacagcttggatcgtgtacctggaaaagccaca
aacactcaatgccatcctgtgaaagcagccaggaggggagctgtaccctgcaaagccaca
ggggcagagcttcccaaggctgtagaaacccacctcttacatcagcgtgtcctggatgtg
agacctggagtcaaaggagatcattttggagttttaagatttggctgccccactggattt
aggagttgcatggggcctgtagcccctttgttttggccaatttctcccatctggaatgag
tgtgtttatccaatgcccgtacccccattatatctaggaagtaactaa
>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_3|216_aa
MEAEGCRYQFRVALLGDAAVGKTSLLRSYVAGAPGAPQPKPEPTVGAECYRRALQLRAGP
RVKLQLWDTAGHERFSCITGSFYRNVVGVLLVFDVTNRKSFEHIQDWHQEVMATQGPDKV
IFLLVGHKSDLQSTRCVSAQEAEELAASLGMAFVETSVKNNCNVDLAFDALTDAIQQALQ
QGDIKLEEGLGGVRLIHKTQIPRSPSRKQHPGPCQC
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_3|651_bp
atggaggccgagggctgccgctaccaatttcgggtcgcgctgctgggggacgcggcggtg
ggcaagacgtcgctgctgcggagctacgtggcgggcgcgcctggcgccccacagcccaag
cccgagcccacggtgggcgccgagtgctaccgccgcgcgctgcagctgcgggctgggccg
cgggtcaagctgcagctctgggacaccgcgggccacgagcgcttcagctgcatcaccggg
tccttttaccggaatgtggtgggtgtcctgctggtctttgatgtgacaaacaggaagtcc
tttgaacacatccaagactggcaccaggaggtcatggccactcagggcccggacaaggtc
atcttcctgctggttggccacaagagtgacctgcagagcacccgctgtgtctcagcccag
gaggccgaggagctagctgcctccctgggcatggccttcgtggagacctctgttaaaaac
aactgcaatgtggacctggcctttgacgccctcactgatgctatccagcaggccctgcag
cagggggacatcaagctagaagagggcttggggggtgtccggctcatccacaagacccaa
atccccaggtcccccagcaggaagcagcacccaggcccatgccagtgttga
>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_4|70_aa
MAEEGERKDSEKLSYFYLFRINMSLMIGGQDQGCLSIPAKQKASCSLSLQLSEASSGSLG
IHLCMPETQQ
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_4|213_bp
atggcagaggagggagaacgaaaagattcagaaaagttaagctacttctatctctttaga
attaacatgagcctaatgattggaggacaggatcagggctgcttatcaattcctgccaag
cagaaagcatcctgctccttgagtctgcagctctcagaagcatcttcaggctctctggga
attcatctttgtatgccggagacccagcaataa
>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_5|251_aa
MNETQSLLRSSSEFNIERHKTVIDTKKEKRLKNKVKGAKKEGLILVIEVDSVMSSINPNT
SGILLEGFLNIVRKKKEAQRYRNEVRHIFTAFDTYYRGFLTLEDFKKAFRQVAPKLPERT
VLEVFRWLLKFQQSNIKFQQSNLNSSWLGGKESRSVMSEKSLNSAIHLPGSSCLDPLVPA
LIPEWDHVTVQAETDLYVTKVFGLPESVFCTQFPHPGPVSFQNAFHFMQRSGMPVTQNSE
FFSVQYPKPAH
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_5|756_bp
atgaatgaaacacagtctctacttcgaagcagttcagagtttaatatagaaagacataaa
actgtgatagataccaagaaagagaagagattgaaaaacaaagtgaagggagcaaagaag
gaaggcctaattctagtgatagaagtggattctgtgatgtcttcaataaatccaaatact
tctggtatattactcgaggggtttttaaatattgtcaggaaaaagaaggaagctcaacga
tatcggaacgaagtaagacacatcttcacagcctttgacacctactatcgtggattttta
actttggaagatttcaaaaaagcatttaggcaggtggctcccaaattaccggaaaggact
gttcttgaggtattcagatggttgcttaagtttcagcagtcaaatattaagtttcagcag
tcaaatctgaattccagctggttaggaggaaaagaaagtagaagtgtgatgtctgagaaa
tctctaaactctgctatccaccttcctggatcctcttgcttagatcccctggtgccggcc
ttgatcccagaatgggaccatgtcacagtacaggcagaaactgatctctatgtgaccaaa
gtgtttgggctaccagaatctgtcttctgtactcagttcccccaccctggtcctgtgtca
ttccagaatgccttccatttcatgcagagatctggtatgcctgtcactcagaactcagag
ttcttctcagtacaataccccaagccagctcattag
>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_6|500_aa
MFAYYWPHRNSIEEHGLQAQLMVFKLNEKEQLEVKEKCPSWVSYSGESIMSQEGDYGRWT
ISSSDESEEEKPKPDKPSTSSLLCARQGAANEPRYTCSEAQKAAHKRKISPVKFSNTDSV
LPPKRQKSGSQEDLGWCLSSSDDELQPEMPQKQAEKVVIKKEKDISAPNDGTAQRTENHG
APACHRLKEEEDEYETSGEGQDIWDMLDKGNPFQFYLTRVSGVKPKYNSGALHIKDILSP
LFGTLVSSAQFNYCFDVDWLVKQYPPEFRKKPILLVHGDKREAKAHLHAQAKPYENISLC
QAKLDIAFGTHHTIWLSPLYPRIADGTHKSGESPTHFKADLISYLMAYNAPSLKEWIDVI
HKHDLSETNVYLIGSTPGRFQGSQKDNWGHFRLKKLLKDHASSMPNAESWPVVGQFSSVG
SLGADESKWLCSEFKESMLTLGKESKTPGKSSVPLYLVLPVVVCCIPHGMLLGDQQEDLS
QEGPNSGPPRIVSLLSNLIN
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_6|1503_bp
atgtttgcctattactggccacacaggaactcaattgaggagcatggcctccaggcccag
ctgatggtctttaagctcaatgagaaggagcagctggaggtgaaagagaaatgccccagc
tgggtctcatatagtggggagagtataatgtctcaggaaggcgattatgggaggtggacc
atatctagtagtgatgaaagtgaggaagaaaagccaaaaccagacaagccatctacctct
tctcttctctgtgccaggcaaggagcagcaaatgagcccaggtacacctgttccgaggcc
cagaaagctgcacacaagaggaaaatatcacctgtgaaattcagcaatacagattcagtt
ttacctcccaaaaggcagaaaagcggttcccaggaggacctcggctggtgtctgtccagc
agtgatgatgagctgcaaccagaaatgccgcagaagcaggctgagaaagtggtgatcaaa
aaggagaaagacatctctgctcccaatgacggcactgcccaaagaactgaaaatcatggc
gctcccgcctgccacaggctcaaagaggaggaagacgagtatgagacatcaggggagggc
caggacatttgggacatgctggataaagggaaccccttccagttttacctcactagagtc
tctggagttaagccaaagtataactctggagccctccacatcaaggatattttatctcct
ttatttgggacgcttgtttcttcagctcagtttaactactgctttgacgtggactggctc
gtaaaacagtatccaccagagttcaggaagaagccaatcctgcttgtgcatggtgataag
cgagaggctaaggctcacctccatgcccaggccaagccttacgagaacatctctctctgc
caggcaaagttggatattgcgtttggaacacaccacacaatatggttgagccccttatac
ccacgaattgctgatggaacccacaaatctggagagtcgccaacacattttaaagctgat
ctcatcagttacttgatggcttataatgccccttctctcaaggagtggatagatgtcatt
cacaagcacgatctctctgaaacaaatgtttatcttattggttcaaccccaggacgcttt
caaggaagtcaaaaagataattggggacattttagacttaagaagcttctgaaagaccat
gcctcatccatgcctaacgcagagtcctggcctgtcgtaggtcagttttcaagcgttggc
tccttgggagccgatgaatcaaagtggttatgttctgagtttaaagagagcatgctgaca
ctggggaaggaaagcaagactccaggaaaaagctctgttcctctttacttggtgctgccc
gtggtggtctgctgcattcctcatgggatgcttcttggagaccagcaggaggatctatct
caggaaggacccaactcaggcccacccaggattgtttcccttttgagtaacttaattaat
tga
>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_7|534_aa
MLLGFLELINSPKEAAPPAISGLSGNLPCAMQHQCFRLQKDKNHSQDTVYYGTQENKVMK
HHSVSSSNNTNHSTWERKQPSPVEKRTMEDANLSKAAWGALEKNGTQLMIRSYELGVLFL
PSAFATAGFLPTCQGLCSSALKPSAAIFGPRDDVRTPLCDIKVHYISTVGYVPIEDLKGG
YVCGWVDRRNERQTDKQNARMIHEGAGVGQCMEARGGTSVMCPNLQRTKKAKMEALEEPE
KCSKGPFLPVNVLIGAFAKSCCLKGENLLGLVQKILYHSLESQAHKCFYAISPLQLRIVL
TVLTAAFHSFQLQLYEASPLSLAPCWLPHCLQTGPTKPKHLPRIRLKPFVTLPSMYENIT
TRHQMSSTAMKGNRESPGAGSERRHHVVNLQAAIVKLFFDVFENVIIQCWGKPGDFRFLA
EPKSSYWTNSLANKNCKLWKKTFFKNQLLKDTEIDQKQADTGEESVEEGHGTVVEPGIQA
AGWKRSSRDSGLVRVPASPFSSLSPNKALRYSPYKPSANLNFHGRRTDKDPIFS
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_7|1605_bp
atgctgttaggctttctggagctgatcaacagccccaaggaggctgcccctccagctatc
tcaggtcttagtggaaaccttccttgtgctatgcagcaccaatgcttcaggctccaaaaa
gacaagaatcattcccaggacactgtgtactatgggactcaggaaaacaaagtgatgaag
catcactcggtatccagcagcaacaacacaaaccacagcacttgggagagaaagcagccc
agtccagtagaaaaaagaacaatggaagacgcaaatctgtccaaggctgcctggggagca
ttggagaagaatggcacccagctgatgatccgctcctacgagctcggggtccttttcctc
ccttcagcatttgctactgctgggtttctacccacatgtcagggtctgtgttcctctgcc
cttaaaccctcagcagcaatctttggcccacgagatgatgtccggactcctttgtgtgac
atcaaggtccactacatcagcactgttgggtatgtgcctattgaggatttgaagggtgga
tatgtgtgtggatgggtggacagaaggaatgagagacagacggacaagcagaatgccagg
atgatccacgaaggggcaggagttggtcaatgtatggaagcacgagggggcacatcggtc
atgtgtcctaatcttcaaaggactaagaaggcaaaaatggaagcactagaagagcctgaa
aaatgttccaagggccctttccttccagtaaatgtcctaataggagccttcgctaaaagc
tgttgtctgaagggggagaatcttctgggcttggttcagaaaatactctaccacagttta
gagagtcaagcacataagtgtttttatgccatcagccccctgcagctaagaattgtattg
actgtcctcacagcggcttttcatagctttcagcttcagctttacgaggcttctcctctc
tccctggcaccctgctggctgcctcactgcttacagacaggtcccaccaaacccaaacac
ctgcctaggataaggcttaagccttttgtgaccctaccctctatgtacgaaaacattaca
actcgacatcagatgagctcaacagccatgaaaggaaacagggagagccctggggctggt
agtgaaaggcgacatcatgtggtcaatttacaagctgccattgtaaaattattctttgat
gtgtttgaaaatgtcattatacaatgttgggggaaaccaggtgacttcaggtttttggca
gagccaaagagttcctattggaccaattctttagcaaataaaaactgtaagctctggaaa
aaaaccttctttaaaaaccaacttcttaaagatactgaaatagaccaaaagcaggcagat
accggagaggagtcagtggaggaagggcacgggactgttgtggaacctgggattcaagct
gccggctggaagcgctctagcagggactctggcctagtaagagtccctgcttctcccttt
tcttccctttcacccaataaagccctgcgttactcaccctacaaaccctctgctaaccta
aattttcatggccgtaggacggacaaggaccccatctttagctga
>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_8|244_aa
MHSPELCFFIRNHTVLLNRAPSPSVSAPAPSRRELCARSPGPSALRRTPRRAGGVEGNGG
DKADLKLGVKEDVGSWRHRSYRGGGRGSLAACGRARRSWASRRWGPGHLNEDNARFLLLA
ALIVLYLLGGAAVFSALELAHERQAKQRWEERLANFSRGHNLSRDELRGFLRHYEEATRA
GIRVDNVRPRWDFTGAFYFVGTVVSTIAGHDGTTITGYQAGNLAADQNGPKIGEGGITPP
WFSP
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_8|735_bp
atgcactcgccagagctgtgtttctttattcgaaatcacacggttttactcaacagggct
cctagcccttctgtctctgctcctgcgccaagcagacgtgaactctgcgcccgttccccg
ggaccttcagcgttaaggagaactccgaggagagcagggggagtggaggggaacggcggc
gacaaagcggatttgaaactaggagtcaaggaggacgtggggagctggcgccacaggagc
taccgaggcggcggccgggggagcctcgcggcctgcgggagagcccggcggtcatgggcg
agccggcgctggggcccgggccacctgaacgaggacaacgcgcgctttctgctgctggcc
gcgctcatcgtgctctacctgctgggcggcgccgccgtcttctccgcgctggagctggcg
cacgagcgccaggccaagcagcgctgggaggagcgcctggccaacttcagccgcggccac
aacctgagccgcgacgagctgcgcggcttcctccgccactacgaggaggccactcgggcc
ggcatccgcgtggacaacgtccgcccgcgctgggacttcaccggcgccttctacttcgtg
ggcaccgtcgtttccaccatagcaggccatgatggcaccaccataactgggtatcaggca
ggaaaccttgcagcagaccagaatgggcccaagattggtgaaggaggaataacaccccca
tggttctccccctga
>gi568815584f:89863115_90080632|GENSCAN_predicted_peptide_9|201_aa
MNPTGSGEHSSKTTALADLQGPCLVIFISCLKGEKTQDTENLGANMPRVNRQSQEGKRLL
LAGSTTRTVLYCGESRKYAERKLNDVAEVKSILITGSTERCGEMGPGGTERQRADGWEKV
GEEFRDKRKQIIEQIQDSQNGLSELPHEILEAGRGMALSPYPEWRAILEACAEEEEPRDR
RGLKRWGNTETTARQHHQHPQ
>gi568815584f:89863115_90080632|GENSCAN_predicted_CDS_9|606_bp
atgaaccccacgggctccggggagcacagttcgaaaaccacagccctggctgatcttcag
ggcccatgccttgtgatcttcatcagctgcttaaaaggagagaagacccaagacacagag
aaccttggtgccaacatgccaagagttaataggcaaagtcaggaagggaaaagattgttg
cttgctgggtcaactaccaggactgtcctttactgtggggaatccaggaaatatgctgag
agaaagctaaatgatgttgcagaagtgaagtcaatcctgataacagggagcacagagaga
tgtggggagatgggaccaggagggacagaaagacagagggctgatggttgggagaaagtt
ggggaggagtttagggataagaggaaacaaataattgaacaaatccaggattctcagaat
ggactgtctgagcttccccacgaaattttagaagctggaagaggtatggccctctcaccc
tatcctgagtggagagcaatcctggaagcatgtgctgaggaagaggagccaagagacaga
agaggcctgaagcgctggggcaacacagagacaaccgcccggcagcatcaccagcaccca
cagtga