GENSCAN 1.0 Date run: 5-Nov-116 Time: 22:03:13 Sequence gi568815597f:63425573_63672513 : 246941 bp : 38.65% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 11251 11343 93 2 0 119 62 150 0.173 14.52 1.02 Intr + 19996 20016 21 2 0 89 107 21 0.043 0.70 1.03 Term + 56283 56344 62 1 2 97 55 40 0.051 -1.41 1.04 PlyA + 56998 57003 6 1.05 2.03 PlyA - 57421 57416 6 1.05 2.02 Term - 76610 76408 203 0 2 64 44 126 0.500 2.37 2.01 Init - 77249 77171 79 2 1 83 47 46 0.462 1.27 2.00 Prom - 77426 77387 40 -3.65 3.03 PlyA - 78188 78183 6 1.05 3.02 Term - 80014 79140 875 2 2 13 42 409 0.669 20.58 3.01 Init - 92893 92848 46 1 1 63 75 66 0.341 3.70 3.00 Prom - 93976 93937 40 -8.45 4.00 Prom + 94305 94344 40 -7.25 4.01 Init + 97046 97141 96 1 0 77 70 42 0.466 1.66 4.02 Intr + 97295 97406 112 1 1 -7 43 131 0.468 -1.87 4.03 Term + 97516 97655 140 2 2 100 48 168 0.934 11.14 4.04 PlyA + 98525 98530 6 1.05 5.00 Prom + 99379 99418 40 -10.35 5.01 Init + 100001 100187 187 1 1 53 67 126 0.971 6.27 5.02 Intr + 106248 106459 212 1 2 52 116 91 0.984 6.01 5.03 Intr + 107098 107184 87 0 0 102 83 133 0.995 13.45 5.04 Intr + 107882 108077 196 1 1 86 63 109 0.857 6.27 5.05 Intr + 131556 131675 120 1 0 35 99 103 0.787 5.65 5.06 Intr + 136137 136285 149 1 2 37 88 173 0.975 11.23 5.07 Intr + 142738 142947 210 0 0 62 64 187 0.953 11.89 5.08 Term + 144937 144948 12 0 0 104 49 0 0.250 -5.07 5.09 PlyA + 145017 145022 6 1.05 6.03 PlyA - 145380 145375 6 1.05 6.02 Term - 149260 148950 311 2 2 60 37 176 0.158 3.94 6.01 Init - 159927 159918 10 0 1 51 116 -3 0.039 -0.54 6.00 Prom - 163113 163074 40 -1.15 7.00 Prom + 166146 166185 40 -4.45 7.01 Init + 167917 168162 246 0 0 117 74 347 0.878 33.84 7.02 Intr + 168641 168719 79 1 1 68 52 102 0.271 2.81 7.03 Term + 178337 178569 233 0 2 38 48 123 0.034 -1.05 7.04 PlyA + 180768 180773 6 1.05 8.03 PlyA - 181394 181389 6 1.05 8.02 Term - 185609 185382 228 2 0 138 41 90 0.704 4.95 8.01 Init - 189194 189156 39 2 0 76 76 34 0.332 1.24 8.00 Prom - 190690 190651 40 -1.85 9.00 Prom + 196125 196164 40 -5.05 9.01 Init + 197889 198188 300 2 0 51 47 250 0.795 14.50 9.02 Intr + 203853 204042 190 2 1 68 111 182 0.841 16.74 9.03 Intr + 204370 204618 249 2 0 105 75 108 0.987 7.69 9.04 Intr + 206085 206210 126 1 0 80 59 144 0.999 10.43 9.05 Intr + 209257 209447 191 2 2 28 47 238 0.860 12.08 9.06 Intr + 210662 210816 155 1 2 97 94 204 0.737 19.85 9.07 Intr + 222895 223080 186 1 0 1 99 211 0.291 11.38 9.08 Intr + 226061 226302 242 2 2 47 41 361 0.309 23.57 9.09 Intr + 226464 226685 222 1 0 46 103 99 0.348 4.38 9.10 Intr + 227863 227936 74 2 2 44 57 87 0.498 -0.59 9.11 Intr + 228760 228894 135 0 0 65 97 214 0.833 19.84 9.12 Term + 232522 232641 120 0 0 72 48 59 0.428 -2.21 9.13 PlyA + 234652 234657 6 1.05 10.04 PlyA - 234852 234847 6 1.05 10.03 Term - 236604 236292 313 2 1 34 43 174 0.731 0.99 10.02 Intr - 237903 237499 405 2 0 42 73 177 0.376 4.14 10.01 Init - 240566 240277 290 2 2 87 -4 235 0.448 10.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_1|58_aa XSYLSHHYGASDVDDCHTGSSSETTGLVFCIEIAGKTPDEKIEVQNGYKLIPAHKVSE >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_1|177_bp ntttcttatctcagtcatcactatggtgcttctgacgttgatgactgtcacactggatcc tcctcagaaactaccggacttgttttctgtattgagatagcaggcaaaacacctgatgag aaaatagaagtacagaatggttacaaacttatcccagcccataaagttagtgagtga >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_2|93_aa MDEAGNHHSQQISQGQKTKHRVFSLVGRFSGMKMNYILQTVADGSPDEFIWTTGYLAIYQ ASLTQSFSGHFRERIEDATVGKRRAYFLPRPAQ >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_2|282_bp atggatgaagctggaaaccatcattctcagcaaatatcgcaaggacaaaaaaccaaacac cgcgtgttctcactcgtaggtagattttcaggaatgaagatgaattatatattacaaact gttgccgatggcagtccagatgaatttatatggacaactggctacctggccatataccaa gccagcttgacccagtctttttcaggacacttcagagagagaatagaagatgctacggtt gggaagagaagagcatacttcttaccaaggccagctcaatag >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_3|306_aa MKEQNTTKDAIDIERELEKQEQTHSKASRRQEIPKIRAELKEIETQKTLQKINESSSWFF EKINKIDGLLARLIKKKREKNQIDAIKNDKGDITTDPTEIQTTIREYYKHLYANKLENLE EMDKFLDTYTLPRLNQEEFEPPNRPITGCEIEAIIISLPTKKSPGPDGFTAEFYQRYKEE LVPFLLKLFQSIEKEGILPDAFYEPSIILIPKPGRDTAKKENFRPISLMNIDAKILNKIL ANRIQQHIKKLIHHDQVGLIPGMQGWFNICKSINVIQHINRTNDKNHMIISIDVEKAFDK IQQPSC >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_3|921_bp atgaaagagcagaataccactaaagatgctatagatattgaaagagaactagagaagcaa gagcaaacacattcaaaagctagcagaaggcaagaaatacctaagatcagagcagaactg aaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatccagtagctggtttttt gagaagatcaacaaaattgatggactgctagcaagactaataaagaagaaaagagagaag aatcaaatagatgcaataaaaaatgataaaggggatatcaccaccgatcccacagaaata caaactaccatcagagaatactataaacacctctacgcaaataaactagaaaatctagaa gaaatggataaattcctcgacacatacaccctcccaagactaaaccaggaagaatttgaa cctccgaatagaccaataacaggctgtgaaattgaggcaataattattagcttaccaacc aaaaaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaggaggag ctggtgccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctgat gcattttatgagcccagcatcatcctgataccaaagcctggcagagacacagcaaaaaaa gagaattttagaccaatatccctgatgaacatcgatgcaaaaatcctcaataaaatactg gcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcctcatc cctgggatgcaaggctggttcaacatatgcaaatcaataaatgtaatccagcatataaac agaaccaacgacaaaaaccacatgattatctcaatagatgtggaaaaggcctttgacaaa attcaacaaccttcatgctaa >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_4|115_aa MQYKLICVSKKKRSSGSISDLPNSERVLSGIQTSPKYSPQFTKSVKSIVETESEGGLSYE YRWRRRRVEEGPQNSNTWGGDTYGILRFGKAPLPLNKTNPATSENRKSAKGNAKA >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_4|348_bp atgcagtacaaacttatttgtgtaagtaaaaaaaaacgaagctcagggtcgataagtgac ttgcccaacagtgaaagagtcttatctggaatccagacatcacctaaatacagcccacag tttacgaaatctgtcaaaagcatcgtagaaacagaaagcgaaggaggactatcgtatgag taccgctggaggagacgtagagttgaggagggcccccaaaacagcaatacctggggagga gatacctacggcattctgagattcgggaaagcaccactgccgctgaataaaacgaaccca gcaacttccgaaaacagaaaatccgccaaaggaaacgccaaggcatga >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_5|390_aa MAISPRSDATFSSQKSTPSESPRTKKFPLTEEEIFYMNCRAAYLTVFKSSLENIISKDQL YLALQHAGRNPSQKTINKYWTPQTAKLNFDDFCIILRKEKPTSKAELLKSFKQLDVNDDG CILHTDLYKFLTKRGEKMTREEVNAIINLADVNADGKFDYIKFCKLYMTTNEQCLKTTLE KLEVDSKLMRHQFGNHIEGSPERDPSPVPKPSPKITRKTDPETFLNKDIFEVIDLDGNGL LSLEEYNFFELRTSGEKCDEDAWAVCRENFDTKRNELTRQGFMDLNLMEANDREGDPCDL WVTLHSMGYNKALELTEACPFVIDIYAEKCKPKIKAVHMEACSGQLEKAICKSVLSNGDA KVMDGYENIIVHTYSCDTWITSVIENKSVS >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_5|1173_bp atggcgatcagtccacgaagcgatgcaactttctccagtcagaaatcaacaccttcagag agtcctcgaacaaagaaatttccactaactgaagaggaaatattttatatgaattgtaga gctgcctacttaactgtcttcaaaagcagcttggaaaacattatttctaaagatcaactt tacttagctcttcagcatgcaggaagaaatccatcccaaaagaccattaataagtattgg actcctcaaactgccaaactgaattttgatgatttttgtataattttaaggaaggaaaaa cctacttcaaaagcagaactactaaaatcatttaagcaattagatgtaaatgatgatggc tgtattttacacactgacctttataaatttctaacaaagagaggtgagaagatgactcga gaagaagtaaatgccataataaatttggctgatgtaaatgctgatggcaaatttgactac atcaagttttgtaaattatatatgacaaccaacgagcaatgtctcaagactacactagaa aaactagaggttgacagtaaattgatgcgtcaccagtttggaaaccacatcgaagggtcc cctgaaagggacccatcaccagtaccaaaaccatcacctaaaatcacaagaaaaactgat ccagaaacattcttaaataaagatatatttgaagtaattgatttagatggaaatggtctt cttagccttgaagaatataatttttttgaattgagaacaagtggtgagaaatgtgatgaa gatgcttgggctgtctgcagagagaattttgatacaaagaggaatgaactaacaagacaa ggatttatggatttgaatctaatggaagctaatgatcgagaaggagatccttgtgacctt tgggtaactctacactctatgggctacaataaagctctggagttgacagaggcatgtcca tttgtcattgatatctatgcagaaaaatgcaagccaaaaattaaagctgtccatatggag gcatgtagtggacaacttgagaaggccatttgtaaatctgttcttagcaacggtgatgcc aaagtaatggatggctatgaaaatataatcgtgcatacttacagttgtgacacctggata acgtcagttattgaaaacaagagtgtgagctag >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_6|106_aa MNKGLRVNIYTDCKYAFHILHHHAVIWAERNFLTMQGSSIINASLTKTLLKAALLPKEAG VIHCKGHQKASDPIAQDNAYADKVAKKAAIKRHQIPLLRTMLMLIR >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_6|321_bp atgaataaaggactacgcgtcaatatttatactgactgtaaatatgccttccatatcttg caccaccatgctgttatatgggcagaaagaaatttcctcactatgcaagggtcctccatc attaatgcctctttaacaaaaactcttctcaaagccgctttacttccaaaggaagctgga gtcattcactgcaaaggccatcaaaaggcatcagatcccatcgctcaagacaatgcttat gctgataaggtagctaaaaaagcagccatcaaaaggcatcagatcccattgctcaggaca atgcttatgctgataagatag >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_7|185_aa MVKIVTVKTQAYQDQKPGTSGLRKRVKVFQSSANYAENFIQSIISTVEPAQRQEATLVVG GDGRFYMKEAIQLIARIAAANGDWRFPLARSPTLWGKGGESTHPPGASVLYTTVTFVQDK YLYDVTTLHILDATQSAHMRIISIMIHHLQLGALMPCAFLPGLYLKSQISLSRLDLTAAD LFGPL >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_7|558_bp atggtgaagatcgtgacagttaagacccaggcgtaccaggaccagaagccgggcacgagc gggctgcggaagcgggtgaaggtgttccagagcagcgccaactacgcggagaacttcatc cagagtatcatctccaccgtggagccggcgcagcggcaggaggccacgctggtggtgggc ggggacggccggttctacatgaaggaggccatccagctcatcgctcgcatcgctgccgcc aacggggactggcgcttcccgctggctcggagcccgacactgtggggcaagggtggcgag agcacccatccccctggggcgtcagttctatatacaacagttacatttgtgcaagataaa tacttgtatgatgtgactacactgcacattcttgatgctactcagagtgcacatatgaga atcatctcaatcatgattcatcacctgcaacttggagcacttatgccctgtgcctttcta ccgggcttatatttgaaatctcagatttccctgtcaaggcttgatcttaccgctgcagat ctgtttggaccactctga >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_8|88_aa MGDTEIMSSVRGQGPQYTLTRHFSHYIAMPTCGDRFMHYCMPQIHSTVLSNKKTFSKCFR DGKHPIKSYSRRFLEQVDLINYSPEPLS >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_8|267_bp atgggggatacagagattatgtcatctgtccgaggtcagggcccccaatacaccctaacg aggcacttctcacactatattgcaatgcccacttgcggagaccgttttatgcactactgc atgccccaaatacacagcactgtattaagcaataagaaaacattcagtaaatgtttcagg gatggaaagcaccccatcaaatcttattcacggagatttctggagcaagtggacctaatc aactattcaccagaaccactatcctaa >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_9|729_aa MSDFEEWISGTYRKMEEGPLPLLTFATAPYHDQKPGTSGLRKKTYYFEEKPCYLENFIQS IFFSIDLKDRQGSSLVVGGDGRYFNKSAIETIVQMAAANGIGRLVIGQNGILSTPAVSCI IRKIKAIGGIILTASHNPGGPNGDFGIKFNISNGGEFAVILRTGPAPEAITDKIFQISKT IEEYAVCPDLKVDLGVLGKQQFDLENKFKPFTGMFTFLLFLPTPISKLSDFPFNVLVEGL QLWKDFLEIVDSVEAYATMLRSIFDFSALKELLSGPNRLKIRIDAMHGVVGPYVKKILCE ELGAPANSAVNCVPLEDFGGHHPDPNLTYAADLVETMKSGEHDFGAAFDGDGDRNMILGK HGFFVNPSDSVAVIAANIFSIPYFQQTGVRGFARSMPTSGALDRYWEKARFLQQLAVPPP GSDHIREKDGLWAVLAWLSILATRKQSVEDILKDHWQKYGRNFFTSQPVGLNFVTSSRYD YEEVEAEGANKMMKDLEALMFDRSFVGKQFSANDKVYTVEKADNFEYSDPVDGSISRNQV ETDRCVMVYALVLCTVQYPHSGILNLDSRGIFALSGMICKMMCPLYMQKGAPESKKEGEL WELRCQLLSFLSADYSRVTALLMNGTIKHQMGIGDEQQVKSSIFLGLRLIFTDGSRIVFR LSGTGSAGATIRLYIDSYEKDVAKINQDPQLLGHQHFVGTAQALRGNVVVSAGGVDSPFP GIMDRWRFR >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_9|2190_bp atgagtgattttgaagaatggatttctgggacatatagaaaaatggaagaaggtcctctc cctctgttgacttttgctacagctccctaccacgatcagaaaccaggaacaagtggatta cggaagaaaacctattattttgaggaaaagccatgctatctggagaatttcatccagagt atattcttttccatagacctaaaagatcgccagggatcatcactggtggttggtggagat gggcggtactttaataaatcagcaatagaaacaatagtgcagatggcagctgccaatggg atcggtcgcttggttatcggacagaatggaatcctctccacccctgctgtatcctgcatc attagaaaaatcaaagccattggtgggatcattctgacagccagtcacaacccagggggc cccaatggagattttggaatcaaattcaatatttctaatggaggtgagtttgctgtcatt ttgaggacaggtcctgctccagaagcaataactgataaaattttccaaatcagcaagaca attgaagaatatgcagtttgccctgacctgaaagtagaccttggtgttctgggaaagcag cagtttgacttggaaaataagttcaaacccttcacaggcatgtttactttcctcctcttt ctgcccactcctatttccaagttgagcgattttcctttcaacgttttagtagagggtctt cagctttggaaagatttcctggaaattgtggattcggtagaagcttatgctacaatgctg agaagcatctttgatttcagtgcactgaaagaactactttctgggccaaaccgactgaag atccgtattgatgctatgcatggagttgtgggaccgtatgtaaagaagatcctctgtgaa gaactcggtgcccctgcgaactcggcagttaactgcgttcctctggaggactttggaggc caccaccctgaccccaacctcacctatgcagctgacctggtggagaccatgaagtcagga gagcatgattttggggctgcctttgatggagatggggatcgaaacatgattctgggcaag catgggttctttgtgaacccttcagactctgtggctgtcattgctgccaacatcttcagc attccgtatttccagcagactggggtccgcggctttgcacggagcatgcccacgagtggt gctctggaccggtactgggagaaagcacgtttcttacagcagcttgctgtcccccctcca ggttctgaccacatccgtgagaaagatggactgtgggctgtccttgcctggctctccatc ctagccacccgcaagcagagtgtggaggacattctcaaagatcattggcaaaagtatggc cggaatttcttcaccagtcagcccgtgggcctcaacttcgtgacttcttccaggtatgat tacgaggaggtggaagctgagggcgcaaacaaaatgatgaaggacttggaggccctgatg tttgatcgctcctttgtggggaagcagttctcagcaaatgacaaagtttacactgtggag aaggccgataactttgaatacagcgacccagtggatggaagcatttcaagaaatcaggta gaaacagaccggtgtgtaatggtttatgcccttgttctgtgcactgtccagtaccctcat tctgggattcttaacctggactccagagggatttttgcactctctggaatgatatgcaag atgatgtgtccattgtacatgcagaaaggagccccagaatctaagaaagagggagaactg tgggaacttagatgtcaactcctctcttttctgtcagcagattatagcagagtgacagca ttgctaatgaatggcaccattaaacaccagatggggattggtgacgaacagcaggtgaag agcagcatctttttgggcttgcgcctcattttcacagatggttctcgaatcgtcttccga ctgagcggcactgggagtgccggggccaccattcggctgtacatcgatagctatgagaag gacgttgccaagattaaccaggacccccagctgctaggccatcagcactttgtggggact gcccaagccctaaggggaaatgtggtagtttcagcaggtggagttgattctccttttcca ggtataatggatcggtggagatttagataa >gi568815597f:63425573_63672513|GENSCAN_predicted_peptide_10|335_aa MKGVAKFDKEVSSEGPSQSKLEAIHPDNGEIRLPSKQKPGPIVQDNGITPKAFWRSSRLP PHFTGPDCKGLGEQNNFKALLFAKLFGCPKFSSSEFKHRPRSWVNTGEKGSLCHVGVPRA VEEMESKQVMNIEWDKRLEGGNRGDRNTRKSSQTRHWRLRKHFLEETMFKLRPEEQVGVG RIWRMVEWRGKVENVVEGGMEGKGDSGGKKSSFQGLGAGGRCHGSPRLKLGICCQVEAGK AVEIRSGRRKFLKPFQGPQNWEYPQLIQEMRRQKRVRKHERYLGSEFQFHHTVAENLGQI NELLLSFSCLNWKARRMMSCFEGYWKASSPTSEIE >gi568815597f:63425573_63672513|GENSCAN_predicted_CDS_10|1008_bp atgaagggtgtggccaagtttgataaggaagttagttcagaggggccatctcaatcaaag ctagaagctattcatccagacaatggggagattagactcccgtctaaacagaagccagga cctattgtgcaagacaatggaataaccccgaaggcattttggagatcatcaaggctgccc cctcacttcacaggtccagactgcaagggactgggggaacagaacaatttcaaggctctg ctctttgccaagctctttggctgccccaagttcagctctagtgaattcaaacacaggcct agatcctgggtaaacactggtgaaaaaggcagtctctgccatgtaggagttcccagggca gtagaagaaatggaaagtaaacaggttatgaacatagaatgggataagcgcctcgaaggg ggaaatagaggtgataggaacaccagaaaaagctcccaaaccagacactggaggctcaga aagcacttcttggaggaaacgatgtttaaattgagacctgaggaacaagtaggagttggg agaatatggcggatggtggaatggaggggaaaggtggagaatgtagtggagggtgggatg gagggtaaaggtgattcaggtggaaagaagagcagcttccaaggcctgggggcaggaggg aggtgccatggttccccaagactgaagcttggaatctgttgccaagtggaggcaggaaaa gcagttgaaatacgatcagggaggaggaaatttttgaagccattccaggggccacagaac tgggagtatccgcagctgattcaagagatgaggaggcagaaaagggtcaggaaacatgaa cgatacttagggtcagaattccagttccatcacacggtagctgagaacctggggcaaatt aatgagcttctgttgagcttcagttgcctcaactggaaagcgagaagaatgatgtcttgc ttcgagggttactggaaagcatctagcccaacatctgagatagaatag