GENSCAN 1.0 Date run: 4-Nov-116 Time: 19:03:28

Sequence gi568815575r:103153629_103353979 : 200351 bp : 39.37% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3144   3252  109  2  1   46  100    83 0.535   5.73
 1.02 Intr +   7056   7189  134  1  2   58   96   122 0.959   9.44
 1.03 Intr +  28410  28433   24  2  0  108   88     7 0.339   0.00
 1.04 Intr +  29256  29327   72  2  0   59   89    58 0.511   1.68
 1.05 Term +  52858  53127  270  0  0   -2   38   233 0.047   3.80
 1.06 PlyA +  54869  54874    6                               1.05

 2.00 Prom +  55109  55148   40                              -6.05
 2.01 Init +  61714  61887  174  0  0   83   37   270 0.745  18.79
 2.02 Intr +  62061  62136   76  2  1   38   67    69 0.722  -2.03
 2.03 Intr +  62521  62770  250  2  1  102   76   309 0.524  26.67
 2.04 Term +  67638  67803  166  0  1   66   49   254 0.748  15.71
 2.05 PlyA +  68524  68529    6                               1.05

 3.07 PlyA -  72908  72903    6                               1.05
 3.06 Term -  88287  87971  317  1  2   68   42   191 0.883   6.62
 3.05 Intr -  89620  89508  113  0  2   11   90    59 0.113  -2.50
 3.04 Intr -  96458  96307  152  2  2   22   42   170 0.123   3.84
 3.03 Intr -  97036  96907  130  0  1   62  103    75 0.711   6.18
 3.02 Intr - 101049 100936  114  2  0   97   39    98 0.197   4.44
 3.01 Init - 110929 110874   56  1  2   74   95    48 0.319   4.91
 3.00 Prom - 113110 113071   40                              -3.45

 4.07 PlyA - 114437 114432    6                               1.05
 4.06 Term - 120962 120315  648  2  0  123   35   548 0.729  46.09
 4.05 Intr - 121353 121291   63  0  0   77   69    72 0.811   2.20
 4.04 Intr - 121531 121476   56  2  2  106   81    30 0.989   1.88
 4.03 Intr - 121714 121609  106  1  1  143   -4   141 0.590   9.27
 4.02 Intr - 123141 122851  291  0  0   13   45   290 0.224  13.61
 4.01 Init - 127403 127077  327  2  0   65   68   121 0.267   5.17
 4.00 Prom - 127457 127418   40                              -1.35

 5.00 Prom + 131319 131358   40                              -6.15
 5.01 Sngl + 131780 132301  522  1  0   42   40   266 0.471  13.00
 5.02 PlyA + 132379 132384    6                              -0.45

 6.00 Prom + 132601 132640   40                             -11.44
 6.01 Init + 132957 134362 1406  2  2   68   53   343 0.133  19.03
 6.02 Intr + 139591 139705  115  1  1   83   22    68 0.073  -0.77
 6.03 Term + 140153 140332  180  1  0   55   47   115 0.370   0.73
 6.04 PlyA + 142513 142518    6                               1.05

 7.00 Prom + 152119 152158   40                              -3.65
 7.01 Init + 152243 152366  124  1  1   78  105    59 0.696   6.98
 7.02 Intr + 152826 152855   30  1  0  105  115     1 0.523   1.68
 7.03 Term + 153992 154020   29  0  2   66   48    43 0.326  -4.64
 7.04 PlyA + 155560 155565    6                               1.05

 8.04 PlyA - 155739 155734    6                               1.05
 8.03 Term - 156353 155962  392  0  2   84   41   397 0.876  28.76
 8.02 Intr - 156805 156730   76  1  1  109   82    15 0.677   1.17
 8.01 Init - 157394 157089  306  2  0   63   30   279 0.548  16.94
 8.00 Prom - 171781 171742   40                              -5.35

 9.03 PlyA - 171929 171924    6                               1.05
 9.02 Term - 177578 177432  147  2  0  122   42   100 0.132   5.72
 9.01 Intr - 186286 186000  287  2  2   71   76   117 0.265   4.94

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_1|202_aa
MFPSVIHAPLHQNHQKVLVKTADSRPILDVINQNKEGVYMTWGSGALLELQKEVTDGECA
GRDTGGRVKNCSWDPVPEPSRMKEMKSWEVSRTKGQDFISIIIIITTTKRGCRSLVVSLG
IDNHQKTWFLEGCLDLVSEVSRSEVARNRSGYTGSSKLQHSLLASIPRGYDTDISRVFTG
NNGMNCHQKLLPGPSQGKVMAY

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_1|609_bp
atgtttccaagtgtgattcatgcaccacttcatcaaaatcatcagaaagtacttgtaaaa
actgcagattctaggcctatcctggatgtgataaaccaaaacaaggaaggtgtgtacatg
acctggggctctggggccttattggagcttcagaaagaggttactgatggtgagtgtgca
gggagggacactggaggaagagttaagaattgttcctgggacccagtccctgaaccatcc
aggatgaaggaaatgaagagctgggaggtctctcggaccaaagggcaggatttcatttct
atcatcatcattatcactaccaccaagaggggttgtagaagcctggtggtcagccttggg
attgataaccaccagaagacatggttcctggaaggctgcttggatctggttagtgaagtt
tccaggagtgaagtggccagaaataggagtggctacactggcagcagcaaacttcagcat
agcctgctggccagtattcctagaggatatgacactgacatcagcagggttttcactggc
aacaatggcatgaactgccaccagaagcttctcccaggtccttctcaaggaaaagtgatg
gcatactga

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_2|221_aa
MQPPGPAAGPRAFLERRGARPGGSRRSWGYDPSETPAYAISASFSDTPAPHFLVQKMVVC
GAKCRGGAPRVKNPEEETARIGPGVMESKEELAANNLNGENAQQENEGGEQAPTQNEEES
RHLGGGEGQKPGGNIRRGRVRRLVPNFRWAIPNRHIEHNEARDDVESRAVNASRCCEGPH
RGQQHNTHQYPVAADECAPYYAVAADATATYEQGWIPLLPP

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_2|666_bp
atgcagcccccgggccccgcggcgggcccgcgagccttccttgagcggagaggtgcccgg
cccggagggagccggcggtcctggggctacgacccttcggaaacacctgcctacgccatc
agcgcaagcttttccgacacccctgccccgcacttcttggtgcagaaaatggtggtctgc
ggggctaagtgtcgcggcggcgcacctcgcgtcaagaatccggaggaggagactgcaagg
ataggcccaggagtaatggagtccaaagaggaactagcggcaaacaatctcaacggggaa
aatgcccaacaagaaaacgaaggaggggagcaggcccccacgcagaatgaagaagaatcc
cgccatttgggagggggtgaaggccagaagcctggaggaaatatcaggcgggggcgagtt
aggcgacttgtccctaattttcgatgggccatacctaataggcatattgagcacaatgaa
gcgagagatgatgtagaaagccgcgctgtcaatgctagccgctgctgcgaaggcccgcac
agaggccagcaacacaacacccatcagtaccctgtcgcagcagatgagtgtgcaccctac
tatgctgtagctgcagatgctactgccacatacgaacaaggatggatcccactgctaccg
ccctag

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_3|293_aa
MIIYAENPKEFLKESRTLRIFGFLGATQIEVREERGKSPESLQVSFCKGAIASGLFGLPE
GPRCALACLVGSARTLALCLPGAEHLMPEPAQALKHRRPQPPDIRLLGQTENVLSTTIIA
VNSIIIIIIIIIIIIKGWCSVNIQWILVGDREGQTLTFIGDAICGPSRLAKTRVTGWCAL
RHCSQRGGGVIAVKEEKGKTTFDMENSHQENEESVHNVEEELHLEEMEGQEARGNNLQEQ
APPTQEDGDGLPHRHVNNNEGRGRKRMRRRKEMGRRRKRRRKRRRRKVSRRDF

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_3|882_bp
atgattatctatgcagaaaatcctaaagaattcttgaaagagtctagaactcttaggata
tttggttttttgggcgcgacacaaatcgaggtgagggaagagagaggaaaatcccctgaa
tccctgcaggtcagtttttgtaaaggtgcaattgcttcaggtctcttcggactcccagaa
gggcctcgatgtgccctggcctgtttggttggttctgcaaggactttagctctgtgtctt
cctggtgcagagcatctgatgcctgagccagcccaggccctgaagcacaggagaccccag
cccccagacataagactcttaggtcaaacagaaaatgtcctctctaccaccatcattgct
gtcaacagcatcatcatcatcatcatcatcatcatcatcatcatcaagggatggtgttca
gtgaatattcaatggattctggttggggacagagaagggcaaacactaaccttcatcggt
gatgccatttgtggcccctcaagattggcgaagaccagagtgacagggtggtgtgcactg
aggcactgttcacagaggggtggaggagtaattgcagtcaaagaggaaaagggaaaaaca
actttcgacatggaaaattcccaccaggaaaatgaagaaagtgttcataacgtagaggaa
gagctacatttggaagaaatggaaggccaggaagctagaggaaataatctccaggaacaa
gcaccacctacccaggaagatggagatggcttgccccataggcatgtcaataacaatgaa
gggagagggaggaagaggatgaggaggaggaaggagatggggaggaggagaaagaggagg
agaaagaggaggagaagaaaagtgtccagaagagatttttaa

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_4|496_aa
MGTGVMTKTPKAIASRAKIDKRDLIKHRSLCMAKETINRINRQPTEWEKILANYASDKGL
ISSKYKGLKQICKRKNNPIKKWAKDMNRHFKRRHTCKLNITVESSVSIPKSLSSRYQIPV
SVEEGRLQTEGQRRQLPYSPAGRYKPGTPEVGMVMHIGYRGSDLLSRYRPVPPPFILAQK
FFCVGQEVWVPAFISLEKCRLLAEGNVFKSVVVGTNPDKKRKDLQYRCRSVGEGYCTRTL
GISGYMTAVKTLRGYGDLWLFYKSYNCMQGEKEENCPGFLQERQRREHLNMEKLYKENEG
KPENERNLESEGKPEDEGSTEDEGKSDEEEKPDMEGKTECEGKREDEGEPGDEGQLEDEG
NQEKQGKSEGEDKPQSEGKPASQAKPESQPRAAEKRPAEDYVPRKAKRKTDRGTDDSPKD
SQEDLQERHLSSEEMMRECGDVSRAQEELRKKQKMGGFHWMQRDVQDPFAPRGQRGVRGV
RGGGRGQKDLEDVPYV

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_4|1491_bp
atgggcacaggtgtcatgacaaagacaccaaaagcaattgcatcaagagcaaaaattgac
aaacgagatttaattaaacataggagcttgtgtatggcaaaagaaactatcaacagaata
aacagacaacctacagaatgggagaaaatattagcaaactatgcatctgacaaaggtcta
atatcgagcaagtataagggacttaaacaaatttgcaagaggaaaaacaaccccattaaa
aagtgggcaaaggacatgaacagacacttcaaaagaagacatacatgcaagctcaatatc
actgtggaaagcagtgtgtcgattcccaagtccctctcctccaggtaccagatccctgtt
tccgtggaggaaggcagacttcagactgaaggacagagaaggcaactgccctacagccct
gcaggtcggtacaagccggggacccctgaggtagggatggtaatgcacataggctacaga
ggatcggatctgctttctagataccgccctgttccacccccattcattttggcgcagaaa
tttttttgtgttgggcaagaggtatgggtaccagcttttatttccctggagaagtgcagg
cttctggcagaaggaaatgtcttcaagtctgtggttgtgggcactaacccagacaagaaa
aggaaagacctgcagtatcggtgcaggtcagtgggagagggctactgcaccaggaccctt
gggatttcgggatacatgactgctgtcaagaccctaaggggatatggtgatctatggctg
ttttataagtcttacaactgcatgcaaggggaaaaagaagaaaactgcccaggatttctg
caggaaaggcaaagaagggaacatctcaacatggaaaagctctacaaagaaaatgaagga
aagccagagaatgaaagaaacctagaaagtgagggaaagccagaggatgagggaagtaca
gaagatgaaggaaagtcagacgaggaagaaaagccggacatggaggggaagacagaatgc
gagggaaagcgagaggatgagggagagccaggtgatgagggacaactggaagatgaggga
aaccaggaaaagcagggcaagtctgaaggtgaggacaagccacaaagtgagggcaagcca
gcctcccaggccaagccagagagccagccgcgggccgccgaaaagcgcccggctgaagat
tatgtgccccggaaagcaaaaagaaaaaccgacagggggacggacgattcccccaaggac
tctcaggaggacttacaagaaaggcatctgagcagtgaggagatgatgagagaatgtgga
gatgtgtcaagggctcaggaggagctaaggaaaaaacagaaaatgggtggttttcattgg
atgcaaagagatgtacaggatccattcgccccaaggggccaacggggtgtgaggggagtg
aggggcggaggtaggggccagaaagacttagaagatgtcccatatgtttaa

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_5|173_aa
MKAEIKMFFETNENKDTTYQNLWDTFKAVCRGKFIALNAHKRKQERSKIDTLTSQLKELE
KQEQTHSKASRRQEITKIRAEQKEIETQKTLQKINESRSWFFEKINKIDRPLARLMKKRE
KNQIDAIKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDTLSQD

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_5|522_bp
atgaaagcagaaataaagatgttctttgaaaccaatgagaacaaagacacaacataccag
aatctctgggacacattcaaagcagtgtgtagagggaaatttatagcactaaatgcccac
aagagaaagcaggaaagatctaaaattgacaccctaacatcacaattaaaagaactagag
aagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaagatcagagca
gaacagaaggaaatagagacacaaaaaacccttcaaaaaatcaatgaatccaggagctgg
ttttttgaaaagatcaacaaaattgatagaccgctagcaagattaatgaagaaaagagag
aagaatcaaatagatgcaataaaaaatgataaaggggatatcaccaccaatcccacagaa
atacaaactaccatcagagaatactataaacacctctatgcaaataaactagaaaatctg
gaagaaatggataaattcctcgacacactctcccaagactaa

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_6|566_aa
MPSLTTPIQHSVGSSGQGNQAGERNKRYSIRKEEVKLSLFADDMIVYLENPIVSAQNLLK
LISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTRDVKDL
FKENYKPLLNEIKEDTNKWKNIPCSWVGRINIMKMAILPKVIYRFNAIPIKLPMTFFTEL
EKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYWYQNKDIDQWN
RTEPSEIMSHIYNYPIFDKPDKNKKWGNNSLFNKWCWENWLAICRKLKLDPFLTPYTKIN
SRWIKDLHVRPKTIKTLEENLSNTIQDIGMGKDFMSKTPKAMATKAKIDQWDLIKLKSFC
TAKETTIRVNRQPTEWEKIFATHSSDKGLLSRIYNELKQIYKKKTDNPINKWAKDMNRHF
SKEDIYAAKRHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRWNISEDPGLAL
SCIKEASITVPSLRCAVSIIQGILSNWEDVINVMPWLYSWRKLGADKILSIKIVHNDKAI
EIPHEKLNKGVSYLFEREGKLQLRGY

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_6|1701_bp
atgccctctctcaccactcctattcaacatagtgttggaagttctggccagggcaatcag
gcaggagaaagaaataaaaggtattcgattaggaaagaggaagtcaaattgtccctgttt
gcagatgacatgattgtatatctagaaaaccccattgtctcagcccaaaatctccttaag
ctgataagcaacttcagcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagca
ttcttatacaccaataacagacaaacagagagccaaatcatgagtgaactcccattcaca
attgcttcaaagagaataaaatacctaggaattcaacttacaagggacgtgaaggacctc
ttcaaggagaactacaaaccactgctcaatgaaataaaagaggatacaaacaaatggaag
aacattccatgctcatgggtaggaagaatcaatatcatgaaaatggccatactgcccaag
gtaatttacagattcaatgccatccccatcaagctaccaatgactttcttcacagaattg
gaaaaaactactttaaagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatc
ctaagccaaaagaacaaagctggaggcatcacgctacctgacttcaaactatactacaag
gctacagtaaccaaaacagcatggtactggtaccaaaacaaagatatagaccaatggaac
agaacagagccctcagaaataatgtcacatatctacaactatccgatctttgacaaacct
gacaaaaacaagaaatggggaaacaattccctatttaataaatggtgctgggaaaactgg
ctagccatatgtagaaagctgaaactagatcccttccttacaccttatactaaaattaat
tcaagatggattaaagacttacatgttagacctaaaaccataaaaaccctagaagaaaac
ctaagcaataccattcaggacataggcatgggcaaggacttcatgtctaaaacaccaaaa
gcaatggcaacaaaagccaaaattgaccaatgggatctaattaaactaaagagcttctgc
acagcaaaggaaactaccatcagagtgaacaggcaacctacagaatgggagaaaattttt
gcaacccactcatctgacaaagggctactgtccagaatctacaatgaactcaaacaaatt
tacaagaaaaaaacagacaaccccatcaacaagtgggcgaaggacatgaacagacacttc
tcaaaagaagacatttatgcagccaaaagacacatgaaaaaatgctcatcatcactggcc
atcagagaaatgcaaatcaaaaccacaatgagataccatctcacaccagttagaatggca
atcattaaaaagtcaggaaacaacaggtggaatatatcagaggacccaggtttggctctg
tcctgtatcaaggaggcctccatcactgtgccaagcttgcgatgtgctgtgtccattatc
cagggcatattgagcaactgggaagatgttatcaatgtaatgccatggctgtattcctgg
agaaagctgggtgcagataagatcttgtccataaaaattgtgcacaatgacaaagctatt
gagatacctcatgagaagcttaataaaggtgtatcatatctctttgaaagggaaggcaaa
ctgcagctgagaggctactga

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_7|60_aa
MEYYAAIKKDEFMSFAGTWMKLETIILSKLSQEQKTTWTHGDNFALHSLIDAGANTDNQE

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_7|183_bp
atggaatactatgcagccataaaaaaggatgagttcatgtcctttgcagggacatggatg
aagctggaaaccatcattctcagcaaactatcacaagaacagaaaaccacatggacacat
ggagataactttgccctccactctctgatagatgctggtgctaacactgacaatcaagag
taa

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_8|257_aa
MTTATRDRPTLSLTSLWRQNRRRSLPSRQWYLFPVSLRTCGPGTAPKVGSGGGAGKGPGG
GPWGWCGGAAPACGCPIPGQQRGDAAGPRCSPGAPPRDREPCVCGAKCCGDAPHVENREE
ETARIGPGVMESKEERALNNLIVENVNQENDEKDEKEQVANKGEPLALPLNVSEYCVPRG
NRRRFRVRQPILQYRWDIMHRLGEPQARMREENMERIGEEVRQLMEKLREKQLSHSLRAV
STDPPHHDHHDEFCLMP

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_8|774_bp
atgacaacagccacacgtgatcggccaacactgagtcttacctcgttgtggcgtcagaac
cgccgtcgctcgctcccttctcggcagtggtacctgttcccggtgtccctgaggacgtgc
gggccaggtacggccccgaaagtaggaagcggagggggagcaggtaagggacccggaggg
ggtccctggggttggtgtgggggagcagccccggcctgcggatgccccatccccgggcag
cagcgcggagacgcagccggtccacgatgcagccccggggccccgccgcgggaccgcgag
ccttgtgtttgcggggccaagtgttgcggcgacgcacctcacgtcgagaatcgggaggag
gagactgcaaggataggcccaggagtaatggagtccaaagaggaacgagcgttaaacaat
ctcatcgtggaaaatgtcaaccaggaaaatgatgaaaaagatgaaaaggagcaagttgct
aataaaggggagcccttggccctacctttgaatgttagtgaatactgtgtgcctagagga
aaccgtaggcggttccgcgttaggcagcccatcctgcagtatagatgggacataatgcat
aggcttggagagccacaggcaaggatgagagaggagaatatggaaaggattggggaggag
gtgagacagctgatggaaaagctgagggaaaagcagttgagtcatagtttgcgggcagtc
agcactgatccccctcaccatgaccatcacgatgagttttgccttatgccctga

>gi568815575r:103153629_103353979|GENSCAN_predicted_peptide_9|144_aa
XLKWKLSWTLSTLSHLTCGAQSAEPFTPEFHISRALGGAYPCCSVPACQSISLGAPQPGL
GQCLRSSHLLTTLINLDQAFSTGDVCGPLEIASKLPEPTVSPADKANGLSNRQERRSRKG
DGLRSRKFRLAFAPSRLLSSEISV

>gi568815575r:103153629_103353979|GENSCAN_predicted_CDS_9|435_bp
ntgctcaagtggaagctgtcttggaccctgtctactttatcccacctgacttgcggagca
cagtctgctgagcctttcacccctgaattccacatctctagagccctaggaggtgcctat
ccttgctgcagtgttcctgcctgccagtccatctccctgggggcccctcagccaggctta
ggtcagtgtttaagaagctctcacctgctgaccacattgattaacctagatcaggcgttc
tcaactggagatgtttgtggaccacttgagattgcatcaaagttacctgaacccaccgtt
tccccagcagacaaagctaatggcctctccaataggcaggagaggcggagcagaaaagga
gatggactgcgaagccgcaaatttcgcctagcttttgccccttcccgactcctatcctca
gagatctcagtctaa