GENSCAN 1.0 Date run: 6-Nov-116 Time: 16:23:22

Sequence gi568815591r:75669696_75872176 : 202481 bp : 47.83% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

1.01  Init +  18655  18770  116  0  2   50   39   131 0.209   2.12
1.02  Intr +  29993  30050   58  2  1  117   98    90 0.865  11.79
1.03  Intr +  34197  34355  159  2  0   52   99    56 0.120   3.28
1.04  Term +  37912  38466  555  0  0  -21   44   232 0.096   2.23
1.05  PlyA +  39226  39231    6                               1.05

2.07  PlyA -  39310  39305    6                               1.05
2.06  Term -  61911  61843   69  0  0   56   46    56 0.113  -3.86
2.05  Intr -  69294  69106  189  0  0   20   94   174 0.036  10.98
2.04  Intr -  69858  69621  238  2  1   54   62   100 0.014   1.62
2.03  Intr -  88554  88474   81  2  0   35  105    59 0.062   1.05
2.02  Intr - 102308 102194  115  0  1   94   76    78 0.193   6.71
2.01  Init - 102481 102409   73  1  1   80   75   157 0.890  12.83
2.00  Prom - 105683 105644   40                              -6.26

3.00  Prom + 106985 107024   40                              -3.16
3.01  Init + 109703 109760   58  1  1   42  106    28 0.489   1.47
3.02  Intr + 113590 113724  135  2  0  101   80    30 0.565   4.04
3.03  Intr + 117483 117628  146  1  2   70   93    46 0.314   3.30
3.04  Intr + 123879 124004  126  2  0   50  113    11 0.059   0.68
3.05  Term + 131595 131738  144  2  0   42   42    85 0.143  -2.79
3.06  PlyA + 131788 131793    6                               1.05

4.04  PlyA - 134080 134075    6                               1.05
4.03  Term - 142269 142101  169  2  1  135   36   194 0.999  16.25
4.02  Intr - 143728 143611  118  2  1  114   45    50 0.995   2.82
4.01  Init - 144020 143948   73  2  1   78   94    49 0.927   4.89
4.00  Prom - 144098 144059   40                              -5.76

5.03  PlyA - 144501 144496    6                               1.05
5.02  Term - 150501 150313  189  0  0  -17   42   616 0.991  43.95
5.01  Init - 153763 153692   72  1  0   90  105     5 0.395   1.48
5.00  Prom - 154514 154475   40                              -6.56

6.00  Prom + 157262 157301   40                              -4.66
6.01  Init + 160132 160236  105  0  0   70   16    67 0.212  -2.06
6.02  Term + 166017 166238  222  2  0   38   42   373 0.942  24.62
6.03  PlyA + 167213 167218    6                               1.05

7.02  PlyA - 168029 168024    6                               1.05
7.01  Term - 169069 168945  125  2  2   32   44   136 0.440   2.15

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  69225  69106  120  0  0  103   94   156 0.890  17.89

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:75669696_75872176|GENSCAN_predicted_peptide_1|295_aa
MRAKEERPGARRVRAPRGVHHAASAGGGGDELSVEPLDGHTLQSFKPLIRVQAYQTLKGS
GTELLARAADPKPLVKSLGKSQGPGPGLHRCAKFPQDSCSMEALPWEPVNLFIWNQKRAR
IAKSILSQKNKAGGITLPDFKLYYKATVTKTVWYWYQNRDIDQWNRTEPSEIMPHIYNYL
IFDKPEKNKQWGKDSLFNKWCWENWLAICRKLKLDPFLTPYTKINSRWIKDLYVRPKTIK
TLEENLGITIQDIGVGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIYLL

>gi568815591r:75669696_75872176|GENSCAN_predicted_CDS_1|888_bp
atgagagccaaggaagaacggccgggggcgcgccgggtccgggctcccaggggagtccat
catgctgcgtctgcagggggtgggggggatgagctgtcggtggagcccctcgacggccac
accctccagtccttcaagcccctgatccgggtccaggcctaccagaccctgaaggggtct
ggaactgagctcctggccagagctgccgacccgaagcctttggtgaagagtctagggaaa
agccagggccccgggcctgggctgcataggtgtgccaagttcccccaagactcatgcagc
atggaagccctgccctgggaaccagtgaacctgttcatatggaaccaaaaaagagcccgc
atcgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacactacctgacttc
aaactatactacaaggctacagtaaccaaaacagtatggtactggtaccaaaacagagat
atagatcaatggaacagaacagagccctcagaaataatgccgcatatctacaactatctg
atctttgacaaacctgagaaaaacaagcaatggggaaaggattccctatttaataaatgg
tgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacacct
tatacaaaaatcaattcaagatggattaaagatttatacgttagacctaaaaccataaaa
accctagaagaaaacctaggcattaccattcaggacataggcgtgggcaaggacttcatg
tccaaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaa
ctaaagagcttctgcacagcaaaagaaactaccatatatcttctttag

>gi568815591r:75669696_75872176|GENSCAN_predicted_peptide_2|254_aa
MMGLSLASAVLLASLLSLHLGTATRGSDISKTCCFQYSHKPLPWTWVRSYEFTSNSCSQR
AVMPGVPNPWAMDQYWSICGLLGTGPHSRRAGQSGAPGSQARKELTGYSQAGWDSPGGGP
TARPASSPPISTGRRGIAPCRSGQGSGRRLAEGSTLGARGDCGRGEERAGGAGGAVGIPG
QPRAPDSAPRGDMDRMASSMKQVPNPLPKVLSRRGVGAGLEAAERESFERTQVHKEVTPC
LKSEQVPSCKESCS

>gi568815591r:75669696_75872176|GENSCAN_predicted_CDS_2|765_bp
atgatgggcctctccttggcctctgctgtgctcctggcctccctcctgagtctccacctt
ggaactgccacacgtgggagtgacatatccaagacctgctgcttccaatacagccacaag
ccccttccctggacctgggtgcgaagctatgaattcaccagtaacagctgctcccagcgg
gctgtgatgccaggggtccccaacccctgggccatggaccagtactggtccatctgtggc
ctgttaggaactgggcctcacagcaggagggctggtcagtccggcgctcccgggtcccag
gcccggaaggagctaacgggctattcgcaggcgggctgggattcccccgggggaggcccc
actgcccggcccgcgtcatccccgcccatctccacgggccgtcgcgggatagccccctgc
aggagcgggcagggtagtgggcggcgcttggcggagggcagcacgctcggggcgcgcggg
gactgcggccgaggggaggagagggcgggcggggcgggcggcgccgtggggatcccgggg
cagccgagggcccctgactcggctcctcgcggcgacatggatcggatggccagctccatg
aagcaggtgcccaacccactgcccaaggtgctgagccggcgcggggtcggcgctgggctg
gaggcggcggagcgcgagagcttcgagcggactcaggtgcacaaagaagtcaccccttgc
ctgaagtctgagcaggtgccttcttgcaaggagagctgctcgtga

>gi568815591r:75669696_75872176|GENSCAN_predicted_peptide_3|202_aa
MTSGPQTDQPKEHLTNFKLDEQERVFSLAQSHTDNRRLHEPDLQEVIRAVPLGDPKWNYQ
ADSPDVIPRFGLPTSIQSDNGLAFISQITQAVSQALGIQWKLRTPYHPQSSGKCWDYRHK
SLHPAKCLLNSYSPSNVQGSGDTVVDRTPSLSLRKSSLWYKAPQSRDGDTIGDGKTSGAH
QLGQPNKKRHGYHIISFPDFLS

>gi568815591r:75669696_75872176|GENSCAN_predicted_CDS_3|609_bp
atgacctcaggtcctcagaccgaccagcccaaggaacatctcaccaattttaaattggat
gaacaggaaagagttttttctctagcccaatctcacactgataaccgccggcttcacgag
ccagacctccaggaagttattagagcagttcccctaggagatccaaaatggaactatcag
gctgattccccagacgtaattcctcggtttggccttcccacctctatacagtccgataat
ggactggcctttattagtcaaatcacccaagcagtttctcaggctcttggtattcagtgg
aaacttcgtaccccttaccatcctcaatcttcaggaaagtgctgggattacaggcataag
tcactgcatccagccaaatgtttgctgaattcttacagccccagcaatgtccaaggctct
ggagacaccgtggtggacaggacaccatccctgtccttaagaaagagtagcctttggtat
aaggcaccccagtccagagatggagatactattggcgatgggaaaacatcaggagcacac
caactggggcagcccaataaaaagaggcatggatatcacatcatcagcttccctgacttt
ttatcataa

>gi568815591r:75669696_75872176|GENSCAN_predicted_peptide_4|119_aa
MAGLMTIVTSLLFLGVCAHHIIPTGSVVIPSPCCMFFVSKRIPENRVVSYQLSSRSTCLK
AGVIFTTKKGQQFCGDPKQEWVQRYMKNLDAKQKKASPRARAVAVKGPVQRYPGNQTTC

>gi568815591r:75669696_75872176|GENSCAN_predicted_CDS_4|360_bp
atggcaggcctgatgaccatagtaaccagccttctgttccttggtgtctgtgcccaccac
atcatccctacgggctctgtggtcatcccctctccctgctgcatgttctttgtttccaag
agaattcctgagaaccgagtggtcagctaccagctgtccagcaggagcacatgcctcaag
gcaggagtgatcttcaccaccaagaagggccagcagttctgtggcgaccccaagcaggag
tgggtccagaggtacatgaagaacctggacgccaagcagaagaaggcttcccctagggcc
agggcagtggctgtcaagggccctgtccagagatatcctggcaaccaaaccacctgctaa

>gi568815591r:75669696_75872176|GENSCAN_predicted_peptide_5|86_aa
MQGRRPSLPRCPSSRPILPAIPDLKEKEKEKEKEKEKEKEEEEEEEKKKKKKKKEEEEEE
EEEEEEEEEEEEEEEEEEEEEEEVVY

>gi568815591r:75669696_75872176|GENSCAN_predicted_CDS_5|261_bp
atgcaggggaggcgcccttccctccccaggtgcccctcctctcgtcccattcttcctgcc
atcccagatctgaaggagaaggagaaggagaaggagaaggagaaggagaaggagaaggag
gaggaggaggaggaggagaagaagaagaagaagaagaagaaggaagaagaagaagaagag
gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa
gaagaagaagtagtttactaa

>gi568815591r:75669696_75872176|GENSCAN_predicted_peptide_6|108_aa
MNSEMTAMPQPPNPPGNQLLFISSGHKGSLWTSWIVLKYQTLIFGNSAEFGTLKKAAVHY
DGSGRSLGSADMRFERKAHALKAMKQYYGTPLAGRPVNIQLVTSQIDT

>gi568815591r:75669696_75872176|GENSCAN_predicted_CDS_6|327_bp
atgaacagtgaaatgacggcgatgccccagccacccaacccccctgggaatcagcttctg
ttcatcagctcaggccacaaaggaagtctttggacaagctggatagttttgaagtatcag
acactgatatttgggaactctgcagaatttggaacgctgaagaaggcggctgtgcactac
gatggctctggccgcagcttaggatcagcagacatgcgctttgagcggaaggcacacgcc
ctgaaggccatgaagcagtactacggcacccctctggctggccgccctgtgaacattcag
cttgtcacatcacagattgatacataa

>gi568815591r:75669696_75872176|GENSCAN_predicted_peptide_7|41_aa
XELLFHGTGPFYIATNNVEYLQIGPLSDIHGKSIHRYTRLQ

>gi568815591r:75669696_75872176|GENSCAN_predicted_CDS_7|126_bp
naggaactgcttttccacggtactggaccattttacatcgccaccaacaatgtagagtac
ctgcagataggtcctttgtctgatattcacggcaagtcaatacacagatacaccaggttg
cagtag