GENSCAN 1.0	Date run: 4-Nov-116	Time: 22:55:36

Sequence gi568815593r:14697126_14898400 : 201275 bp : 44.19% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    239    287   49  1  1   86   58    47 0.250   0.62
 1.02 Intr +    374    416   43  0  1   73   76    67 0.297   1.30
 1.03 Intr +   8486   8594  109  2  1   54   74    81 0.236   3.59
 1.04 Intr +  10805  11005  201  1  0   45   95    71 0.777   2.98
 1.05 Term +  11439  11621  183  2  0   63   48   131 0.987   4.14
 1.06 PlyA +  12128  12133    6                              -0.45

 2.19 PlyA -  12532  12527    6                               1.05
 2.18 Term -  14185  14072  114  1  0   53   48   141 0.986   5.17
 2.17 Intr -  15848  15749  100  1  1   92   94   135 0.999  14.61
 2.16 Intr -  16542  16419  124  1  1   82   63   195 0.967  16.04
 2.15 Intr -  19710  19581  130  0  1  120  103   131 0.991  18.07
 2.14 Intr -  20815  20735   81  1  0   88   33    72 0.738   1.53
 2.13 Intr -  21263  21182   82  1  1   82  111     8 0.549   2.14
 2.12 Intr -  24321  24245   77  0  2   75   37   123 0.880   4.21
 2.11 Intr -  30585  30391  195  0  0   92   87    47 0.842   4.61
 2.10 Intr -  36813  36664  150  0  0   28   40   105 0.055   0.06
 2.09 Intr -  44797  44702   96  1  0  115  103   139 0.981  18.31
 2.08 Intr -  48837  48745   93  0  0  130  100    90 0.998  14.56
 2.07 Intr -  52181  52047  135  2  0  100   64    60 0.865   5.56
 2.06 Intr -  54114  53944  171  0  0  141   71   193 0.968  23.04
 2.05 Intr -  58819  58736   84  1  0   88   99     5 0.701   1.62
 2.04 Intr -  60966  60853  114  0  0   40   92    43 0.547   0.54
 2.03 Intr -  61473  61355  119  1  2   49  101    89 0.798   6.48
 2.02 Intr -  72066  71850  217  0  1  106   66   354 0.601  33.08
 2.01 Init -  81074  80916  159  2  0   70   77    88 0.028   5.73
 2.00 Prom -  94396  94357   40                              -1.46

 3.02 PlyA -  95321  95316    6                               1.05
 3.01 Sngl - 101170 100241  930  1  0   56   45   538 0.933  42.74
 3.00 Prom - 103921 103882   40                              -3.76

 4.00 Prom + 111737 111776   40                              -2.46
 4.01 Sngl + 138718 139776 1059  0  0   43   43   379 0.990  26.06
 4.02 PlyA + 139909 139914    6                               1.05

 5.06 PlyA - 141634 141629    6                               1.05
 5.05 Term - 152749 152655   95  2  2   53   42   156 0.441   5.49
 5.04 Intr - 163124 162902  223  2  1  100   37    62 0.109   0.00
 5.03 Intr - 174372 174227  146  1  2  107  115   160 0.512  20.60
 5.02 Intr - 175392 174856  537  1  0   98   11   146 0.082   0.79
 5.01 Init - 180792 180735   58  0  1   54   57    73 0.126   2.27
 5.00 Prom - 197223 197184   40                              -2.86

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  82294  82362   69  0  0  106   42   134 0.880   8.54
S.002 Init +  88314  88425  112  2  1   78   77   115 0.894   9.78
S.003 Sngl - 151529 151323  207  2  0  100   48   163 0.837   7.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:14697126_14898400|GENSCAN_predicted_peptide_1|194_aa
MGFYHVGQAGLELLASGYIEIYYQECRNGFCFRQFPDQYIIKQPKYLIKSKEQRICHCKA
CGCGSPGKCKQVAFCAPFKESQTGEFPANVQIRQYSLSHVAGRSNPTVTFQAQASSSSFC
SVVVSLTPFATEIPLDDGTRLEETQENMPVLSFASKLLGCDLSSPSEENVRRRITCFKVK
GLNKSCQISSIVVA

>gi568815593r:14697126_14898400|GENSCAN_predicted_CDS_1|585_bp
atggggttttaccatgttggccaggctggtctcgagctcctggcctcaggttatattgaa
atctactaccaggaatgtcggaatgggttttgctttagacagttccctgatcagtacatc
attaagcaaccaaaatatctcattaaatccaaggagcagcgtatttgccactgcaaggct
tgtggatgtggcagccctgggaaatgcaaacaagtggccttctgtgccccatttaaagaa
tcccaaacgggagagtttcctgccaatgtgcaaatccgtcaatactccctgagccatgtg
gccggcaggtccaatcctaccgtgactttccaagcgcaagcctcgtctagttctttttgc
tcagttgttgtctcactgacgccctttgccactgagatccctctggatgatgggactcga
ttagaagaaacgcaggagaatatgccagttctcagttttgcctcaaagttgttaggctgt
gacttaagcagcccaagtgaagaaaatgtacgaagaaggatcacgtgctttaaggtgaaa
ggtttgaacaagtcctgccaaatcagctccattgtggtggcctga

>gi568815593r:14697126_14898400|GENSCAN_predicted_peptide_2|746_aa
MHSYLQDCEDQDDPRPGAACALTFLLPSSGVARNFSCWVTAAVKGHAVPSQVEALNRGIA
AVKEDAVEMLASYGLAYSLMKFFTGPMSDFKNVGLVFVNSKRDRTKAVLCMVVAGAIAAV
FHTLIAYSDLGYYIINKLHHVDESVGSKTRRAFLYLAAFPFMDAMAESLSSECLGHILCI
RSSADGHLRCFHLWVLVTNAAMNAWTHAGILLKHKYSFLVGCASISDVIAQVVFVAILLH
SHLECREPLLIPILSLYMGALVRCTTLCLGYYKNIHDIIPDRSGPELGGDATIRKMLSFW
WPLALILATQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMPYGWLTEIRAVYP
AFDKNNPSNKLVSTSNTVTAAHIKKFTFVCMALSLTLKDSVQKPDISLTGRLVQTLPTRM
RHQRGESKDVAPLASWLSEPSTSSEASQTSSKLTINSQGEGKAKQKLECGTLSIVLRSCE
KNQERIKAAEKRNRSWTTLCGLGAWRPLLFELPVIVQTPDQTNRFQFRYPAKTQSGLCSF
FHSFDLLTLEAFVKVWFPGCLLILIEYASSKNRKSKMLQNLKLLSADMTLKLCFVMFWTP
NVSEKILIDIIGVDFAFAELCVVPLRIFSFFPVPVTVRAHLTGWLMTLKKTFVLAPSSVL
RIIVLIASLVVLPYLGVHGATLGVGSLLAGFVGESTMVAIAACYVYRKQKKKMENESATE
GEDSAMTDMPPTEEVTDIVEMREENE

>gi568815593r:14697126_14898400|GENSCAN_predicted_CDS_2|2241_bp
atgcacagctacctccaggactgcgaggatcaggatgaccccaggccgggtgctgcatgt
gccctcaccttcctgttgcccagctctggggttgccaggaatttcagctgctgggtgaca
gcagcagtcaaaggccatgcagtgccatctcaggtggaggccttgaaccggggcattgct
gctgtcaaggaggatgcagtcgagatgctggccagctacgggctggcgtactccctcatg
aagttcttcacgggtcccatgagtgacttcaaaaatgtgggcctggtgtttgtgaacagc
aagagagacaggaccaaagccgtcctgtgtatggtggtggcaggggccatcgctgccgtc
tttcacacactgatagcttatagtgatttaggatactacattatcaataaactgcaccat
gtggacgagtcggtggggagcaagacgagaagggccttcctgtacctcgccgcctttcct
ttcatggacgcaatggctgagtcactttccagtgaatgtctaggccacatcttgtgtatc
cgttcgtcagctgatggacacttgcgttgcttccacctttgggttcttgtgactaacgct
gctatgaacgcatggacccatgctggcattctcttaaaacacaaatacagtttcctggtg
ggatgtgcctcaatctcagatgtcatagctcaggttgtttttgtagccattttgcttcac
agtcacctggaatgccgggagcccctgctcatcccgatcctctccttgtacatgggcgca
cttgtgcgctgcaccaccctgtgcctgggctactacaagaacattcacgacatcatccct
gacagaagtggcccggagctggggggagatgcaacaataagaaagatgctgagcttctgg
tggcctttggctctaattctggccacacagagaatcagtcggcctattgtcaacctcttt
gtttcccgggaccttggtggcagttctgcagccacagaggcagtggcgattttgacagcc
acataccctgtgggtcacatgccatacggctggttgacggaaatccgtgctgtgtatcct
gctttcgacaagaataaccccagcaacaaactggtgagcacgagcaacacagtcacggca
gcccacatcaagaagttcaccttcgtctgcatggctctgtcactcacgcttaaagacagt
gtacagaaaccagacatcagcctgacggggcgcttagtccagacactcccgacacgcatg
cgccatcaacgtggagaaagcaaagatgtggcgcctctggcctcttggctttccgagccc
agcaccagctcggaggcttcccaaacaagttccaagttgactattaatagtcaaggagaa
ggaaaggctaaacagaagctggagtgtggaactttatcaatagtcctcagatcctgtgag
aaaaaccaggaaaggattaaagctgcagagaaaaggaataggagttggacaaccctctgc
gggcttggcgcatggcgccccctgctgtttgagttacctgtgattgttcaaaccccagat
cagactaaccgcttccagttccgataccccgcgaagacccagtctgggctttgtagtttt
ttccatagttttgacctcctcacattggaggcctttgtcaaagtctggtttcctggttgc
ctgctcattttgattgagtatgcctcatccaaaaatcgaaaatccaaaatgctccaaaat
ctgaaacttttgagcgctgacatgactctcaaactctgtttcgtgatgttttggacaccc
aacgtgtctgagaaaatcttgatagacatcatcggagtggactttgcctttgcagaactc
tgtgttgttcctttgcggatcttctccttcttcccagttccagtcacagtgagggcgcat
ctcaccgggtggctgatgacactgaagaaaaccttcgtccttgcccccagctctgtgctg
cggatcatcgtcctcatcgccagcctcgtggtcctaccctacctgggggtgcacggtgcg
accctgggcgtgggctccctcctggcgggctttgtgggagaatccaccatggtcgccatc
gctgcgtgctatgtctaccggaagcagaaaaagaagatggagaatgagtcggccacggag
ggggaagactctgccatgacagacatgcctccgacagaggaggtgacagacatcgtggaa
atgagagaggagaatgaataa

>gi568815593r:14697126_14898400|GENSCAN_predicted_peptide_3|309_aa
MTHALEWPSLTAQWLPDVTRPEGKDFSLHRLVLGTHTSDEQNHLVIASVQLPDDDAQFDA
SHYDREKGEFGGFGSVSGKIEIEIKINHEGKVNRARYMPQNPCIITTKTPSSDILVFDYT
KHPSKPDPSGECNPDLRLRGHQKEGYGLSWNLNLSGHLLSASDDHTICLWDISAVPKEGK
VVDAKTIFTGHTAIVEDVSWHLLCESLFGSVADDQKLMIWDTRSNNTSKPSHSVDTHTAE
VNCLSFSPYSEFILATGSADKTVALWDLRNLKLKLHSFESHKDEIFQVQWSPHNETILAS
SGTDHRPNV

>gi568815593r:14697126_14898400|GENSCAN_predicted_CDS_3|930_bp
atgacccatgctctggagtggcccagcctaactgcccagtggcttccagatgtaaccaga
ccagaagggaaagatttcagccttcatcgacttgtcctggggacacacacatcggatgaa
caaaaccatcttgttatagccagtgtgcagctgcctgatgatgatgctcagtttgatgca
tcacactacgacagagagaaaggagaatttggaggttttggttcagttagtggaaaaatt
gaaatagaaatcaagatcaaccatgaaggaaaagtaaacagggcccgttatatgccccag
aacccttgtatcatcacaacaaagactccttccagtgatattcttgtctttgactataca
aaacatccttctaaaccagatccttctggagaatgcaacccagacttgcgtctccgtgga
catcagaaggaaggctatgggctttcttggaacctaaatctcagtgggcacttacttagt
gcttcagatgaccacaccatctgcctgtgggacatcagtgccgttccaaaggagggaaaa
gtggtagatgcgaagaccatctttacagggcatacggcaatagtagaagatgtttcctgg
catctactctgtgagtctctgtttgggtcagttgctgatgatcagaaactgatgatttgg
gatactcgttcaaacaatacttccaaaccaagccactcagttgacactcacactgctgaa
gtgaactgcctttctttcagtccttatagtgagttcattcttgccacaggatcagctgac
aagactgttgccttgtgggatctgagaaatctgaaacttaagttgcattcctttgagtca
cataaggatgaaatattccaggttcagtggtcacctcacaatgagactattttagcttcc
agtggtactgatcacagaccgaatgtctag

>gi568815593r:14697126_14898400|GENSCAN_predicted_peptide_4|352_aa
MNINAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIHHINRTKDKNHM
IISIAAEKAFDKIQQPFMLKTLNKLGIDGTYLKIIRAIYDKPTANIILNGQKLEAFPLKT
GTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMIVYLENPIISA
PNLLKLISNFSKASGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKRIKYLGIQLTR
DVKDLFKEDYKPLLNKIKEDTNKWKNIPCSWIGRINIVKMAILPKVIYRFNAIPIKLPMT
FFTELEKTTLKFIWNQKRAHIAKTILSQKNKAGGIMLPDFELYYKATVTKTA

>gi568815593r:14697126_14898400|GENSCAN_predicted_CDS_4|1059_bp
atgaacatcaatgcaaaaatcctcaataaaatactggcaaatcgaatccagcagcacatc
aaaaagcttattcaccacgatcaagttggcttcatccctgggatgcaaggctggttcaac
atatgcaaatcaataaacgtaatccatcacataaacagaaccaaagacaaaaaccacatg
attatctccatagctgcagaaaaggcctttgacaaaattcaacagcccttcatgctaaaa
actctcaataaactaggtattgatgggacgtatctcaaaataataagagctatttatgac
aaacccacagccaatatcatactgaatgggcaaaaactggaagcattccctttgaaaact
ggcacaagacagggatgccctctctcaccactcctattcaacatagtgttggaagttctg
gccagagcaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtg
aaattgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcc
ccaaatctgcttaagctgataagcaacttcagcaaagcctcaggatacaaaatcaatgtg
caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt
gaactcccattcacaattgcttcaaagagaatcaaatacctaggaatccaacttacaagg
gatgtgaaggacctcttcaaggaggactacaaaccactgctcaacaaaataaaagaggac
acaaacaaatggaagaacattccatgctcatggataggaagaatcaatattgtgaaaatg
gccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgact
ttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccac
attgccaagacaatcctaagccaaaagaacaaagctggaggcatcatgctacctgacttc
gaactatactacaaggctacagtaaccaaaacagcatga

>gi568815593r:14697126_14898400|GENSCAN_predicted_peptide_5|352_aa
METEQPEETFPNIEINSDSGTRATVAPGNPGGQAPSWATFPRFFRPPRSGTSIPPGPLVP
RSRPQPRRKLPPPQSPPAASPATAGRRPSQPRAPLPSPPRASEPTSLAPAAKSSRRPGPP
GTAREPRPPAPRDEGYRARPASQGTPGSPGGPPPSPPTGVPKPPDRPRRALSAAAPGPAE
APRRVPPSAPEAPAPRPEQSPRGSRCVWGQPTAGTMVKFPALTHYWPLIRFLVPLGITNI
AIDFGEQVSVFECPSLTAFWRETLFLWDGFGCSLASPEKTPGLEPGVQLVVPHLYPHAEA
FHHLRLMPIGLSPKRGQISWHSLKVHMEDHLLLRMHIRVTVTSSVCLDDNVT

>gi568815593r:14697126_14898400|GENSCAN_predicted_CDS_5|1059_bp
atggaaactgaacagccagaggaaaccttccctaacatcgaaatcaacagtgactctggg
acacgggccaccgttgcccctggaaaccctgggggacaggctccctcctgggcgaccttt
ccacgtttctttcgccccccccggagcggcacctccatacctcccgggccgctagtcccc
cgaagcaggccccagccccggcgcaagctgcctcctccgcagtccccgcccgccgccagc
ccggccaccgccggacggcgcccttcccagccccgcgccccgctccccagccctcctcgc
gcgtccgagcccacgtcgcttgccccggctgccaagtcttccagacggccgggccctccc
ggcacagcgcgggaaccccggcccccggccccacgggacgagggctacagggcccggccg
gcgagccagggcaccccggggtctccaggcggcccaccgccctcaccccccaccggcgtc
cccaagcccccagaccgtccccgccgcgccctcagcgccgccgcccccgggccggccgag
gctccgcggcgagtcccgcccagcgccccggaggcgccagccccacggcccgagcagtcc
cctcgcggcagcagatgtgtgtggggtcagcccacggcggggactatggtgaaattcccg
gcgctcacgcactactggcccctgatccggttcttggtgcccctgggcatcaccaacata
gccatcgacttcggggagcaggtttctgtgttcgaatgtccctccctcaccgccttctgg
agagagaccttgtttctttgggatggttttggatgcagccttgcatcacctgagaagaca
cctggcctggagcctggcgtgcagctggttgtacctcatttgtacccccacgcagaggca
ttccaccacctgcggctcatgcctattgggctgtctcctaaacgtgggcagatttcctgg
cattccctgaaagtccacatggaagaccaccttcttttgaggatgcacatccgagtgacc
gtgacaagcagcgtctgcctggatgacaatgtgacttaa