GENSCAN 1.0    Date run: 5-Nov-116    Time: 13:07:12

Sequence gi568815593f:21782587_21984309 : 201723 bp : 35.16% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   1183   1304  122  0  2   75   48    69 0.331   1.31
 1.02 Term +   2994   3162  169  0  1   29   49   161 0.567   2.67
 1.03 PlyA +   4538   4543    6                               1.05

 2.00 Prom +  11736  11775   40                              -3.35
 2.01 Init +  14576  14662   87  1  0   33   24   276 0.994  16.39
 2.02 Intr +  19607  19780  174  1  0   57   53   262 0.458  18.81
 2.03 Term +  33609  33674   66  2  0  117   39   104 0.272   5.36
 2.04 PlyA +  34039  34044    6                              -0.45

 3.06 PlyA -  34301  34296    6                               1.05
 3.05 Term -  34546  34320  227  2  2   63   38   230 0.944  11.46
 3.04 Intr -  59742  59575  168  1  0   77   53   176 0.493  12.00
 3.03 Intr -  72187  72085  103  1  1   50   99   113 0.163   7.43
 3.02 Intr -  77127  76996  132  0  0   35  101   121 0.757   8.02
 3.01 Init -  80728  80690   39  1  0   75   69    65 0.409   3.56
 3.00 Prom -  84073  84034   40                              -6.35

 4.00 Prom +  85433  85472   40                              -6.55
 4.01 Init +  88452  88519   68  2  2   63   92    49 0.039   3.40
 4.02 Intr +  99988 100129  142  1  1   64   28   197 0.007  10.73
 4.03 Intr + 100134 100593  460  2  1  -11    0   585 0.167  31.63
 4.04 Term + 100705 101726 1022  2  2   11   35  1110 0.466  89.28
 4.05 PlyA + 101944 101949    6                               1.05

 5.04 PlyA - 103285 103280    6                               1.05
 5.03 Term - 109763 109677   87  2  0   71   40    93 0.111  -0.52
 5.02 Intr - 113265 112991  275  1  2   60   59   141 0.017   4.73
 5.01 Init - 127757 127682   76  2  1   66   73    43 0.018   1.90
 5.00 Prom - 135480 135441   40                              -3.55

 6.00 Prom + 146380 146419   40                              -4.55
 6.01 Sngl + 165344 165763  420  1  0   57   38   208 0.954   8.75
 6.02 PlyA + 166535 166540    6                               1.05

 7.04 PlyA - 166586 166581    6                               1.05
 7.03 Term - 169367 169263  105  2  0   70   49    76 0.064  -0.67
 7.02 Intr - 183716 183664   53  0  2   94  116    -4 0.017   0.71
 7.01 Intr - 192799 192505  295  1  1  110   92   369 0.552  35.16

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  35651  35612   40  2  1   88  111    11 0.926   3.80
S.002 Init +  83686  83896  211  0  1   49   57   172 0.833   9.19
S.003 Term +  86415  86536  122  1  2  101   48    74 0.845   2.36
S.004 Init + 130085 130146   62  1  2   74  100    37 0.844   4.27

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:21782587_21984309|GENSCAN_predicted_peptide_1|96_aa
MSFLNLLNIKTFSSSKKFLNSGYDCVNGVERSTGNNYKEPRETLAVWIEDKASHNIPLSQ
SLTQNIALAVFCSMKVERSKETAEEKCELAELVHEV

>gi568815593f:21782587_21984309|GENSCAN_predicted_CDS_1|291_bp
atgagtttccttaacctgctgaatattaaaactttttcaagcagtaaaaaattccttaac
agtggctatgactgtgtaaatggggttgagagaagcactggaaacaactacaaagaaccc
agagaaactttagcggtctggatagaagataaagccagtcacaacattcctttaagccaa
agtctaacccagaatatagccctagctgtcttctgttctatgaaggtggagagaagtaag
gaaactgcagaagaaaagtgtgagctagcagagttggttcatgaggtttaa

>gi568815593f:21782587_21984309|GENSCAN_predicted_peptide_2|108_aa
MWLFAVLSDDDDDDDDDEDDEDDDDKIPKVLSSDSANDGPYRSVFINLHGVERLAENRWL
IYVQHADLHRSCVFERARRVKPVVKVKPTQCEDDEDEDLYDDPLVLNK

>gi568815593f:21782587_21984309|GENSCAN_predicted_CDS_2|327_bp
atgtggctctttgctgtgcttagtgatgatgatgatgatgatgatgatgatgaagatgat
gaagatgatgatgataaaatacccaaggtcttgagcagtgacagcgccaatgatggtccc
taccggagtgtcttcataaacctccatggtgtagagcggcttgctgaaaaccggtggctc
atctacgtccagcacgctgatcttcaccgtagctgtgtctttgaaagggcccgccgagtg
aaaccggtggtcaaggtgaagcctactcaatgtgaagatgatgaggatgaagacctttat
gatgatccactagtgcttaacaaatag

>gi568815593f:21782587_21984309|GENSCAN_predicted_peptide_3|222_aa
MDLDINGEANVLQFNGIIVVSPGVSISPSGTKTTRPAECNVAETGSKKFASGSLGVMVKA
TDADDPTYGNSARVVYSILQGQPYFSIDPKTGVIRTALPNMDREVKEQYQVLIQAKDMGG
QLGGLAGTTIVNITLTDVNDNPPRFPKSIFHLKVPESSPIGSAIGRIRAVDPDFGQNAEI
EYNIVPGDGGNLFDIVTDEDTQEGVIKLKKVYRQMLGQLTTK

>gi568815593f:21782587_21984309|GENSCAN_predicted_CDS_3|669_bp
atggacctggacatcaacggtgaggcaaatgtcctccagtttaatggaatcattgttgta
tctcctggtgtcagcatttctccctctggaactaagactactaggccagcagaatgtaat
gtcgcagaaacaggaagcaagaagtttgctagtggatcactaggggtgatggtcaaggcc
acagatgcagatgacccgacctatggaaacagtgccagagtcgtttacagcattcttcag
ggacaaccttatttctctattgatcccaagacaggtgttattagaacagctttgccaaac
atggacagagaagtcaaagaacaatatcaagtactcatccaagccaaggatatgggagga
cagcttggaggattagccggaacaacaatagtcaacatcactctcaccgatgtcaatgac
aatccacctcgattccccaaaagcatcttccacttgaaagttcctgagtcttcccctatt
ggttcagctattggaagaataagagctgtggatcctgattttggacaaaatgcagaaatt
gaatacaatattgttccaggagatgggggaaatttgtttgacatcgtcacagatgaggat
acacaagagggagtcatcaaattgaaaaaggtatatagacaaatgttgggacagttgact
actaaataa

>gi568815593f:21782587_21984309|GENSCAN_predicted_peptide_4|563_aa
MTNNTNSWALIRQFNLIVLQCDSRPTEMLRLPTVFRQMRPVSRVLAPHLTRAYAKDVKFG
ADARALMLQGVDLLADAVAVTMEPKGRTVIIEQSWGSPNVTKDGVTVAKSIDLKDKYKNI
GAKLVQDVANNTNEESGDGTTTATVLAGSIAKEGFQKISKGANPVEIRRGVMLAVDAVIA
ELKKQSKPVTTPEEIAQVAMISANGDKEIGNIISDAMKKVGRKGQKCEFQDAYVLLSEKK
ISSVQSIVPALEIANAHHKPLVIIAEDVDGEALSTLILNRLKVGLQVVAVKAPGFGDNRK
NQLKDMAIATGGAVFGEEGLTLNLEDVQPHDLGKVGEVIVTKDDAMLLKGKGDKAQLEKR
IQEIIGQLDVTTSEYEKEKLNEWLAKLSDGVVVLKFGGTSDVEVNEKKDRVTDALNATRA
AVEGGIVLGGGFALLRCIPALDSLTPANEDQKIGMEIIKRTLKIPAMTTATNAGVEGSLI
VEKIMQNSSEVGYDAMVGDFMNMVEKGIIDPTKLVRTALLDAAGVASLLTTAEVVVTEIP
KEEKDPGMGAMGGMGGGMGGGMF

>gi568815593f:21782587_21984309|GENSCAN_predicted_CDS_4|1692_bp
atgactaacaacaccaattcttgggctctcatcagacagtttaatttaattgttcttcag
tgtgacagccgccccacagaaatgcttcggttacccacagtctttcgccagatgagacca
gtgtccagggtactggctcctcatctcactcgagcttatgccaaagatgtaaaatttggt
gcagatgcccgagccttaatgcttcaaggtgtagaccttttagccgatgctgtggccgtt
acaatggagccaaagggaagaacagtgattattgagcagagctggggaagtcccaacgta
acaaaagatggtgtgactgttgcaaagtcaattgacttgaaggataaatataaaaacatt
ggagctaaacttgttcaagatgttgccaataacacaaatgaagaatctggggatggcact
accactgctactgtactggcaggctctatagccaaggaaggcttccagaagattagcaaa
ggtgctaatccagtggaaatcaggagaggtgtgatgttagctgttgatgctgtaattgct
gaacttaaaaagcagtctaaacctgtgaccacccctgaagaaattgcacaggttgctatg
atttctgcaaatggagacaaagaaattggcaatatcatctctgatgcaatgaaaaaggtt
ggaagaaagggtcagaaatgtgaattccaggatgcctatgttctgttgagtgaaaagaaa
atttctagtgtccagtccattgtacctgctcttgaaattgccaatgctcaccataagcct
ttggtgataatcgctgaagatgttgatggagaagctctaagtacactcatcttgaatagg
ctaaaggttggtcttcaggttgtggcagtcaaggctccagggtttggtgacaatagaaag
aaccaacttaaagatatggctattgctactggtggtgcagtgtttggagaagaggggttg
accctgaatcttgaagatgttcagcctcatgacttaggaaaagttggagaggtcattgtg
accaaagacgatgccatgctcttaaaaggaaaaggtgacaaggctcaacttgaaaaacgt
attcaagaaatcattgggcagttagatgtcacaactagtgaatatgaaaaggaaaaactg
aatgaatggctggcaaaactttcagatggagtagttgtgctgaagtttggtgggacaagt
gatgttgaagtgaatgaaaagaaagacagagttacagatgcccttaatgctacaagagct
gctgttgaaggaggcattgttttgggagggggttttgccctccttcgatgcattccagcc
ttggactcattgactccagctaatgaagatcaaaaaattggtatggaaattattaaaaga
acactcaaaattccagcaatgaccactgctacgaatgcaggtgttgaaggatctttgata
gttgagaaaattatgcaaaattcctcagaagttggttatgatgctatggttggagatttt
atgaatatggtagaaaaaggaattattgacccaacaaagcttgtgagaactgctttattg
gatgctgctggtgtggcctctctgttaactacagcagaagttgtagtcacagaaattcct
aaagaagagaaggaccctggaatgggtgcaatgggtggaatgggaggtggtatgggaggt
ggcatgttctaa

>gi568815593f:21782587_21984309|GENSCAN_predicted_peptide_5|145_aa
MGPDDILGASEFDVLDARLLLTFLLCLGDLPICTLMHSYLKEASFLLIFTFNHEGYYLGT
WAGLDSLQNGRIQLQSLTGWGAVAPLGSDTLLVAQGCLTSAAELSPEKEKRSCVANKESS
YQTTRQLITMIDNHITLSLAEHLHD

>gi568815593f:21782587_21984309|GENSCAN_predicted_CDS_5|438_bp
atgggtcccgatgacatccttggagcttctgaatttgatgttcttgatgccaggctgttg
ctgacttttttgttatgcctgggggatcttcctatctgtaccctcatgcacagttatttg
aaggaagccagtttcctgctgatctttaccttcaatcatgaaggctactacctgggcact
tgggcagggctagattctctccagaatggtagaattcaactccagtccctgactgggtgg
ggggcagtggctcctctgggctctgacacacttcttgtggcacaaggctgtctaacatca
gcagctgagctgagccctgagaaagaaaagcgcagctgtgtggctaacaaggaaagttca
tatcaaacaacaagacagctcatcaccatgattgataaccacatcactctgagccttgcc
gaacatctacatgactaa

>gi568815593f:21782587_21984309|GENSCAN_predicted_peptide_6|139_aa
MYSSPVDSEGASPKPWRIPYDVGPEGTQKSRIQVWKPLPRFHRLYGNAWMSRPRLAAGVE
PSWRTSARVVQKGNVGLELPHRVPTGALPSEAVRRGPLFSRPLNGRSTNSLYCAPGKATD
TQCQPVKELPKAMGAHPLH

>gi568815593f:21782587_21984309|GENSCAN_predicted_CDS_6|420_bp
atgtacagctcacctgttgattcagagggtgcaagccccaagccttggaggattccatat
gatgttgggcctgagggtacacagaagtcaagaattcaggtgtggaaacctctgcctaga
tttcacaggttgtatggaaatgcctggatgtccaggccgagattggctgcaggggtggag
ccatcatggagaacctctgctagggtagtgcagaagggaaatgtggggttggagctccca
cacagagtccccactggggcactgcctagtgaagctgtgagaagagggccactgttctcc
agacccctgaatggaagatccaccaacagcttgtactgtgcacctggaaaagccacagac
actcaatgccagcctgtgaaggaactgcccaaggccatgggagcccaccccttgcattaa

>gi568815593f:21782587_21984309|GENSCAN_predicted_peptide_7|150_aa
LHSDLDKGEGTVKYTLSGDGAGTVFTIDETTGDIHAIRSLDREEKPFYTLRAQAVDIETR
KPLEPESEFIIKVQDINDNEPKFLDGPYVATVPEMSPVEKYVDPLHSGLSLREEIKFSSN
RNKQTLFKSFSSLNEDVPRVGVDELSLTLA

>gi568815593f:21782587_21984309|GENSCAN_predicted_CDS_7|453_bp
ctccattccgacttagacaagggagagggcactgtgaaatacaccctctcaggagatggc
gctggcaccgtttttaccattgatgaaaccacaggggacattcatgcaataaggagccta
gatagagaagagaaacctttctacactcttcgtgctcaggctgtggacatagaaaccaga
aagcccctggagcctgaatcagaattcatcatcaaagtgcaggatattaatgataatgag
ccaaagtttttggatggaccttatgttgctactgttccagaaatgtctcctgtggaaaag
tatgttgacccattacatagtggattgagtttgagagaggagattaagttcagttctaac
aggaataaacagacgctattcaaaagcttttcaagtctgaatgaagatgtacctagggtt
ggtgttgatgaactttccctcactcttgcttaa