GENSCAN 1.0 Date run: 6-Nov-116 Time: 08:57:19 Sequence gi568815585f:49915494_50120451 : 204958 bp : 39.10% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 6087 5985 103 0 1 105 103 17 0.236 3.73 1.03 Intr - 12592 12426 167 2 2 62 89 148 0.818 11.06 1.02 Intr - 15641 15525 117 0 0 54 77 67 0.717 1.72 1.01 Init - 20742 20637 106 0 1 91 82 131 0.988 11.27 1.00 Prom - 36271 36232 40 -1.85 2.04 PlyA - 36728 36723 6 1.05 2.03 Term - 40029 39458 572 1 2 19 48 248 0.736 7.51 2.02 Intr - 40960 40718 243 2 0 -209 53 428 0.670 6.15 2.01 Init - 41895 41799 97 0 1 78 66 53 0.321 2.62 2.00 Prom - 42943 42904 40 -6.25 3.06 PlyA - 43017 43012 6 1.05 3.05 Term - 62881 62754 128 2 2 133 43 86 0.834 6.06 3.04 Intr - 73567 73423 145 1 1 57 99 44 0.552 1.33 3.03 Intr - 78468 78300 169 2 1 67 108 164 0.980 15.33 3.02 Intr - 81834 81581 254 0 2 87 88 314 0.236 26.61 3.01 Init - 86003 85989 15 2 0 30 111 5 0.311 -2.63 3.00 Prom - 86125 86086 40 -6.45 4.00 Prom + 91385 91424 40 -4.75 4.01 Init + 96448 97569 1122 0 0 63 61 672 0.043 56.26 4.02 Intr + 100481 100578 98 1 2 87 68 49 0.333 0.69 4.03 Term + 104721 104961 241 0 1 109 45 168 0.999 9.11 4.04 PlyA + 105395 105400 6 1.05 5.00 Prom + 107679 107718 40 -5.65 5.01 Init + 136366 136612 247 0 1 114 86 23 0.549 2.51 5.02 Term + 137536 137633 98 2 2 115 39 85 0.626 3.45 5.03 PlyA + 139587 139592 6 1.05 6.00 Prom + 159430 159469 40 -4.15 6.01 Init + 165890 166086 197 1 2 79 38 194 0.604 11.95 6.02 Term + 166678 166786 109 1 1 87 45 93 0.329 1.90 6.03 PlyA + 167573 167578 6 1.05 7.05 PlyA - 167604 167599 6 1.05 7.04 Term - 171562 171443 120 1 0 84 43 84 0.163 0.99 7.03 Intr - 184048 183957 92 2 2 76 75 43 0.121 0.59 7.02 Intr - 184351 184181 171 2 0 -1 100 180 0.114 9.29 7.01 Init - 198106 198001 106 1 1 47 116 44 0.298 3.53 7.00 Prom - 200725 200686 40 -2.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 96448 97671 1224 0 0 63 38 683 0.870 56.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:49915494_50120451|GENSCAN_predicted_peptide_1|165_aa MATSVLCCLRCCRDGGTGHIPLKEMPAVQLDTQHMGTDVVIVKNGRRICGTGGCLASAPL HQNKSYFEFKIQSTGIWGIGVATQKVNLNQIPLGRDMHSLVMRNDGALYHNNEEKNRLPA NSLPQEGDVVGITYDHVELNVYLNGKNMHCPASGIRGTVYPVVYX >gi568815585f:49915494_50120451|GENSCAN_predicted_CDS_1|495_bp atggccacctcggtgttgtgctgcctgcggtgctgcagagacggggggactggccacatc cctctgaaggagatgccggccgtgcagctggacacgcagcacatgggaacagatgttgtt attgtaaagaatggaagaagaatatgtggaacaggaggttgtttagccagcgcaccttta catcaaaacaaaagctattttgaattcaaaatccagtccacaggaatctggggtattggt gttgcaactcagaaggttaacttgaatcagattcctcttggccgagatatgcacagtctg gtgatgagaaatgatggagccctttaccacaacaatgaagagaaaaataggctgccagca aacagtcttccgcaggaaggagatgtggtgggtattacttatgaccatgtcgaattaaat gtatacttgaatggaaaaaacatgcattgtccagcatcaggtatacgagggacagtgtat ccagttgtttatgnn >gi568815585f:49915494_50120451|GENSCAN_predicted_peptide_2|303_aa MISLVDLKLCWSNNGHSLCKEESLSPILPTSDGEERRRKRKKRKRKKRKQKQQQQQEAAA PAEADAEATAAEAAAEEAAARKEEEEEGEEGEGNGEEEEEKAAAAARGDWSQQEKACKSS LQAILFGHAKLEPGRLPRPTQCRVEATVLVGTNSPLDNPLWSISWSLWQKPVGTSQQQPL GLWTREFPLEGHLLTRNETFTEATPLIPEIGMLSQMMSEEHSNGDGSAQKSSMTQWKWFI EDHATQGMQRISFPLGLTLELCEKLLDSTVPNKQLSTDQTRASWFMDGNSKAQNARKNCS VSQ >gi568815585f:49915494_50120451|GENSCAN_predicted_CDS_2|912_bp atgatctccctggttgatcttaagctctgctggtcaaataacggacattccttgtgtaaa gaggagagcctctcaccaatcctccctaccagtgatggagaagaaaggagaagaaaaagg aagaagaggaagaggaagaagaggaagcagaagcagcagcagcagcaagaagcagcagca ccagcagaagcagatgcagaagcaacagcagcagaagcagcagctgaagaagcagcagca agaaaggaggaggaggaggaaggggaggagggggaggggaatggggaggaggaggaggag aaggcagcggcggcagcaagaggggactggtcccagcaggagaaggcatgtaaatcgtcc cttcaagcaatattatttggacatgctaagctggaaccagggagactgcccaggcccaca cagtgcagagtagaagctacggtgctggtagggacaaattctccacttgataaccctcta tggagcatttcctggagcttatggcaaaagcctgtgggcacctcccagcaacaaccactg ggactttggactagagaatttccacttgaagggcatttactgactcgcaatgaaacattt actgaagccaccccactgatacctgaaataggcatgctgtctcagatgatgtcagaggaa cattctaatggggatggcagtgcccagaagagttccatgacacaatggaaatggtttata gaggatcatgctactcaaggcatgcagagaatttcttttcccctaggactgactctggaa ctgtgtgagaaactgctggattctacagtgccaaataaacaactttcaactgaccaaaca agagcttcttggtttatggatggcaattccaaggcacaaaatgctaggaagaactgttct gtttctcagtaa >gi568815585f:49915494_50120451|GENSCAN_predicted_peptide_3|236_aa MFTSLAAPPQAHDSAPLVSAALPDWSSDGLRQPPQPTQIPADLGNSPPSLKVRGPPRFPE RSRAAGTYLRPPASGKGRATARPPTTGTARTATGKLPLEKDDCPTKTCIHSYILPWKSTV ELDPSTDSTGIVNILVTLKFPSTDLQAFGLALVLPEAHTASSSRAPAPLVPSLTRPSPAT SHPHPPTIAQLTVSGRASKTDGSEVLHLRAALALRAGQGLSGPLTGPQDCPFPQHA >gi568815585f:49915494_50120451|GENSCAN_predicted_CDS_3|711_bp atgttcactagcttggccgctcctccccaggcccatgacagtgccccgctggtttctgcc gcgctgccggactggagctcagacggccttcgccagcccccccaacccacgcagatccct gctgacctgggcaactccccaccctcgctgaaggttcgaggaccaccccgctttcccgag aggagccgggcggcgggtacttatctccgacctccggctagtgggaaaggccgcgcgacc gcccgtcctccaactacagggaccgcacgaacagcgactgggaaactgccactagaaaaa gatgactgtccaactaaaacctgcatacacagctacatccttccctggaagagcacagtg gaactagatcctagtacagatagcacagggattgtcaacattcttgtcacactgaaattt ccttctactgatctccaagcatttgggttagcccttgtgttacccgaggctcatactgct tccagcagcagggctcctgcacccttagttccttccctaacgaggccctcaccagccacc tcccatccccatcctccaaccatagcacaactgacagtctcaggtagggcctctaaaacc gatggaagtgaagtgctacacctcagagctgctttagcccttagagctggccagggtctg agtgggcctctcacagggccacaggactgccctttcccccagcatgcctag >gi568815585f:49915494_50120451|GENSCAN_predicted_peptide_4|486_aa MELLEEDLTCPICCSLFDDPRVLPCSHNFCKKCLEGILEGSVRNSLWRPAPFKCPTCRKE TSATGINSLQVNYSLKGIVEKYNKIKISPKMPVCKGHLGQPLNIFCLTDMQLICGICATR GEHTKHVFCSIEDAYAQERDAFESLFQSFETWRRGDALSRLDTLETSKRKSLQLLTKDSD KVKEFFEKLQHTLDQKKNEILSDFETMKLAVMQAYDPEINKLNTILQEQRMAFNIAEAFK DVSEPIVFLQQMQEFREKIKVIKETPLPPSNLPASPLMKNFDTSQWEDIKLVDVDKLSLP QDTGTFISKIPWSFYKLFLLILLLGLVIVFGPTMFLEWSLFDDLATWKGCLSNFSSYLTK TADFIEQSVFYWEQMTLLPLPPQRPSYHDLVFQCGSDSTTDNQTGVRYVSIKPDNRKLAN GTNVLGLLIDTLLKEGFHLVSTRTVSSEDKTECYSFERIKSPEVLITNETPKPETIIIPE QSQIKK >gi568815585f:49915494_50120451|GENSCAN_predicted_CDS_4|1461_bp atggagctgcttgaagaagatctcacatgccctatttgttgtagtctgtttgatgatcca cgggttttgccttgctcccacaacttctgcaaaaaatgcttagaaggtatcttagaaggg agtgtgcggaattccttgtggagaccagctccattcaagtgtcctacatgccgtaaggaa acttcagctactggaattaatagcctgcaggttaattactccctgaagggtattgtggaa aagtataacaagatcaagatctctcccaaaatgccagtatgcaaaggacacttggggcag cctctcaacattttctgcctgactgatatgcagctgatttgtgggatctgtgctactcgt ggggagcacaccaaacatgtcttctgttctattgaagatgcctatgctcaggaaagggat gcctttgagtccctcttccagagctttgagacctggcgtcggggagatgctctttctcgc ttggataccttggaaactagtaagaggaaatccctacagttactgactaaagattcagat aaagtgaaggaattttttgagaagttacaacacacactggatcaaaagaagaatgaaatt ctgtctgactttgagaccatgaaacttgctgttatgcaagcatatgacccagagatcaac aaactcaacaccatcttgcaggagcaacggatggcctttaacattgctgaggctttcaaa gatgtgtcagaacccattgtatttctgcaacagatgcaggagtttagagagaaaatcaaa gtaatcaaggaaactcctttacctccctctaatttgcctgcaagccctttaatgaagaac tttgataccagtcagtgggaagacataaaactagtcgatgtggataaactttctttgcct caagacactggcacattcattagcaagattccctggagcttttataagttatttttgcta atccttctgcttggccttgtcattgtctttggtcctaccatgttcctagaatggtcatta tttgatgacctggcaacttggaaaggctgtctttcaaacttcagttcctatctgactaaa acagccgatttcatagaacaatcagttttttactgggaacagatgaccttacttccactg cctccacaaagaccttcttaccatgacctggttttccagtgtggttctgacagcactact gataaccaaactggagtcaggtatgtttctataaaacctgataaccgaaaattggccaac ggaacaaatgtcctcggcttactgattgacactttattaaaggaaggctttcatttggtc agcactagaacagtatcttctgaagacaaaactgaatgctatagctttgaaaggataaaa agccctgaagtgctcatcacgaatgaaacaccaaaaccagagactatcatcataccagag caatctcagataaagaaatga >gi568815585f:49915494_50120451|GENSCAN_predicted_peptide_5|114_aa MPSPGWVFNADKYALFWKKMPQRTFISKEEKQAPGFKAGRDRQTLLFCANAVGLLPLAIK LLIPEPQREKTPAASLLVVQQEETAKPTPFLPQPAQCENSEDEGLYDDPLPFNQ >gi568815585f:49915494_50120451|GENSCAN_predicted_CDS_5|345_bp atgcccagcccaggatgggtttttaatgctgataaatatgccctattctggaagaaaatg ccacaaaggacatttattagtaaggaagaaaagcaagcaccaggatttaaggcaggaaga gataggcaaactctactgttctgtgcaaatgcagtcggtttgctgcccttagccataaag ctgctaatccctgagcctcaaagggaaaaaacaccagcagccagtcttttggttgtacaa caagaagagacagcaaaaccaaccccttttcttcctcagcctgctcaatgtgaaaacagt gaagatgaaggcctgtatgatgatccacttccatttaatcaatag >gi568815585f:49915494_50120451|GENSCAN_predicted_peptide_6|101_aa MTAITGSRKRREKAARFGNGGGARVQSSCQGAAACRDPAVRRPRLRSRAGGLADRSAGRR LLRRSRHMRRIIVVHGSPFASSVAVLLLLARACSAFAKPRR >gi568815585f:49915494_50120451|GENSCAN_predicted_CDS_6|306_bp atgacggccatcaccgggtcgagaaaacggcgagaaaaggcggcccggttcggaaatggg ggaggggcgcgcgtccagtcctcctgtcaaggagcggccgcctgcagagaccccgcagtg cgccgtccccggctccggtcccgggcggggggtcttgctgacagatccgctgggcggcgg ctgctccgccgcagccggcacatgcgcagaatcatcgtggtgcacggctctccctttgct tcttcggttgcagtcctcttgcttcttgcgcgtgcgtgtagcgcttttgcaaagccgcgg aggtga >gi568815585f:49915494_50120451|GENSCAN_predicted_peptide_7|162_aa MTLQGKSYFQLLPQETFFTDRKILPEPHIADQNEQASDVLEFLVPGVDTGLSVAMLGSQG LAEMPVHFLALVRPREQDGATAFEGVQGQLVKGSFIFQIIQERDRDGHLVQLINNLFTTT WLKLLELMNYSLAHGLLPPSESAMAFSHLIIDTDSSASPLHT >gi568815585f:49915494_50120451|GENSCAN_predicted_CDS_7|489_bp atgaccctacaaggtaaatcttatttccagttacttccacaagaaaccttttttactgat cgtaaaatcttacctgaacctcacatagctgatcaaaatgaacaagcctctgatgtgctt gaatttctggtccctggagtggacacagggcttagtgtggctatgctgggttcccagggc cttgctgaaatgccagtacacttcttggcccttgtgaggcccagagagcaggatggtgcc acagcctttgagggtgtccagggtcagctggtcaaaggaagcttcatcttccagatcatc caggaaagagacagagatggccatttggtgcaactcataaataacctcttcaccacaact tggttgaagcttctagagctcatgaactattctttggctcatggcctccttccaccttca gagtcagcaatggctttctcacatctcatcattgacacggattcttctgcttccccttta cacacttaa