GENSCAN 1.0 Date run: 16-Aug-121 Time: 14:36:39 Sequence gi568815590r:79938119_80164595 : 226477 bp : 39.24% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Term + 1031 1174 144 1 0 116 38 84 0.842 3.13 1.02 PlyA + 1857 1862 6 1.05 2.00 Prom + 2909 2948 40 -3.25 2.01 Sngl + 45199 45528 330 0 0 88 44 306 0.992 21.97 2.02 PlyA + 45612 45617 6 1.05 3.00 Prom + 46612 46651 40 -10.15 3.01 Init + 46705 47726 1022 0 2 49 72 470 0.060 34.70 3.02 Intr + 48307 49703 1397 1 2 2 24 364 0.001 9.26 3.03 Intr + 56946 57071 126 1 0 72 69 73 0.017 3.53 3.04 Intr + 66147 66304 158 1 2 0 -4 144 0.027 -4.79 3.05 Term + 66477 66740 264 2 0 67 37 295 0.406 16.82 3.06 PlyA + 67289 67294 6 1.05 4.00 Prom + 67822 67861 40 -6.15 4.01 Sngl + 68279 69691 1413 1 0 65 42 548 0.989 43.59 4.02 PlyA + 69991 69996 6 1.05 5.00 Prom + 70187 70226 40 -6.35 5.01 Sngl + 70241 70609 369 1 0 67 41 200 0.866 9.06 5.02 PlyA + 71305 71310 6 1.05 6.03 PlyA - 71709 71704 6 1.05 6.02 Term - 91197 91150 48 0 0 121 46 9 0.589 -3.67 6.01 Init - 92130 91903 228 0 0 92 80 421 0.814 40.12 6.00 Prom - 92312 92273 40 -13.01 7.04 PlyA - 92982 92977 6 1.05 7.03 Term - 94832 94705 128 0 2 94 48 71 0.154 1.16 7.02 Intr - 95017 94920 98 0 2 58 58 113 0.154 4.03 7.01 Init - 96235 96051 185 1 2 53 89 181 0.165 11.37 7.00 Prom - 99775 99736 40 -4.75 8.14 PlyA - 100793 100788 6 1.05 8.13 Term - 105476 105377 100 1 1 58 54 109 0.214 1.12 8.12 Intr - 115312 115164 149 1 2 81 32 195 0.937 11.31 8.11 Intr - 126475 126360 116 2 2 41 98 204 0.986 15.95 8.10 Intr - 135508 135378 131 0 2 81 53 29 0.085 -1.88 8.09 Intr - 136726 136472 255 0 0 89 105 84 0.105 5.84 8.08 Intr - 146390 146321 70 0 1 104 31 44 0.000 -2.38 8.07 Intr - 149596 149525 72 2 0 31 96 81 0.003 1.66 8.06 Intr - 150201 149944 258 1 0 69 13 136 0.000 0.71 8.05 Intr - 164238 164154 85 0 1 30 87 87 0.001 1.27 8.04 Intr - 171817 171710 108 1 0 76 76 79 0.022 5.06 8.03 Intr - 195054 194988 67 2 1 57 115 60 0.387 3.59 8.02 Intr - 195647 195560 88 0 1 98 64 71 0.208 3.81 8.01 Init - 212654 212489 166 2 1 63 90 84 0.038 5.94 8.00 Prom - 213668 213629 40 -2.55 9.02 PlyA - 214988 214983 6 1.05 9.01 Term - 217410 217299 112 2 1 97 36 96 0.257 2.35 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 176537 176743 207 1 0 78 48 150 0.805 6.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_1|47_aa NDFAHIEKTSLVVTSHSYSFESRTNTPLTKSLLAKEHTDSLSLIMMQ >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_1|144_bp aatgattttgctcacattgaaaaaacatctctagtggtcacatcacattcttactcattt gaaagcaggaccaacactcctttgacaaagtcattgcttgccaaggagcacacggattcc ttaagtctcatcatgatgcagtaa >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_2|109_aa MGKKQSRKTGNSKKQSASPPPKERSSSPATEQSWTENDFDELREEGFRRSNYSELQEETQ TKGKEVENFEKNLDECITRITNTEKCLKELTELKAKARELREECRSLRS >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_2|330_bp atggggaaaaaacagagcagaaaaactggaaactctaaaaagcagagcgcctctcctcct ccaaaggaacgcagctcctcaccagcaacggaacaaagctggacggagaatgactttgac gagttgagagaagaaggcttcagacgatcaaactactccgagctacaggaggaaacccaa accaaaggcaaagaagttgaaaactttgaaaaaaatttagacgaatgtataactagaata accaatacagagaagtgcttaaaggagctgacagagctgaaagccaaggctcgagaacta cgtgaagaatgcagaagcctcaggagctga >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_3|988_aa MGDFNTPQSTLDRSMRQKVNKDTQELNSALHHADLIHIYRILHPKSTEYTFFSAPHHTYS KIDHIVGSKALLSKCKRSEIITNCLSDHSAIKLQLRIRKLTQNRSTTWKLNNLLLNDYWV HNEMKAEIKMFFETNKNKDTTYRNLWDTVKSVCRGKFIALNAHKRKQERSKMDTLTSQLE ELKKQEQTHSKASRRQEITKIRAELKEIETQKTLQKINESRSWFFERINKIDRPLARLIK KKREKNHIDAIKNDKGDITTDPTEIQTNIREYYKHLYANKLENLEEMDKFLDTYNLPRLN QEEVDSLNRPITGSEIVAIINSLPTKKSPGPDGFTAEFYQRAIRQEKEIKGIQLGKEEVN LSLFADDMIVYLENPVVSAQNLLKLIRNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSE LPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRINIMKMA ILPKVIYRFNAIPIKLPMTFFTELEKTTFKLIWNQKRAHIAKSILSQKNKAGGITLPDFK LYYKATVTKTAWYWYQNRDIDQWNRTEPSEITPHSYNYLIFDKPDKNKQWGKDSLFNKWC WENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMEKDFMS KTPKAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWEKIFTTYSSDKGLISRIYNE LKQIYKKKSNNPIKKWAKDMNRHFSKEDIYAAKKHMKKCSPSLAIREMLIKTTMRYHLTR VRMAIIKKSGNNRCWRGCGEIGTLLHEYNEQNLYDVIFIGLLLYPPDQINSLEAGIVISL YVTSLKYTAGYPSETKLPEEQSGSNICCSSVFAVPQPLLLIPRQTGSGVDLQQTPTDLQL RECSSLPAMEQSWTENDFDDLREEGFRRSNFSKLKKEVRTHRKEAKNLEKRLDKWLTRIT NAEKSLNDLMELKTTAQELRDECTSFSS >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_3|2967_bp atgggagactttaacaccccacagtcaacattagacagatcaatgagacagaaagttaat aaggatacccaggaattgaactcagctttgcaccacgcggacctaatacacatctacaga attctccaccccaaatcaacagaatatacatttttttcagcaccacaccacacctattcc aaaattgaccacatagttggaagtaaagctctcctcagcaaatgtaaaagatcagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactacaactcaggattaggaaactc actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgactactgggta cataacgaaatgaaggcagaaataaagatgttctttgaaaccaacaagaacaaagacaca acataccggaatctctgggacacagtcaaatcagtgtgtagagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatccaaaatggacaccctaacatcacaattagaa gaactaaaaaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataactaaa atcagagcagaactgaaggaaatagagacacaaaaaacccttcaaaaaattaatgaatcc aggagctggttttttgaaaggatcaacaaaattgatagaccgctagcaaggctaataaag aagaaaagagagaagaatcacatagacgcaataaaaaatgataaaggggatatcaccacc gatcccacagaaatacaaactaacatcagagaatactataaacacctctacgcaaataaa ctagaaaatctagaagaaatggataaattcctcgacacatacaacctcccaagactaaac caggaagaagttgactctctgaatagaccaataacaggctctgaaattgtggcaataatc aatagcttaccaaccaaaaagagtccaggaccagatggattcacagccgaattctaccag agggcaattaggcaggagaaggaaataaagggtattcaattaggaaaagaggaagtcaac ttgtccctgtttgcagatgacatgattgtatatctagaaaaccccgttgtctcagcccaa aatctccttaagctgataagaaacttcagcaaagtctcaggatacaaaatcaatgtgcaa aaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagtgaa ctcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagggat gtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggataca aacaaatggaagaacattccatgctcatgggtaggaagaatcaatatcatgaaaatggcc atactgcccaaggtaatttatagattcaatgccatccccatcaagctaccgatgactttc ttcacagaattggaaaaaactactttcaagctcatatggaaccaaaaaagagcccacatc gccaagtcaatcctgagccaaaagaacaaagctggaggcatcacgctacctgacttcaaa ctatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacagagatata gatcaatggaacagaacagagccctcagaaataacgccgcatagctacaactatctgatc tttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgt tgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttacaccttat acaaaaatcaattcaagatggattaaagacttaaatgttagacctaaaaccataaaaacc ctagaagaaaacctaggcattaccattcaggacataggcatggaaaaggacttcatgtct aaaacaccaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaacta aagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctacagaatgg gagaaaattttcacaacctactcatctgacaaagggctaatatccagaatctacaatgaa ctcaaacaaatttacaagaaaaaatcaaacaaccccatcaaaaagtgggcaaaggacatg aacagacacttctcaaaggaagacatttatgcagccaaaaaacacatgaaaaaatgctca ccatcactggccatcagagaaatgctaatcaaaaccacaatgagataccatctcacacga gttagaatggcaatcattaaaaagtcaggaaacaacaggtgctggagaggatgtggagaa ataggaacacttttacacgaatacaatgaacagaacttatatgatgttatatttattggt ttacttctctatccccctgaccagatcaactccctggaagcagggattgtcattagtctg tatgtcactagccttaaatacacagccgggtacccctctgagacgaagcttccagaggaa caatcaggcagcaacatttgctgttcatcagtattcgctgttccgcagcctctgctgctg atacccaggcaaacagggtctggagtggatctccagcaaactccaacagacctgcagctg agggaatgcagctccttgccagcaatggaacaaagctggacagagaatgactttgatgac ttgagagaagaaggcttcagacgatcaaacttctctaagctaaagaaggaagtcagaacc catcgcaaagaagctaaaaaccttgaaaaaagattagacaaatggttaactagaataacc aatgcagagaagtccttaaatgacctgatggagctgaaaaccacggcacaagaactacgt gacgaatgcacaagcttcagtagctga >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_4|470_aa MNAEITKIRAELKETETQKNLQIINESRSWFFERINKIDRPLARLIKKKREKNHIDAIKN DKGDITTDPTEIQTNIREYYKHLYANKLENLEEMDKFLDTYTLPRLNQEEVESLNRPITG SEIEAIINSLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLFQSIEKEGTLPNSFYEASII LIPKPGRDTTKKENFRPISLMNIDAKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFN ICKSINVIQHINRTKDKNHLIISIDAEKAFDKIQQPFMLKTLKQLGIDGTYLKIIRAIYE KPTADIILNGQKLEAFPLKTGARKGCPLSPLLFNTVLEVLARKIRQEKEIKGIQLGKEEV KLSLFADDMIVYLENPIISAQNLLKLIRNFSKVSGYKINVQKSQAFLYTNNRQTESQIMS ELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCS >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_4|1413_bp atgaacgcagaaataactaagatcagagcagaactgaaggagacagagacacaaaaaaac cttcaaataatcaatgaatccaggagctggttttttgaaaggatcaacaaaattgataga ccgctagcaaggctaataaagaagaaaagagagaagaatcacatagacgcaataaaaaat gataaaggggatatcaccaccgatcccacagaaatacaaactaacatcagagaatactat aaacacctctacgcaaataaactagaaaatctagaagaaatggataaattcctcgacaca tacaccctcccaagactaaaccaggaagaagttgaatccctgaatagaccaataacaggc tctgaaattgaggcaataattaatagcctaccaaccaaaaaaagtccaggaccagacgga ttcacagctgaattctaccagaggtacaaggaggagctggtaccattccttctgaaacta tttcaatcaatagaaaaagagggaaccctccctaactcattttatgaggccagtatcatc ctgataccaaagcctggcagagacacaaccaaaaaagagaattttagaccaatatccttg atgaacatcgatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacatc aaaaagcttatccaccatgatcaagtaggcttcattcctgggatgcaaggctggttcaac atatgcaaatcaataaatgtaatccagcacataaacagaaccaaagacaaaaaccacttg attatctcaatagatgcagaaaaggcctttgacaaaattcaacaacccttcatgctaaaa actctcaagcaattaggtattgatggcacgtatctcaaaataataagagctatctatgaa aaacccacagccgatatcatactgaatgggcaaaaactggaagcattccctttaaaaact ggtgcaagaaagggatgccctctctcaccactcttattcaacacagtgttggaagttcta gccaggaaaatcaggcaggagaaagaaataaagggtattcaattaggaaaagaggaagtc aaactgtccctgtttgcagatgacatgattgtatatttagaaaaccccatcatctcagcc caaaatctccttaagctgataagaaacttcagcaaagtctcaggatacaaaatcaatgtg caaaaatcacaagcattcttatacaccaataacagacaaacagagagccaaatcatgagt gaactcccattcacaattgcttcaaagagaataaaatacctaggaatccaacttacaagg gacgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaagaggat acaaacaaatggaagaacattccatgctcatga >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_5|122_aa MGKDFMSKTPKAMATKAKIDKWDLIKLKSFCTAKETTIGVNRQPTEWENISAIYPSDKEL ISRIYKELKQIYKKKSNNPIKKWVKDMNRHFSKEDIYATKRHMKKCSPSLAIREMQIKTT MG >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_5|369_bp atgggcaaggacttcatgtctaaaacaccaaaagcaatggcaacaaaagccaaaattgac aaatgggatctaattaaactaaagagcttctgcacagccaaagaaactaccatcggagtg aacaggcaacctacagaatgggagaacatttctgcaatctacccatctgacaaagagcta atatccagaatctacaaagaactcaaacaaatttacaagaaaaaatcaaacaaccccatc aaaaagtgggtgaaggatatgaacagacacttctcaaaagaagacatttatgcaaccaaa agacacatgaaaaaatgttcaccatcactggccatcagagaaatgcaaatcaaaaccaca atgggataa >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_6|91_aa MAALCRTRAVAAESHFLRVFLFFRPFRGVGTESGSESGSSNAKEPKTRAGGFASALERHS ELLQKVEPLQKVEPGRGVVPNSSQDWQCPGL >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_6|276_bp atggcggcgctgtgtcggacccgtgctgtggctgccgagagccattttctgcgagtgttt ctcttcttcaggccctttcggggtgtaggcactgagagtggatccgaaagtggtagttcc aatgccaaggagcctaagacgcgcgcaggcggtttcgcgagcgcgttggagcggcactcg gagcttctacagaaggtggagcccctacagaaggtggagcccgggcggggagtagtacca aacagttcacaagactggcagtgtcctgggttatag >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_7|136_aa MLAGQELVLLGAAAATQLQLQIQASVCSRGPGKSPAHAGLEVPAPVRSKVVAELGCCHNL ARHWCLQKLVVVHLVQLQPCTEPVPVAAHPAAAAGVGSGPAAQDEHSLLGRVGRMSPAGM SKTPAEALQAMEVSSW >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_7|411_bp atgttggcagggcaggagcttgtgctcctgggtgcagctgcagccacccagctgcagctc cagatccaggcatctgtgtgctctcgggggcctggaaagtcccctgcccatgcaggcctg gaagtgcctgctccagttcggagcaaagttgtggctgagcttgggtgctgtcacaacctg gccagacactggtgcctgcagaagctggttgtggtccatctggtccagctgcagccttgc acagagccggtgcctgtagctgctcaccctgctgcagcagctggtgtgggatctgggcca gcagcacaagatgagcacagcctactgggcagagtgggcagaatgagcccagcaggcatg agcaaaactccagcagaggcgctgcaggccatggaagtttccagctggtga >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_8|554_aa MKVQALGKYSHSKWEKLANMEGLQAPCKSEIQWGSQILKLQNDLHVSHPGHADARASAVP SSSWTLELDEIEIVSVGLHLELRSSNYMSYMCIPGFTNAGKKDRSDQGFRSRAVLVQSDS KSEKWEKIGWRNLEELRIQCKLQLTCSDQVHKEVNIERRKRLLSSYVNELETSIRVLPSP CYSKCDPMWFQGQQPPITRMQTLHHSKIPRAIRCEGLAYPYCPALGYSHRMERTGTWRHV PGFKLRLLYLLALCPKTEGKPTAASVEGGESPSSSQKGPGAVSALSLLPIGLVREGAASP TQQLRFYCSCHGHTWENSSLKNLKLSRPELLHPSLLKALSGRQHPLRCSLALSQWFSTTV LPHCGHLTMFVDTLLVVTTGCGCGAATDICVDTHLLCARHCASEILLRLKTQDWSMKAKY CPAAVCILYNRLDRLLRTDPVPEEGEDVAATISATETLSEEEQEELRRELAKVEEEIQTL SQVLAAKEKHLAEIKRKLGINSLQELKQNIAKGWQDVTATSAASDPVELGWDLKIGIAYD FLIAAAAAGLGPTL >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_8|1665_bp atgaaggtacaggcattgggtaaatacagccattccaaatgggagaaattggccaatatg gaggggctacaggctccatgcaagtccgaaatccagtggggcagtcagatcttaaagcta caaaatgatctccatgtctcacatccaggtcacgctgatgcaagagcatcagcggtacca tcaagttcctggactcttgaacttgatgaaatcgagatagtttctgtagggctccatttg gaattgaggagcagcaactatatgagttatatgtgcatacctggttttacaaatgcagga aaaaaggaccggagtgaccagggtttcaggagcagagcagttcttgttcagagtgattcc aagtcagaaaagtgggagaaaattggatggaggaacctagaagaactgaggattcagtgc aaactacagctgacatgtagtgatcaggtccataaagaggtcaacatcgagaggagaaaa aggctcctcagtagctacgtgaatgagcttgaaacctccattcgagttcttccatcacct tgttactccaagtgtgatcccatgtggttccagggccagcagcctcccatcaccaggatg cagactctgcatcacagcaagatccccagggccattagatgtgagggactggcttatcct tactgcccggctctaggctacagccacaggatggagaggacagggacttggaggcacgta ccaggcttcaaactgcggctcctttacttgctagctctatgcccaaagacagaaggtaaa ccaactgctgcttcggtggaaggtggagagagcccttccagctctcagaaggggcctgga gccgtctcagcactgagtctcttgcccattggcctggtgagggaaggagctgccagcccc acccaacagctcaggttttactgttcttgccatgggcacacatgggaaaattccagcctt aagaatctaaagttgtcacggcctgagctgctgcatccctctcttctgaaggcactcagt ggcagacagcatcctcttagatgctccctcgccttaagtcagtggttctcaaccacagtt ttgccccactgtgggcatttgaccatgtttgtagatactcttttggttgttacaactggg tgtgggtgtggggctgctactgacatctgtgttgacacgcatctgctgtgtgccagacac tgtgcatcagagatcctgctaaggctaaagacacaggattggagcatgaaggccaagtat tgcccagctgcagtctgtattctatacaatagactggaccgtctgctgagaacagaccca gtccctgaggaaggagaagatgttgctgccacgatcagtgccacagagaccctctcggaa gaggagcaggaagagctaagaagagaacttgcaaaggtagaagaagaaatccagactctg tctcaagtgttagcagcaaaagagaagcatctagcagagatcaagcggaaacttggaatc aattctctacaggaactaaaacagaacattgccaaagggtggcaagacgtgacagcaaca tctgcagcctctgatcctgtagagttggggtgggacctgaaaattggcattgcttatgac ttcctgatagctgcagcagcagctggtctggggcccacactttga >gi568815590r:79938119_80164595|GENSCAN_predicted_peptide_9|37_aa XCLESPSTAQRLTIPTLSLAEIQHSHPMTPPTWFEDS >gi568815590r:79938119_80164595|GENSCAN_predicted_CDS_9|114_bp nnatgcctcgagtccccctccactgctcaacgtctgactattccaaccctgagccttgca gaaatccagcactcacaccccatgacaccccctacctggtttgaggattcataa