GENSCAN 1.0 Date run: 2-Nov-116 Time: 19:00:27 Sequence gi568815597f:19782514_20007743 : 225230 bp : 44.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 Intr - 2990 2893 98 1 2 51 97 85 0.388 5.35 1.05 Intr - 4604 4513 92 2 2 111 106 7 0.455 3.59 1.04 Intr - 9197 9018 180 2 0 46 66 98 0.105 3.46 1.03 Intr - 13921 13805 117 1 0 67 92 23 0.616 1.26 1.02 Intr - 14867 14790 78 2 0 122 84 10 0.702 3.75 1.01 Init - 17664 17599 66 0 0 36 59 124 0.748 3.38 1.00 Prom - 22062 22023 40 -2.36 2.03 PlyA - 23487 23482 6 1.05 2.02 Term - 32708 31905 804 2 0 -5 48 888 0.260 68.73 2.01 Init - 43498 43460 39 1 0 64 92 59 0.446 4.09 2.00 Prom - 44921 44882 40 -3.46 3.00 Prom + 58020 58059 40 -2.66 3.01 Init + 62645 62705 61 1 1 93 76 11 0.415 1.81 3.02 Intr + 77979 78053 75 1 0 83 89 44 0.204 3.49 3.03 Term + 89892 90028 137 1 2 77 42 153 0.445 7.78 3.04 PlyA + 92336 92341 6 1.05 4.00 Prom + 94069 94108 40 -5.86 4.01 Init + 100001 100221 221 1 2 65 76 454 0.754 40.00 4.02 Intr + 107872 108020 149 1 2 68 78 96 0.691 6.48 4.03 Intr + 111855 111967 113 1 2 71 96 66 0.993 5.80 4.04 Intr + 115027 115149 123 0 0 108 76 185 0.983 20.08 4.05 Intr + 121760 121885 126 1 0 73 13 122 0.596 4.08 4.06 Intr + 122378 122474 97 1 1 70 92 81 0.987 6.28 4.07 Intr + 123919 124103 185 2 2 36 95 83 0.904 3.31 4.08 Term + 125057 125233 177 1 0 108 41 230 0.950 17.99 4.09 PlyA + 127899 127904 6 1.05 5.05 PlyA - 128139 128134 6 1.05 5.04 Term - 137936 137794 143 0 2 72 54 240 0.986 17.09 5.03 Intr - 139891 139785 107 0 2 68 78 136 0.850 10.46 5.02 Intr - 140242 140104 139 2 1 142 81 288 0.999 33.02 5.01 Init - 141046 141007 40 1 1 41 106 50 0.886 0.67 5.00 Prom - 141966 141927 40 -6.66 6.00 Prom + 142063 142102 40 -5.46 6.01 Init + 142839 142897 59 2 2 39 105 2 0.772 -2.22 6.02 Term + 143889 144090 202 0 1 59 50 383 0.831 28.46 6.03 PlyA + 145561 145566 6 1.05 7.13 PlyA - 146469 146464 6 1.05 7.12 Term - 148380 148271 110 1 2 31 47 98 0.646 -1.33 7.11 Intr - 150591 150501 91 0 1 48 101 171 0.716 13.97 7.10 Intr - 152902 152834 69 1 0 63 80 52 0.321 1.28 7.09 Intr - 169956 169866 91 2 1 74 89 53 0.293 4.00 7.08 Intr - 171765 171593 173 0 2 50 21 111 0.062 -0.66 7.07 Intr - 191913 191795 119 1 2 87 75 13 0.249 0.08 7.06 Intr - 192468 192427 42 1 0 92 81 23 0.334 0.21 7.05 Intr - 195608 195502 107 1 2 39 83 87 0.739 3.16 7.04 Intr - 196011 195867 145 1 1 87 85 151 0.982 14.04 7.03 Intr - 199400 199369 32 1 2 118 41 12 0.091 -2.73 7.02 Intr - 201376 201183 194 1 2 -12 81 150 0.219 2.59 7.01 Init - 202097 201936 162 2 0 88 80 149 0.628 11.88 7.00 Prom - 202615 202576 40 -3.06 8.02 PlyA - 202704 202699 6 1.05 8.01 Sngl - 210580 210065 516 1 0 21 43 209 0.496 5.74 8.00 Prom - 210715 210676 40 -4.96 9.00 Prom + 216088 216127 40 -2.26 9.01 Sngl + 223119 223574 456 2 0 61 41 162 0.628 4.99 9.02 PlyA + 223685 223690 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 215561 215818 258 1 0 72 53 165 0.886 6.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_1|211_aa MWRRGLAGHLQRPAPLSRWVVEVTSSHSSHFSPSQTCTPNAQATRLLLGDRHISNRPLCP RRGGGAQTKSLELQSWQAPHSLDKSREGTEVLLVLTEGAFSRRRNPALSWNQCNFSDTLE IQFYFGLDPDPKSDQGVEAPKNQYTQQGNLHISPKNHPEPEQPVFRQREAHGCFLKSGKY FLNAYSDPGPVQKARGVAVKKQPGNCLVHTX >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_1|633_bp atgtggcgccgcgggcttgccggtcacctgcagaggccggcacccctgtcgcgatgggtg gtggaggtaacatcttctcattcttcacattttagccctagccagacttgtactccaaat gcccaggccacaagacttctactgggagacagacacattagcaaccggccattatgtcca aggagaggtggaggggcccagacgaagtctttggagcttcagagttggcaggccccacat tcgctggacaaatccagggagggcacagaggtactgctggtcctgacagaaggggctttt tctcgtcgcaggaatccagcactctcttggaatcagtgtaacttctcagacacacttgag atccagttttattttggactggatcctgatcccaaatcagatcaaggggtggaggccccc aagaaccagtatacacagcagggcaatttgcatatttctccaaagaaccatccagaacct gagcagcctgtcttcagacagagagaggcccacggctgtttcttgaaatctggcaagtat ttcctgaatgcctacagtgatccaggtcctgtccagaaagctcggggtgtggcagtgaaa aaacaaccagggaactgcctcgtacacacagnn >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_2|280_aa MSEKSAKDQDKVQAPGQSEKEGQIPSKSQRRQLTLSANAQVLSDSVPPVPVPRMACTKTL QQSQPISAGATTTTTAVAPAGGHSGSTECDLECLVCREPYSCPRLPKLLACQHAFCAICL KLLLCVQDNTWSITCPLCRKVTAVPGGLICSLRDHEAVVGQLAQPCTEVSLCPQGLVDPA DLAAGHPSLVGEDGQDEVSANHVAARRLAAHLLLLALLIILIGPFIYPGVLRWVLTFIIA LALLMSTLFCCLPSTRGSCWPSSRTLFCREQKHSHISSIA >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_2|843_bp atgagcgagaagtcagcaaaagaccaggacaaagtccaggcacctggccagagtgaaaag gaggggcagattccatccaagtcccagcggcgccagctgacactctctgccaatgcccag gtgctgagcgacagtgtcccaccggtccctgtgcccagaatggcctgcaccaagaccctg caacagtcccagcccatctccgcaggagccaccacaaccaccaccgctgtggcccctgct gggggtcattctggctccacagaatgtgacctggagtgtctggtgtgccgggagccctac agctgtccccggttgcccaagctgctggcctgccagcatgccttctgcgccatctgcctg aagctcctgctgtgcgtgcaggacaacacctggtccatcacctgcccgctgtgccgcaag gtcaccgccgtccccgggggcctcatctgcagcctgcgcgaccatgaggcggtggtgggg cagctggcccagccatgcacagaggtatcgctctgtcctcaggggctggtggatcctgct gacttggcagcaggacaccccagcttggtgggagaggatggacaggatgaagtaagtgca aaccacgtggcagcccggcgcctggccgcgcacctactcctgctggccttgctcattatc ctcatcgggcccttcatctacccgggtgtcttacgatgggtgctcaccttcatcatcgcc ctggccctgctgatgtccaccctcttctgctgtctccccagcacccggggcagctgctgg ccctcctccaggactctcttctgcagagagcagaaacacagccacatctcttccattgcc tga >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_3|90_aa MTERGGKKDTMFSKCDTEFKEKRREQQWEELAPFMAIWVVEQVSPGWGPGTGADDAVVNN EFTQDDDDCKYCGPLGSLAQTPNKDIAMEL >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_3|273_bp atgactgaaaggggaggaaagaaagacacaatgttttccaaatgtgatacagaattcaag gaaaagagaagagagcaacagtgggaagaacttgctccattcatggccatctgggtggtg gagcaggtgtctccagggtggggtcctggcacaggtgctgatgatgcagttgtaaataat gagtttacacaggatgacgatgactgcaaatactgtgggccattaggatctctggcacag acaccaaacaaagacattgctatggagctgtaa >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_4|396_aa MSRKQAAKSRPGSGSRKAEAERKRDERAARRALAKERRNRPESGGGGGCEEEFVSFANQL QALGLKLREVPGDGNCLFRALGDQLEGHSRNHLKHRQETVDYMIKQREDFEPFVEDDIPF EKHVASLAKPGTFAGNDAIVAFARNHQLNVVIHQLNAPLWQIRGTEKSSVRELHIAYRYG EHYDSVRRINDNSEAPAHLQTDMLHQDESNKREKIKTKGMDSEDDLRDEVEDAVQKVCNA TGCSDFNLIVQNLEAENYNIESAIIAVLRMNQGKRNNAEENLEPSGRVLKQCGPLWEEGG SGARIFGNQGLNEGRTENNKAQASPSEENKANKNQLAKVTNKQRREQQWMEKKKRQEERH RHKALESRGSHRDNNRSEAEANTQVTLVKTFAALNI >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_4|1191_bp atgtcccgaaagcaggcggcgaagagccggccgggcagcggcagccggaaagccgaggcc gagcgcaagcgggacgagcgggcggcgcgccgggccctggccaaggagcggcggaatcgg ccggagtctggcggcggcggcggctgcgaggaggagttcgtcagcttcgccaaccagctg caggccctggggctgaagctgcgggaggtgccgggggacggcaattgcttgttcagagct cttggtgatcaattggagggacactcacgaaatcatctcaagcacagacaggagacagtg gactacatgataaagcagcgggaagattttgaaccctttgtagaagatgacattcctttt gagaagcatgtggccagtttggcaaagcctggtacttttgctggcaatgatgcaattgta gcctttgcaagaaatcatcagttgaatgtagtgattcatcaacttaatgcccctttgtgg cagattcgtggtacagagaaaagcagcgtgagggagttacacatcgcatatcggtatgga gagcactacgacagtgttcggaggatcaatgacaactcagaggcacctgcacatctccag acggatatgcttcatcaagatgaatcaaataaaagagaaaagatcaagacaaagggaatg gactctgaagacgacctgagagatgaagtagaggatgctgtccagaaagtttgtaatgca actggatgttcagattttaatttaatagtccagaacctggaagctgaaaattataatatt gaatctgcaataattgccgtgcttcggatgaaccaagggaagagaaataatgcagaagag aatcttgagcccagtggtcgagtgctgaagcagtgtggccctttgtgggaggagggtggc agtggtgccagaatctttggaaatcagggcttaaatgaaggcaggaccgaaaacaataag gcacaggccagccctagtgaagaaaacaaagcaaataaaaaccagctcgcaaaggtcaca aacaaacagaggcgagaacagcagtggatggagaagaagaagcggcaggaggagaggcac cgccacaaagccctggagagcagaggtagccacagggacaataacagaagcgaagcagag gcgaacacgcaggtcaccttggtgaagaccttcgccgctctcaacatctga >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_5|142_aa MKSPHVLVFLCLLVALVTGNLVQFGVMIEKMTGKSALQYNDYGCYCGIGGSHWPVDQTDW CCHAHDCCYGRLEKLGCEPKLEKYLFSVSERGIFCAGRTTCQRLTCECDKRAALCFRRNL GTYNRKYAHYPNKLCTGPTPPC >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_5|429_bp atgaaatctccccacgtgctggtgttcctttgcctcctggtggctctggtcaccgggaac ctggttcagtttggggtgatgatcgagaagatgacaggcaagtccgccctgcagtacaac gactatggctgttactgcggcatcggtggctcccactggccggtggaccagactgactgg tgctgccacgcccacgactgctgctacgggcgtctggagaagctgggctgtgagcccaaa ctggaaaagtatcttttctctgtcagcgaacgtggcattttctgcgccggcaggaccacc tgccagcggctgacctgcgagtgtgacaagagggctgccctctgctttcgccgcaacctg ggcacctacaaccgcaaatatgcccattatcccaacaagctgtgcaccgggcccaccccg ccctgctga >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_6|86_aa MLGSSHPVSEKTCGEKGSESLEATTIIIIIITISIIIVNITIITITIIVITINITIIITI AITITIFITINIIYMVFICVPTQISH >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_6|261_bp atgctgggctcttctcaccctgtttcagagaagacttgcggggagaaggggtcagagagc ttagaagccaccaccatcatcatcatcataatcactatcagtatcatcatcgtcaatatt accatcatcaccatcaccatcattgtcatcaccatcaatattaccatcatcatcaccatt gccatcaccatcaccatcttcatcaccatcaatatcatttatatggtttttatctgtgtc cctacccaaatctcacattga >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_7|444_aa MPEPPLSAAVGSCAARASPTSAIPCSMAPSPIDHPRAEECGTQRSTCRQLHLRQVGSFTP EPVRPRTHQKEETPNTSEHQKEQTPDTPPLRTVTLTVRDRGFILEVSETKNPPIPVTVCV TTYHMMSFTSLLQAHGNLVNFHRMIKLTTGKEAALSYGFYGCHCGVGGRGSPKDATDRCC VTHDCCYKRLEKRGCGTKFLSYKFSNSGSRITCGNSSSYWPAGRKQEGFRWEIPTWTSYS PTCFSNNTHLGCWVWIPSQRHPAQGTQMVKEVDLRLSFHLLGCSTRLNPSSLAILVISVI GFLFGEQQDLHQTPGVSVTEFLTVWQWFSNLSKHRNYTEPCKKYFQGPIPGVSNSAPCIW TGIVTCFDQMDVAKVLLQDNDDDGGGDDDDDGGGVSGNDCDGDDSDSNVRQGPNEPYPDF IACLKDAAQKAILDAHVRETIVQL >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_7|1335_bp atgcctgagcctcccctctccgccgccgtgggctcctgcgcggcccgagcctcccccacg agcgccatcccctgctccatggcgcccagtcccattgaccacccaagggctgaggagtgt ggcacacagcgcagcacttgcaggcagctccacctgcggcaggtcggcagcttcactcct gagccagtgagaccacgaacccaccagaaggaagaaactccaaacacatctgaacatcag aaggaacaaactccggacacgccgcctttaagaactgtaacactcactgtgagggaccgc ggcttcattcttgaagtcagtgagaccaagaacccaccaattccggtcacagtatgtgtc accacctatcacatgatgtcatttactagcctactgcaggcccatgggaatttggtgaat ttccacagaatgatcaagttgacgacaggaaaggaagccgcactcagttatggcttctac ggctgccactgtggcgtgggtggcagaggatcccccaaggatgcaacggatcgctgctgt gtcactcatgactgttgctacaaacgtctggagaaacgtggatgtggcaccaaatttctg agctacaagtttagcaactcggggagcagaatcacctgtggcaatagcagctcctactgg cctgctgggagaaagcaggaagggttccgctgggaaattcccacctggaccagctacagc cccacctgtttctccaataacacgcatctgggctgctgggtgtggatcccttcacaaagg caccctgcacaggggacacagatggtcaaggaggtggatttgagactgagttttcatctc cttggctgcagcacccgattaaatccttcttccttggcaatacttgtcatctcagtgatt ggctttctgtttggcgagcagcaggacctacatcaaacccctggtgtttcagtaacagaa tttctaacagtgtggcagtggttctccaacttgagcaagcatcggaattacacggagcct tgtaaaaaatatttccagggcccaatccctggagtttctaattcagctccttgcatctgg actggcattgtaacttgctttgatcaaatggatgtggcaaaagtgctgctgcaggataat gatgatgatggtggtggtgatgatgatgatgatggtggtggtgttagtggtaatgattgt gatggtgatgacagtgatagtaatgtcagacagggccctaatgaaccttacccagacttc attgcctgcctaaaagatgcagctcaaaaggctatcttggacgcacatgtccgagagaca attgtccaactatag >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_8|171_aa MRQKIKKDIQDLNAALDQEDLIDIYRTLHHKSRDYTFFSVPHSTYSKIDHIIGNKTLLSR CKRIEIITVSLSDHSAIKLELRIKKLSQNCTTTWKLNNLLLNDYWVTNKIKAEITKFFET SEKKETTYQNLWDTFKAVCKGKFVTLNAHIRKQETSKIGTLILQLKELEKQ >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_8|516_bp atgagacagaaaattaaaaaggacattcaggacttgaacgcagctctggaccaagaggac ctaatagatatctacagaaccctccaccacaaatcaagagactatacgttcttctcagta ccacatagcacttattctaaaatcgaccacataattggaaataaaacactcctcagcaga tgcaaaagaatagaaatcataacagtcagcctctcagaccatagtgcaataaaattagaa ctaaggattaagaaactcagtcaaaactgcacaactacatggaaactgaacaaccttctc ctgaatgactactgggtaaccaacaaaattaaagcagaaataacgaagttctttgaaacc agtgagaaaaaagagacaacataccagaatctctgggacacatttaaagcagtgtgtaaa ggcaaatttgtaacactaaatgcccacatcagaaagcaggaaacatctaaaattggcacc cttatattacaattaaaagaactagagaagcaatag >gi568815597f:19782514_20007743|GENSCAN_predicted_peptide_9|151_aa MNKFLDTYTLRRLIQEEVESLNRPITGSEIEAIMNSLPTKKSPGPDGFTAKFYQRYEEEL VSFLLKQFQSIEKEAILPNAFYEASVILIPKPGRDTTKKENFRPMSLMNIDAKILNKILA NKSSSTSKSLSTMIKLASCLGYKAGSTYTNQ >gi568815597f:19782514_20007743|GENSCAN_predicted_CDS_9|456_bp atgaataaattcctggacacatacaccctccgaagactaatccaggaggaagttgaatct ctgaatagaccaataacaggctctgaaattgaggcaataatgaatagcctaccaaccaaa aaaagtccaggaccagacggattcacagccaaattctaccagaggtatgaagaggagctg gtatcattccttctgaagcaattccagtcaatagaaaaagaggcaatcctccctaatgca ttttatgaggccagcgtcatcctgataccaaagcctggcagagacacaacaaaaaaagag aattttagaccaatgtccctgatgaacattgatgcaaaaatccttaataaaatactggca aacaaatccagcagcacatctaaaagcttatccaccatgatcaagttggcttcatgcctg ggatacaaggctggttcaacatacacaaatcaataa