GENSCAN 1.0 Date run: 5-Nov-116 Time: 00:53:18 Sequence gi568815576f:24333405_24541486 : 208082 bp : 46.64% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1006 1169 164 1 2 66 86 77 0.786 4.92 1.02 Intr + 4982 5073 92 0 2 79 109 24 0.674 3.31 1.03 Intr + 17330 17425 96 1 0 60 110 25 0.323 2.11 1.04 Intr + 24575 24592 18 1 0 114 96 -2 0.245 0.01 1.05 Intr + 29120 29138 19 1 1 100 95 16 0.399 -0.32 1.06 Intr + 32170 32228 59 2 2 74 75 66 0.514 2.50 1.07 Intr + 35814 35916 103 2 1 70 89 34 0.890 1.45 1.08 Term + 38694 39025 332 1 2 29 43 202 0.920 4.62 1.09 PlyA + 39077 39082 6 1.05 2.00 Prom + 40896 40935 40 -2.46 2.01 Init + 63940 63991 52 0 1 83 71 29 0.177 2.04 2.02 Intr + 70871 70973 103 0 1 73 65 37 0.087 -0.87 2.03 Intr + 74648 74675 28 2 1 87 87 24 0.175 0.32 2.04 Intr + 78184 78300 117 0 0 104 102 73 0.997 10.96 2.05 Intr + 79244 79303 60 1 0 62 93 82 0.967 5.03 2.06 Term + 81130 81219 90 0 0 82 53 157 0.999 9.32 2.07 PlyA + 82146 82151 6 1.05 3.04 PlyA - 83208 83203 6 1.05 3.03 Term - 91415 90567 849 2 0 49 42 229 0.786 7.25 3.02 Intr - 93800 93469 332 0 2 21 72 240 0.533 11.05 3.01 Init - 95440 95272 169 1 1 31 121 95 0.062 6.73 3.00 Prom - 95818 95779 40 -10.74 4.00 Prom + 98324 98363 40 -10.74 4.01 Init + 100001 100332 332 1 2 92 105 807 0.827 79.58 4.02 Term + 107179 108085 907 1 1 136 45 1110 0.817 103.27 4.03 PlyA + 108912 108917 6 1.05 5.07 PlyA - 116061 116056 6 1.05 5.06 Term - 122978 122811 168 2 0 88 39 46 0.222 -2.42 5.05 Intr - 126817 126706 112 0 1 114 84 18 0.575 4.38 5.04 Intr - 127367 127335 33 1 0 126 66 16 0.357 0.54 5.03 Intr - 127649 127506 144 1 0 65 43 96 0.453 2.20 5.02 Intr - 138491 138364 128 2 2 121 44 59 0.173 4.28 5.01 Init - 145448 145314 135 2 0 81 36 112 0.834 5.44 5.00 Prom - 157220 157181 40 -3.86 6.00 Prom + 157725 157764 40 -8.56 6.01 Init + 159185 159232 48 1 0 85 65 18 0.571 0.15 6.02 Intr + 161447 161515 69 1 0 38 94 61 0.519 1.08 6.03 Intr + 161955 162103 149 2 2 25 62 245 0.795 14.63 6.04 Intr + 166703 166874 172 2 1 94 61 130 0.919 10.85 6.05 Intr + 168722 168809 88 1 1 79 64 74 0.597 3.64 6.06 Intr + 177345 177439 95 1 2 88 97 59 0.924 6.48 6.07 Intr + 179920 180081 162 0 0 67 83 120 0.956 9.57 6.08 Intr + 181797 181966 170 2 2 62 43 214 0.876 13.04 6.09 Intr + 186983 187064 82 2 1 87 103 127 0.966 13.74 6.10 Intr + 188582 188624 43 1 1 97 95 8 0.942 0.21 6.11 Intr + 190215 190369 155 1 2 61 100 209 0.957 19.19 6.12 Term + 192307 192390 84 0 0 95 42 173 0.999 11.05 6.13 PlyA + 193159 193164 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 97597 97482 116 1 2 84 53 97 0.860 5.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:24333405_24541486|GENSCAN_predicted_peptide_1|294_aa XQEEERGRVYNYMNAVERDLAALRQGMGLSRRSSTSSEPTPTVKTLIKSFDSASQVPNPA AAAIPRTPLSPSPMKTPPAAAVSPMQVCPSSVNSQPFDLLQSALSVVRASASLQQPLRDI GHVQTKVHMPAFPVLFSNGICDSHHPKPNKRRKERPSLSIGQRIWRIKEERLAEVVSEEN RRLSGSRPITGSEIEAIINSLPTKKSPRPDGFTAEFYQRYKEELVPFIPKLFQSIEKEGI LPNSFYEASIILIPKPGRDTTKKENFRSISLMNIDAKILNKILANEIQLLLDPP >gi568815576f:24333405_24541486|GENSCAN_predicted_CDS_1|885_bp nngcaagaggaggagcgaggccgggtatacaattacatgaatgccgttgagagagatttg gcagccttaaggcagggaatgggactgagtagaaggtcctcgacttcctcagagccaact cctacagtaaaaaccctcatcaagtcctttgacagtgcatctcaagtaccaaaccctgct gcagctgcaattcctcgaacgcccctgagcccaagtcctatgaaaacccctcctgcagca gctgtgtcccctatgcaggtgtgccctagttctgtcaatagtcagccttttgaccttctt cagtctgcgttaagtgtggtcagagccagtgcaagtctgcagcagcctctgagggacata ggccatgttcagactaaagtccacatgccagctttccctgtcctcttctccaacggcatc tgtgactcccaccacccgaagccgaataagagaagaaaggaaagaccctctctcagcatt ggccagagaatatggaggatcaaagaggaacgccttgctgaagtggtgtcagaagaaaac agaaggctatcaggaagtagaccaataacaggctctgaaattgaggcaataattaatagc ttaccaaccaaaaaaagtccaagaccagatggattcacggccgaattctaccagaggtac aaggaggagctggtaccattcattccgaaattattccaatcaatagaaaaagagggaata ctccctaactcgttttatgaggccagcatcatcctgataccaaagcctggcagagacaca acaaaaaaagagaattttagatcaatatccctgatgaacattgatgcaaaaatcctcaat aaaatactggcaaacgaaatccagctgctgctggatccaccatga >gi568815576f:24333405_24541486|GENSCAN_predicted_peptide_2|149_aa MVHFISRLLYSDGLDVPGDPIQDSDDAALSVSPHCSPDSRVSTTKHLWVSPRLLRVAVRV KNIDITNFSSSWNDGLAFCALLHTYLPAHIPYQELNSQDKRRNFMLAFQAAESVGIKSTL DINEMVRTERPDWQNVMLYVTAIYKYFET >gi568815576f:24333405_24541486|GENSCAN_predicted_CDS_2|450_bp atggtgcatttcatatccaggctgctgtactcagatggcctggatgtcccaggggatccc atacaggactccgatgatgccgcactttctgtctctccgcactgctctcctgactcacgt gtcagcaccaccaagcacctttgggtatctccaagactgctgcgggtggctgtgagagtg aaaaatattgacattacaaacttcagcagcagctggaatgatgggctggccttctgtgcc ctcctgcatacatatctccctgcccacattccatatcaagaactgaacagccaggataag agaaggaacttcatgctggctttccaggcagctgaaagtgtcggcatcaaatccacactg gacattaatgaaatggtacggactgaacgacccgactggcagaacgtgatgctgtatgtg acggcgatctacaagtactttgagacctga >gi568815576f:24333405_24541486|GENSCAN_predicted_peptide_3|449_aa MAGDLKGGKDPGHSWEAAAALQFQALQATPCLHNIGSAKRSRKSISKRKRYMKAKKGFAE ATEGPEASPPVTRADQPGASGEKSPAQMPEETPSGTIGLPLLGQLRQPLHQVTADARDQG CSGTSEEWAQKSSSSPCPVTPGTSFPCGVLAGGGERIPMATHRNIIWPLGRLAPNPSRAA TIPAIPTSAPGSQTDGRGQSEARTHPHPSPVVSWVPQTQFPPPLSTEDASARPSVPDSPF LGASRALGYLCPRLTFPGNRSPPGSSTWGISLAPGPGPPPLRKRLCAPGPSLLEELAKVH ASARGRSAAHGRTDGLTDGRSPPATCGSAPRAAPAAALAFPPRLPLRAFRCSLGQLGGSR TRRRNCAWDVTVPGPFTGSAGSRRGAGLGPRAGELVSGRRERNPEGSRTPASQSSRPRTP DGPTPPGHQPRTPGTPHPGPGTQDPARQT >gi568815576f:24333405_24541486|GENSCAN_predicted_CDS_3|1350_bp atggcaggggacctcaaaggaggaaaggacccaggacattcctgggaggcggctgctgcc ctgcagtttcaggcactgcaggccactccctgtctccacaacattggctcagctaagagg agtcgcaaatcaatttcaaagcgcaagaggtatatgaaggctaagaaagggtttgcagag gccactgagggaccagaggcgagtcctcctgtgacacgggcagaccagcccggggcctct ggggagaagtctcctgcccagatgccagaggagacaccaagtggcaccatcggcctccct ctgcttggccagctccggcagcctctccaccaggtgacggccgatgccagagaccagggc tgcagcggcacatccgaggagtgggcacagaagagcagcagcagcccctgcccagtgact cctggaacaagcttcccctgcggggtgttggcggggggtggtgagagaatccccatggca actcacagaaacatcatctggcccttgggccgccttgcgccaaatccctcgagagccgcc accattcccgccattcctacctccgctccgggcagccagacagacggccggggccagtcc gaggcgcgcacccacccccatccgtcccccgtcgtctcctgggtcccccaaacccagttc ccgccaccactgtccacggaagacgccagcgctcggcccagcgttcctgattcccccttt ctcggcgcgtcccgggcgctcgggtatctctgcccgcgcctcaccttcccggggaaccgg tctccgcccggctcctccacctggggaatctccctcgccccggggcccggcccgccaccc ctcaggaagcgcctctgcgcccccggtccatccctgctggaggagctcgcgaaggttcat gcgagcgcgcggggccggtctgcggcgcatggacggacggacggactgacggacggacgc tccccgcccgctacctgcggctccgcaccccgcgccgcgccggccgccgcacttgccttc cctccccgcctacccctgcgcgccttccggtgcagtttgggacagctcggagggtcccgc acgcggcgccgcaactgcgcctgggacgtgactgtgcctgggcccttcacaggcagtgcc ggcagccgccgcggggcaggactcggaccccgcgccggggaactggtctcggggcggcgg gaaaggaaccctgaggggagcagaaccccagcctcccagtcctcccggccccgcaccccc gacggccccacgccgcccggccaccagccccggacccccggcaccccgcaccccggaccc ggcacccaggaccccgcacgccagacctag >gi568815576f:24333405_24541486|GENSCAN_predicted_peptide_4|412_aa MPIMGSSVYITVELAIAVLAILGNVLVCWAVWLNSNLQNVTNYFVVSLAAADIAVGVLAI PFAITISTGFCAACHGCLFIACFVLVLTQSSIFSLLAIAIDRYIAIRIPLRYNGLVTGTR AKGIIAICWVLSFAIGLTPMLGWNNCGQPKEGKNHSQGCGEGQVACLFEDVVPMNYMVYF NFFACVLVPLLLMLGVYLRIFLAARRQLKQMESQPLPGERARSTLQKEVHAAKSLAIIVG LFALCWLPLHIINCFTFFCPDCSHAPLWLMYLAIVLSHTNSVVNPFIYAYRIREFRQTFR KIIRSHVLRQQEPFKAAGTSARVLAAHGSDGEQVSLRLNGHPPGVWANGSAPHPERRPNG YALGLVSGGSAQESQGNTGLPDVELLSHELKGVCPEPPGLDDPLAQDGAGVS >gi568815576f:24333405_24541486|GENSCAN_predicted_CDS_4|1239_bp atgcccatcatgggctcctcggtgtacatcacggtggagctggccattgctgtgctggcc atcctgggcaatgtgctggtgtgctgggccgtgtggctcaacagcaacctgcagaacgtc accaactactttgtggtgtcactggcggcggccgacatcgcagtgggtgtgctcgccatc ccctttgccatcaccatcagcaccgggttctgcgctgcctgccacggctgcctcttcatt gcctgcttcgtcctggtcctcacgcagagctccatcttcagtctcctggccatcgccatt gaccgctacattgccatccgcatcccgctccggtacaatggcttggtgaccggcacgagg gctaagggcatcattgccatctgctgggtgctgtcgtttgccatcggcctgactcccatg ctaggttggaacaactgcggtcagccaaaggagggcaagaaccactcccagggctgcggg gagggccaagtggcctgtctctttgaggatgtggtccccatgaactacatggtgtacttc aacttctttgcctgtgtgctggtgcccctgctgctcatgctgggtgtctatttgcggatc ttcctggcggcgcgacgacagctgaagcagatggagagccagcctctgccgggggagcgg gcacggtccacactgcagaaggaggtccatgctgccaagtcactggccatcattgtgggg ctctttgccctctgctggctgcccctacacatcatcaactgcttcactttcttctgcccc gactgcagccacgcccctctctggctcatgtacctggccatcgtcctctcccacaccaat tcggttgtgaatcccttcatctacgcctaccgtatccgcgagttccgccagaccttccgc aagatcattcgcagccacgtcctgaggcagcaagaacctttcaaggcagctggcaccagt gcccgggtcttggcagctcatggcagtgacggagagcaggtcagcctccgtctcaacggc cacccgccaggagtgtgggccaacggcagtgctccccaccctgagcggaggcccaatggc tatgccctggggctggtgagtggagggagtgcccaagagtcccaggggaacacgggcctc ccagacgtggagctccttagccatgagctcaagggagtgtgcccagagccccctggccta gatgaccccctggcccaggatggagcaggagtgtcctga >gi568815576f:24333405_24541486|GENSCAN_predicted_peptide_5|239_aa MGIEEIRMWKDEKYKVPETEMFREVRGDAGISRPPVSVSSNINAAEIPWNETNGSPGFGR VSSEQMGEQQPERLLPLAENTILDLNVLCLSRDMLHTVLLLEYLTTELTLRNHHFNILKQ KLSSTQEYDQSLTCRASGPEVGGCPSRPYLLTSHYGRRGFSSFFFICPLYTYASLSVPPC SQYSFQRLQVFSVLVFSSYEPAPRKGPVYSIHTEMTDMTDMSFIKDKNPHLACAKEVKL >gi568815576f:24333405_24541486|GENSCAN_predicted_CDS_5|720_bp atgggaatagaagagatcaggatgtggaaagatgagaagtataaagtacctgagacagaa atgttcagagaagttagaggagatgcaggaatcagcaggcccccagtgtcagtgtccagc aatattaatgctgcagagattccttggaatgaaaccaatgggagtccagggtttgggaga gtctcatcagagcagatgggcgagcagcagccagaacgcttgttgcctttggctgagaac actatcctggatctgaatgtcttgtgtttatcccgagatatgcttcacactgtgctgctg ctggagtatctgacaacagagctcactctcagaaaccaccatttcaacattctgaaacag aaactctccagcacacaggagtacgatcagagtctgacatgcagggcatcaggacctgaa gtaggtggctgcccatccaggccttatctgttgacttcccactatggaagacgaggattt agctccttcttcttcatttgccccctctatacatatgcatccctctctgtgcccccttgc tcccagtatagttttcagaggctacaagtcttcagtgttctagtcttctcttcgtatgag cctgctcccaggaaaggaccagtatactccattcacactgagatgactgatatgactgat atgtcttttataaaagacaaaaacccacatttggcttgtgccaaagaagtaaagctctaa >gi568815576f:24333405_24541486|GENSCAN_predicted_peptide_6|438_aa MEELQMTLKNLANQWVPPGARAQDSLTTSPGSRPDTKGQGRQFVRGHKHWRTVAMAGAEW KSLEECLEKHLPLPDLQEVKRVLYGKELRKLDLPREAFEAASREDFELQGYAFEAAEEQL RRPRIVHVGLVQNRIPLPANAPVAEQVSALHRRIKAIVEVAAMCGVNIICFQEAWTMPFA FCTREKLPWTEFAESAEDGPTTRFCQKLAKNHDMVVVSPILERDSEHGDVLWNTAVVISN SGAVLGKTRKNHIPRVGDFNESTYYMEGNLGHPVFQTQFGRIAVNICYGRHHPLNWLMYS INGAEIIFNPSATIGALSESLWPIEARNAAIANHCFTCAINRVGTEHFPNEFTSGDGKKA HQDFGYFYGSSYVAAPDSSRTPGLSRSRDGLLVAKLDLNLCQQVNDVWNFKMTGRYEMYA RELAEAVKSNYSPTIVKE >gi568815576f:24333405_24541486|GENSCAN_predicted_CDS_6|1317_bp atggaggagttgcagatgactctcaaaaacttggcaaatcagtgggtgcctcccggagcc cgcgcccaggactccctcacgacctctccagggtcccggcccgacaccaaggggcagggc aggcagttcgtgcgcggacacaagcactggcggaccgtggccatggcgggcgctgagtgg aagtcgctggaggaatgcttggagaagcacctgccgctccccgacttgcaggaagtgaag cgcgttctctatggcaaggaactcaggaagcttgatctgcccagggaagctttcgaagct gcctccagagaagactttgaactgcagggatatgcctttgaagcagcggaggagcagctg agacgaccccgcattgtgcacgtggggctggttcagaacagaatccccctccccgcaaat gcccctgtggcagaacaggtctctgcccttcatagacgcataaaggctatcgtagaggtg gctgcaatgtgtggagtcaacatcatctgtttccaggaagcatggactatgccctttgcc ttctgtacgagagagaagcttccttggacagaatttgctgagtcagcagaggatgggccc accaccagattctgtcagaagctggcgaagaaccatgacatggtggtggtgtctcccatc ctggaacgagacagcgagcatggggatgttttgtggaatacagccgtggtgatctccaat tccggagcagtcctgggaaagaccaggaaaaaccacatccccagagtgggtgatttcaac gagtcaacttactacatggagggaaacctgggccaccccgtgttccagacgcagttcgga aggatcgcggtgaacatttgctacgggcggcaccaccccctcaactggcttatgtacagc atcaacggggctgagatcatcttcaacccctcggccacgataggagcactcagcgagtcc ctgtggcccatcgaggccagaaacgcagccattgccaatcactgcttcacctgcgccatc aatcgagtgggcaccgagcacttcccgaacgagtttacctcgggagatggaaagaaagct caccaggactttggctacttttatggctcgagctatgtggcagcccctgacagcagccgg actcctgggctgtcccgtagccgggatggactgctagttgctaagctcgacctaaacctc tgccagcaggtgaatgatgtctggaacttcaagatgacgggcaggtatgagatgtacgca cgggagctcgccgaagctgtcaagtccaactacagccccaccatcgtgaaagagtag