GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:07:55 Sequence gi568815583r:41287447_41497059 : 209613 bp : 46.03% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4387 4559 173 0 2 62 63 133 0.821 7.12 1.02 Intr + 44710 45084 375 1 0 131 66 194 0.022 15.63 1.03 Intr + 54245 54296 52 2 1 41 74 26 0.023 -4.59 1.04 Intr + 54940 55008 69 0 0 65 109 53 0.745 4.48 1.05 Intr + 61652 61795 144 1 0 62 68 103 0.903 6.18 1.06 Intr + 63587 63683 97 1 1 19 97 77 0.320 1.28 1.07 Intr + 70703 70812 110 0 2 34 95 75 0.591 2.80 1.08 Intr + 77956 78143 188 0 2 60 103 21 0.557 -0.71 1.09 Intr + 84081 84238 158 0 2 86 58 106 0.909 7.05 1.10 Intr + 88266 88382 117 1 0 61 95 25 0.467 0.94 1.11 Term + 94699 94820 122 2 2 49 55 55 0.171 -3.06 1.12 PlyA + 96583 96588 6 1.05 2.06 PlyA - 97028 97023 6 1.05 2.05 Term - 100147 99998 150 1 0 45 43 93 0.601 -1.59 2.04 Intr - 101076 101002 75 0 0 72 87 23 0.386 0.31 2.03 Intr - 107598 107413 186 0 0 50 87 137 0.833 9.69 2.02 Intr - 109569 109041 529 2 1 47 73 236 0.686 10.94 2.01 Init - 116556 116483 74 0 2 88 116 45 0.917 7.84 2.00 Prom - 125330 125291 40 -3.46 3.00 Prom + 127985 128024 40 -6.76 3.01 Init + 129670 129867 198 0 0 54 94 392 0.996 33.30 3.02 Intr + 150875 150985 111 1 0 76 90 92 0.913 8.78 3.03 Intr + 165455 165602 148 1 1 101 105 100 0.836 12.81 3.04 Intr + 170226 170430 205 1 1 80 116 208 0.984 21.06 3.05 Intr + 177325 177439 115 1 1 40 91 116 0.914 7.55 3.06 Intr + 178695 178806 112 2 1 108 115 173 0.999 21.95 3.07 Intr + 182811 182946 136 1 1 98 57 261 0.981 23.63 3.08 Intr + 183726 183903 178 0 1 89 11 106 0.752 2.92 3.09 Intr + 187174 187256 83 0 2 94 70 30 0.567 0.24 3.10 Intr + 188079 188166 88 0 1 34 88 65 0.694 1.07 3.11 Intr + 188266 188373 108 0 0 75 94 58 0.915 5.58 3.12 Intr + 189000 189077 78 2 0 85 116 66 0.992 8.85 3.13 Intr + 189719 189840 122 1 2 25 89 188 0.995 11.89 3.14 Intr + 190012 190069 58 1 1 117 83 117 0.999 12.99 3.15 Intr + 191102 191179 78 1 0 76 82 58 0.930 3.75 3.16 Intr + 191657 191761 105 1 0 61 26 93 0.606 0.81 3.17 Intr + 192768 192879 112 2 1 76 91 113 0.942 10.35 3.18 Term + 193135 193241 107 2 2 121 53 70 0.921 5.37 3.19 PlyA + 194824 194829 6 1.05 4.00 Prom + 194865 194904 40 -8.76 4.01 Init + 194999 195006 8 1 2 85 89 4 0.759 0.41 4.02 Intr + 196218 196288 71 0 2 90 58 89 0.832 4.93 4.03 Term + 198732 198949 218 1 2 89 43 133 0.614 6.21 4.04 PlyA + 201707 201712 6 -0.45 5.00 Prom + 202074 202113 40 -7.56 5.01 Init + 206509 206970 462 0 0 92 96 766 0.954 71.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:41287447_41497059|GENSCAN_predicted_peptide_1|534_aa MVLNKTVLTSDASCTSEVPPATCIFDQVATDLESPMTLLEPAFDFVGFLIVFSSFKVRWF SATRWRKREHPLGLASAFPERFTALTGEDHGPEGPRQVPGEVHRVGEHCVTLCALEHSAP LRLQPRRQLRAGGWLLSPQPCGAERRGPLHHLRIPLHGGRKRSLVNRPLSATTKVPPGRR CTTPQHFWLIPIKCILHIVQATKLLKALKGYIKHEARKGNENQDESQTSASSCDETEIQI SNQEEAERQPLGHVTKTRRRCKTVRVDPDSQNHEKQESQDLRATAKVPSPPDEHQEAENA VSSDFKKLHEAHFKEMESIDQYIERKKKHFEEHNSMNELKQQPINKGGVRTPVPPRGRLS VASTPISQRRSQGRSCGPASQSTLGLKGSLKRSAISAAKTGVRFSAATKDNEHKRSLTKT PARKSAHVTVSGGTPKGEAVLGTHKLKTITGNSAAVITPFKLTTEATQTPVSNKKPVFDL KASLSRPLNYEPHKGSEERLFPAAITSRSGERLCPAAHRLRCGERLCPAAPSGM >gi568815583r:41287447_41497059|GENSCAN_predicted_CDS_1|1605_bp atggtcctcaacaagactgtcctcacttcagatgccagctgtacttcggaggtaccccca gccacctgcattttcgaccaagtggctacagatttggagagtcccatgaccctcttagaa ccagcttttgatttcgttggttttcttattgtttttagttcctttaaagttaggtggttc tccgccacccggtggagaaagcgggaacaccctctcgggctagcctctgcctttcccgaa cgcttcactgcactcactggagaagaccacggccccgagggaccgcgacaggtcccaggc gaggtgcaccgagtcggcgagcactgcgtgacactgtgcgcactggaacacagcgcacct ctcaggctgcagccaagacggcagctgcgggccggcggctggctcctcagcccccagccc tgcggggccgagcggcgaggaccccttcaccacctgcgtatcccactccatggaggtcgt aaaagaagcttggtcaatcgccctctcagtgccaccacaaaagtccccccggggcggcgt tgcacaacgccacagcatttctggctcatccccatcaaatgcatcttgcacattgtgcag gcaaccaagttgttaaaagccttgaaaggctacattaaacatgaggcaagaaaaggaaat gagaatcaggatgaaagtcaaacttctgcatcctcttgtgatgagactgagatacagatc agcaaccaggaagaagctgagagacagccacttggccatgtcaccaaaacaaggagaagg tgcaagactgtccgtgtggaccctgactcacagaatcatgaaaagcaggaaagccaggat ctcagagctactgcaaaagttccttctccaccagacgagcaccaagaagctgagaatgct gtttcctcagactttaagaagcttcatgaagctcattttaaggaaatggagtccattgat caatatattgagagaaaaaagaaacattttgaagaacacaattccatgaatgaactgaag cagcagcccatcaataagggaggggtcaggactccagtacctccaagaggaagactctct gtggcttctactcccatcagccaacgacgctcgcaaggccggtcttgtggccctgcaagt cagagtaccttgggtctgaaggggtcactcaagcgctctgctatctctgcagctaaaacg ggtgtcaggttttcagctgctactaaagataatgagcataagcgttcactgaccaagact ccagccagaaagtctgcacatgtgaccgtgtctgggggcaccccaaaaggcgaggctgtg cttgggacacacaaattaaagaccatcacggggaattctgctgctgttattaccccattc aagttgacaactgaggcaacgcagactccagtctccaataagaaaccagtgtttgatctt aaagcaagtttgtctcgtcccctcaactatgaaccacacaaaggaagtgaggagcgcctc ttcccagccgccatcacatctaggagtggggagcgtctctgcccggccgcccatcgtctg agatgtggggagcgcctctgccccgccgccccatctgggatgtga >gi568815583r:41287447_41497059|GENSCAN_predicted_peptide_2|337_aa MGHGETARIKGCGKRIEESVMGAERKFSKPTSALYPFLGIRFAEYSSSLQKPVASPGKAS SQRKTEGDLQGDHQKEVALDITSSEEKPDVSFDKAIRDEAIYHFRLLKDEIVDHWRGPEG HPLHEVLLEQAKVVWQFRGKEDLDKWTVTSDKTIGGRSEVFLKMGKNNQSALLYGTLSSE APQDGESTRSGYCAMISRIPRGAFERKMSYDWSQFNTLYLRVRGDGRPWMVNIKEDTDFF QRTNQMYSYFMFTRGGPYWQEVKIPFSKFFFSNRGRIRDVQHELPLDKISSIGFTLADKV DGPFFLEIDFIGVFTDPAHTEEFAYENSPELNPRLFK >gi568815583r:41287447_41497059|GENSCAN_predicted_CDS_2|1014_bp atgggccatggagagacagcccgcattaaaggttgtgggaagagaatagaagagtcagtg atgggggcagaaagaaaattctctaagccaacttctgccttgtatccatttttgggtatt cgctttgcagagtattccagtagtcttcagaaaccagtggcttctcctggcaaagcctcc tcacagaggaagactgaaggggatttgcaaggagatcaccagaaagaagttgctttggat ataacttcttctgaggagaagcctgatgttagtttcgataaagcaattagagatgaagca atataccattttaggcttttgaaggatgaaattgtggatcattggagaggaccggaaggc caccctctgcatgaggtcttgctggaacaagccaaggttgtctggcaattccgggggaaa gaagatttggataagtggacagtgacttctgataagacgattggaggcagaagtgaagtg tttttgaaaatgggcaagaataaccaaagtgcactgctatatggaactctgagctctgag gcgcctcaggacggggagtctacccgaagtgggtactgtgcaatgatatccaggattcca aggggtgcttttgagaggaagatgtcttacgattggtcccagttcaatactctgtatctc cgtgtacgtggggatggtcggccttggatggtgaatatcaaggaggacacagatttcttc cagaggacgaatcagatgtatagttacttcatgttcacccgcgggggaccctactggcag gaggtcaagattcctttttccaaatttttcttctctaatcgaggaagaatccgggatgtt cagcatgagcttccgcttgataagatctcttctataggattcaccttggctgataaagtg gatggtccattcttcctggagatagattttattggcgtgtttactgatccagctcataca gaagaatttgcctatgaaaattctccagagcttaacccaaggctttttaaataa >gi568815583r:41287447_41497059|GENSCAN_predicted_peptide_3|713_aa MRGRLCVGRAAAAAAAVAVPLAGGQEGSPGGGRRGSRGTTMVKKRKGRVVIDSDTEDSGS DENLDQELLSLAKRKRSDSEEKEPPVSQPAASSDSETSDSDDEWTFGSNKNKKKGKARKI EKKGTMKKQANKTASSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSDSDSSSEDEEFHD GYGEDLMGDEEDRARLEQMTEKEREQELFNRIEKREVLKRRFEIKKKLKTAKKKEKKEKK KKQEEEQEKKKLTQIQESQVTSHNKERRSKRDEKLDKKSQAMEELKAEREKRKNRTAELL AKKQPLKTSEVYSDDEEEEEDDKSSEKSDRSSRTSSSDEEEEKEEIPPKSQPVSLPEELN RVRLSRHKLERWCHMPFFAKTVTGCFVRIGIGNHNSKPVYRVAEITGVVETAKVYQLGGT RTNKGLQLRHGNDQRVFRLEFVSNQEFTESEFMKWKEAMFSAGMQLPTLDEINKKELSIK EALNYKFNDQDIEEIVKEKERFRKAPPNYAMKKTQLLKEKAMAEDLGDQDKAKQIQDQLN ELEERAEALDRQRTKNISAISYINQRNREWNIVESEKALVAESHNMKNQQMDPFTRRQCK PTIVSNSRDPAVQAAILAQLNAKYGSGVLPDAPKEMSKASVGQGKDKDLNSKSASDLSED LFKVHDFDVKIDLQVPSSESKALAITSKAPPAKDGAPRRSLNLEDYKKRRGLI >gi568815583r:41287447_41497059|GENSCAN_predicted_CDS_3|2142_bp atgcgcggtcgcctttgtgtgggtcgagcagcggcggcggcggcggcagtggcggtccca ctggcaggcgggcaagaggggagtccgggcggcggccggcgtgggagccgggggaccacc atggtaaagaagcggaaaggccgcgtcgtgatcgactcggacacagaggacagcggcagc gacgagaacctggatcaggagctcttgtccctggcaaagcgaaagcgcagtgactctgag gagaaggagccgcctgtgagtcagcctgcagcctcgtcagactcggagacgtctgacagt gacgatgagtggacatttgggagcaataaaaataagaagaaaggaaaagccagaaaaata gagaagaaaggaaccatgaagaaacaggccaacaaaactgcctcctcaggcagttcagac aaagacagttcagctgagagctcagcccctgaggaaggtgaagtgtcagactctgacagc aacagctcctcttccagttcagattcagactcttcctcagaagatgaagagttccatgat ggctatggagaagacctcatgggagatgaggaagacagggcccgtctggaacagatgaca gagaaagagagagagcaagaactgttcaatcgcatagagaagagggaggtgttgaaaaga agatttgaaatcaagaaaaaactaaaaacagccaaaaagaaagaaaagaaagaaaagaag aaaaagcaagaagaggagcaagaaaagaaaaaactgacacagattcaagaatctcaggta acatcccacaacaaggaacggcgttccaagcgggatgagaaactagacaagaaatctcaa gccatggaggagctaaaagcagagcgagaaaaacgaaagaacagaacagctgagctcctt gccaaaaaacagccattaaaaaccagtgaggtctactctgatgatgaagaggaggaagag gatgacaaatccagtgaaaagtcagaccgctcatcacgaacatcatcgtctgatgaagaa gaggagaaagaagagatccctcccaaatcccaaccagtttccttacctgaagaattgaat cgggttcgattatcacggcataagctagaacgctggtgtcacatgcccttctttgctaaa actgtcacaggatgttttgtgcggattggcatcggaaaccacaacagcaaaccagtttac cgggtcgctgagattacgggtgttgtggaaactgccaaagtttaccaactaggtggcacc agaacaaacaaagggctgcaactacggcatggcaatgaccaacgcgtgttccgtttagag tttgtctcaaaccaagaattcaccgaaagtgagtttatgaagtggaaagaagcgatgttc tctgctggcatgcagttgcccactctagatgaaatcaataaaaaggaattatctattaaa gaagctcttaattataaattcaatgatcaggacattgaagagattgtaaaagagaaagaa aggttcagaaaagctccacccaactacgctatgaagaagactcagctactgaaggaaaag gccatggctgaggacctgggggatcaggacaaggccaaacaaatccaagatcaactgaat gagctggaggaacgggcagaggccctggaccgccagcggaccaagaacatatccgctatc agttacatcaaccagcggaaccgggagtggaacattgtagagtctgagaaggcccttgtg gctgaaagtcacaacatgaaaaaccaacagatggatccctttactcggcggcagtgcaag cctaccatcgtttctaattccagagacccagctgttcaagctgccatcttggcccagctg aatgcaaaatacggttctggagtgttaccagatgctccaaaggaaatgagcaaggcaagt gtgggtcaaggcaaagataaagatttgaattctaagtcagccagtgacctctcagaagat ctgttcaaagtacacgattttgatgtgaagattgacttacaagttcccagctcagagtca aaggctttagccatcacctccaaggctccgccagccaaggatggggctccaaggagatct ctgaacttggaagactacaaaaaacgacgagggcttatttga >gi568815583r:41287447_41497059|GENSCAN_predicted_peptide_4|98_aa MDFPVPTKVKVAIVNSLAAWFPGSLSAMDSPASLSACDAAQPFTWQARKPQVDSISFAGR ALRRSPLGVSTTPRTGLGATLVRANGPRIPGPVRLLRR >gi568815583r:41287447_41497059|GENSCAN_predicted_CDS_4|297_bp atggacttccctgtgcccacaaaagtgaaagtggccatcgtcaactctttagcagcctgg ttccccggaagcctctctgccatggatagccctgcttcgctaagcgcgtgcgatgcagca cagcccttcacctggcaagcccggaagcctcaggttgactccatcagttttgccgggaga gcccttcggcgctccccgcttggtgtctccaccaccccccgcaccggcctgggcgccacc cttgtccgcgccaacggtccccgcatccctggccccgtgcgcctcctgcgccgttag >gi568815583r:41287447_41497059|GENSCAN_predicted_peptide_5|154_aa MARPGGARPCSPGLERAPRRSVGELRLLFEARCAAVAAAAAAGEPRARGAKRRGGQVPNG LPRAPPAPVIPQLTVTAEEPDVPPTSPGPPERERDCLPAAGSSHLQQPRRLSTSSVSSTG SSSLLEDSEDDLLSDSESRSRGNVQLEAGEDVGQ >gi568815583r:41287447_41497059|GENSCAN_predicted_CDS_5|462_bp atggcgcggccggggggcgcgaggccctgcagcccggggctggagcgggccccgcgccgg agtgtcggggagctgcgcctgctcttcgaggcgcgctgtgcggcggtcgctgcggccgcc gccgcgggggagccccgggcccgcggggccaagcggcgtgggggacaggtccccaacggg cttccgcgggctcccccggccccggtgatccctcagctgaccgtgacagccgaggagccc gacgtgcccccgaccagccctgggccgccggagcgggagagggactgcctcccggcagcg ggctcttcgcacctgcagcagccgcgccgcctttccacctcgtcggtctcctccactggc tcctcgtcgctgctcgaggactcggaggacgacctgctgagcgacagtgagagccggagc cgcggcaacgtgcagctggaagcgggcgaggacgtgggtcag