GENSCAN 1.0    Date run: 4-Nov-116    Time: 06:29:53

Sequence gi568815592r:39806360_40027578 : 221219 bp : 45.47% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    984    979    6                               1.05
 1.03 Term -   3194   3181   14  0  2  116   54     4 0.333  -1.84
 1.02 Intr -   6758   6687   72  0  0  114   47    74 0.871   5.28
 1.01 Init -   8528   8459   70  2  1   59   69   105 0.898   6.91
 1.00 Prom -  22478  22439   40                              -1.36

 2.00 Prom +  28344  28383   40                              -2.46
 2.01 Init +  46212  46369  158  2  2   75   57   156 0.518   8.66
 2.02 Intr +  47287  47300   14  1  2   55  116    11 0.487  -4.98
 2.03 Intr +  49888  50111  224  2  2  118   49   291 0.454  26.05
 2.04 Intr +  54569  54658   90  1  0  107   44   113 0.932   8.99
 2.05 Intr +  58074  58148   75  2  0  119  101   118 0.999  15.91
 2.06 Intr +  58621  58715   95  0  2   88   76   145 0.988  12.06
 2.07 Intr +  61151  61484  334  2  1   41   94   401 0.994  31.57
 2.08 Intr +  62464  62574  111  0  0  101   59   248 0.999  23.78
 2.09 Intr +  63981  64084  104  2  2   71  108   159 0.999  15.17
 2.10 Intr +  65147  65213   67  2  1  101   77   105 0.999   9.61
 2.11 Intr +  66879  66996  118  2  1  148   95   206 0.999  27.34
 2.12 Intr +  68971  69109  139  2  1   58   58   268 0.999  20.32
 2.13 Intr +  71844  71902   59  0  2   77  100    87 0.999   7.33
 2.14 Intr +  72045  72205  161  1  2   96   28   203 0.511  14.81
 2.15 Intr +  73023  73118   96  2  0   92   83    61 0.952   6.21
 2.16 Intr +  77603  77710  108  1  0   40  110   156 0.995  13.48
 2.17 Intr +  81127  81233  107  0  2   73   55   183 0.978  12.51
 2.18 Intr +  82320  82404   85  0  1   20   94   152 0.978   8.82
 2.19 Intr +  84982  85088  107  0  2  100   47   228 0.632  18.91
 2.20 Intr +  85275  85363   89  0  2   82   97   163 0.999  16.21
 2.21 Intr +  90453  90621  169  1  1   88   94   244 0.999  24.00
 2.22 Intr +  90816  90923  108  0  0   96   61   107 0.995   8.20
 2.23 Intr +  92518  92578   61  1  1  115   89    24 0.542   3.94
 2.24 Intr +  93718  93849  132  0  0   84  116   213 0.947  24.54
 2.25 Intr +  94943  95113  171  1  0   84   77   278 0.999  26.44
 2.26 Term +  95454  95678  225  2  0   80   43   351 0.999  26.58
 2.27 PlyA +  98221  98226    6                               1.05

 3.10 PlyA -  98985  98980    6                               1.05
 3.09 Term - 100488  99998  491  1  2   92   42   539 0.842  44.52
 3.08 Intr - 102743 102696   48  0  0  112   72    73 0.989   6.75
 3.07 Intr - 103596 103476  121  0  1   22  111   120 0.993   7.77
 3.06 Intr - 106015 105905  111  1  0  101  105   125 0.999  16.08
 3.05 Intr - 106645 106533  113  2  2   54   72   157 0.923  10.80
 3.04 Intr - 107069 106958  112  2  1   81   61   130 0.984   9.55
 3.03 Intr - 107476 107415   62  2  2   75  100   166 0.993  14.95
 3.02 Intr - 109873 109709  165  2  0  109  105   226 0.994  26.33
 3.01 Init - 110834 110825   10  2  1   78   74    16 0.619  -1.07
 3.00 Prom - 118349 118310   40                              -6.46

 4.06 PlyA - 118404 118399    6                               1.05
 4.05 Term - 119486 119233  254  0  2  121   44   452 0.997  39.90
 4.04 Intr - 121096 120970  127  1  1  109   76   150 0.992  16.15
 4.03 Intr - 128261 127936  326  0  2   78   83   199 0.783  13.89
 4.02 Intr - 138662 138597   66  0  0  115   76     6 0.112   0.98
 4.01 Init - 152291 152135  157  2  1   80   18   152 0.032   7.47
 4.00 Prom - 156006 155967   40                              -4.86

 5.00 Prom + 163765 163804   40                              -4.76
 5.01 Init + 165105 165449  345  2  0   49   55   114 0.323   1.42
 5.02 Intr + 168361 168417   57  0  0   91   92    55 0.536   5.28
 5.03 Intr + 176989 177081   93  0  0   88   92    18 0.015   2.36
 5.04 Intr + 186480 186650  171  2  0   44   36   260 0.000  16.54
 5.05 Intr + 193275 193538  264  2  0   35  -15   287 0.076  10.81
 5.06 Term + 193639 193845  207  0  0   14   48   227 0.097   8.64
 5.07 PlyA + 194222 194227    6                               1.05

 6.03 PlyA - 196235 196230    6                               1.05
 6.02 Term - 216315 216199  117  0  0   88   34    88 0.489   1.94
 6.01 Intr - 216736 216636  101  2  2  109   86    33 0.293   5.03

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 117003 117121  119  2  2   86   81    76 0.879   4.50
S.002 Sngl - 152291 152052  240  2  0   80   37   211 0.825  10.28
S.003 Init - 178103 178055   49  2  1   72   88    53 0.813   5.04
S.004 Sngl + 186456 186689  234  2  0   78   51   317 0.928  22.10
S.005 Sngl + 193594 193845  252  0  0   87   48   265 0.880  17.49

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815592r:39806360_40027578|GENSCAN_predicted_peptide_1|51_aa
MEEERDEYPVEFNLENFLEEEPREENVIRLVPSDELGRQAPRLWFVRGQPL

>gi568815592r:39806360_40027578|GENSCAN_predicted_CDS_1|156_bp
atggaggaggagagggatgagtatccagtggaattcaacctggaaaacttcctggaggaa
gagccacgtgaggaaaatgtcatccggttagtgccctcagatgaactgggaaggcaagct
ccccggttgtggtttgtgaggggacaaccattgtga

>gi568815592r:39806360_40027578|GENSCAN_predicted_peptide_2|1068_aa
MGMEARCRVALPLVSPSVLQAFHFYYFLYSSLKPSEEDPVIRVSRMRKLGLQSRAKKDHN
EDLGHLSADAPWPAVTMAPRKRSHHGLGFLCCFGGSDIPEINLRDNHPLQFMEFSSPIPN
AEELNIRFAELVDELDLTDKNREAMFALPPEKKWQIYCSKKKEQEDPNKLATSWPDYYID
RINSMAAMQSLYAFDEEETEMRNQVVEDLKTALRTQPMRFVTRFIELEGLTCLLNFLRSM
DHATCESRIHTSLIGCIKALMNNSQGRAHVLAQPEAISTIAQSLRTENSKTKVAVLEILG
AVCLVPGGHKKVLQAMLHYQVYAAERTRFQTLLNELDRSLGRYRDEVNLKTAIMSFINAV
LNAGAGEDNLEFRLHLRYEFLMLGIQPVIDKLRQHENAILDKHLDFFEMVRNEDDLELAR
RFDMVHIDTKSASQMFELIHKKLKYTEAYPCLLSVLHHCLQMPYKRNGGYFQQWQLLDRI
LQQIVLQDERGVDPDLAPLENFNVKNIVNMLINENEVKQWRDQAEKFRKEHMELVSRLER
KERECETKTLEKEEMMRTLNKMKDKLARESQELRQARGQVAELDPYPSSDVPLRKKRVPQ
PSHPLKSFNWVKLNEERVPGTVWNEIDDMQVFRILDLEDFEKMFSAYQRHQKELGSTEDI
YLASRKVKELSVIDGRRAQNCIILLSKLKLSNEEIRQAILKMDEQEDLAKDMLEQLLKFI
PEKSDIDLLEEHKHEIERMARADRFLYEMSRIDHYQQRLQALFFKKKFQERLAEAKPKVE
AILLASRELVRSKRLRQMLEVILAIGNFMNKGQRGGAYGFRVASLNKIADTKSSIDRNIS
LLHYLIMILEKHFPDILNMPSELQHLPEAAKVNLAELEKEVGNLRRGLRAVEVELEYQRR
QVREPSDKFVPVMSDFITVSSFSFSELEDQLNEARDKFAKALMHFGEHDSKMQPDEFFGI
FDTFLQAFSEARQDLEAMRRRKEEEERRARMEAMLKEQRERERWQRQRKVLAAGSSLEEG
GEFDDLVSALRSGEVFDKDLCKLKRSRKRSGSQALEVTRERAINRLNY

>gi568815592r:39806360_40027578|GENSCAN_predicted_CDS_2|3207_bp
atgggaatggaggcccgctgcagggtggcgcttcctctagtgtcgcccagtgtgctccaa
gcatttcacttctattacttcctgtactcctcactgaaacccagcgaggaagaccctgtg
atcagggtgtcacggatgaggaagctggggcttcagagtagagccaagaaagatcacaat
gaggacctagggcatctgtctgctgacgccccctggcctgcagtgaccatggccccccgc
aagaggagccaccatggcctgggcttcctgtgctgcttcgggggcagtgacatccccgaa
atcaacctccgggacaaccaccctctgcagttcatggagttctccagccccatcccgaac
gcagaggagctcaacatccgctttgcagagctggtggatgaattggatctcactgacaaa
aaccgagaggctatgtttgcactgccccctgagaagaaatggcagatctactgcagcaag
aagaaggagcaggaggaccccaacaagctggcaaccagctggcctgactattacatcgac
cgcatcaattccatggctgcgatgcagagtctgtacgcgtttgatgaggaggagacggag
atgaggaaccaagtcgtggaagacctgaagacagccctccggacacagcctatgaggttt
gtgacccgcttcattgagctggagggcttgacctgtctgctaaatttcctccggagcatg
gaccacgccacctgtgagagccgcatccacacctcactcattggctgcatcaaagcattg
atgaacaactcccaggggcgggcacatgtgctggcacagcctgaggccattagtaccata
gcccagagcctacgcacagagaacagcaagaccaaggtggctgtgctggagatcctgggt
gctgtgtgcctcgtgcctggtggccacaagaaggtgctgcaggccatgctgcactaccag
gtgtatgcagcagagcgaacccgcttccagaccctgctgaacgagctagaccgaagtctg
ggccggtaccgggatgaagtgaatctgaaaacagccatcatgtccttcatcaatgctgtc
ctcaatgctggagctggagaggataatctggagttccgcctacatctacggtatgaattc
ctgatgctgggtatacagcctgtgattgacaagctccggcaacatgaaaatgccatcctg
gacaaacatttagacttcttcgagatggtgcggaatgaggatgacctggagctagccagg
aggtttgacatggtccacatcgacaccaagagtgcttcccagatgtttgagttgatccac
aagaagctgaagtacacggaggcctacccctgcctgctctctgtgctgcaccactgcctg
cagatgccctacaaacggaacggtggctacttccagcagtggcagctcctggaccgcatc
ctccagcagattgtcctccaggatgagcggggtgtggaccctgacctggctcccttggag
aacttcaatgtcaagaacatcgtcaacatgctcatcaacgagaatgaagtgaaacagtgg
cgagaccaggcagagaagttccggaaagaacacatggagcttgtgagccgtctggagagg
aaggagcgggaatgcgagacaaagacattggagaaggaagagatgatgcggacgctgaac
aaaatgaaggacaagctggcccgggagtcccaggagctgcgccaggctcggggacaagtg
gcagagctggacccctaccccagcagtgacgtcccactcaggaaaaagcgtgtcccccag
ccttctcacccactgaagtccttcaactgggtgaagctgaatgaggagcgtgtccctggc
accgtatggaatgagattgatgacatgcaggtatttcggatcctggacctagaggatttt
gaaaagatgttttcagcctaccagaggcaccagaaagagctgggctccactgaagacatc
tacctggcttcccgcaaggtcaaagagctgtcggtcattgatggccggagggcccaaaac
tgcatcatccttctttccaagttgaagctttctaacgaggagatccggcaggccatcttg
aagatggatgagcaggaggaccttgctaaggacatgctggagcagctcctcaagttcatc
ccagagaagagtgacattgacctcctggaggagcacaagcatgaaattgagcggatggcc
cgtgctgaccgcttcctctatgaaatgagcaggattgaccactaccagcagcgactgcaa
gccctcttcttcaagaagaaattccaggagcggctggctgaggcaaagcccaaagtggaa
gccatcctgttggcctcccgggagctggtccgcagcaagcgtcttagacagatgctagag
gtcatcctagccataggcaacttcatgaacaaagggcagcgtgggggcgcctacgggttc
cgggtggccagcctcaacaagatcgctgacaccaagtccagcatcgacagaaacatctct
ctgctccattacctgatcatgatcctggagaagcattttcctgatattctaaacatgcct
tcagagctgcaacatcttccagaagctgccaaagtcaacctagcagaactggagaaggag
gtgggcaacctcaggaggggcctgagagcggtggaggtggagctggagtatcagaggcgc
caggtacgggagcccagtgacaagtttgtccctgtcatgagcgacttcatcacggtgtcc
agcttcagcttctccgagctggaggaccagctaaatgaggccagggacaagttcgccaag
gccttgatgcacttcggggagcatgacagcaagatgcagccagacgaattctttggcatc
tttgataccttcttgcaggccttctcagaggcccggcaggatctagaggccatgaggagg
aggaaggaggaggaggagcggcgggcgcgcatggaagccatgctgaaggagcagagggaa
cgtgagcggtggcagcggcagcggaaggtcctggctgcaggcagctcgctggaggaggga
ggagagttcgatgacctggtgtcggccctgcgctctggggaggtcttcgacaaggactta
tgcaagctcaagcgcagccgcaagcgatcagggagccaggccctggaagttacccgggag
cgggcaataaaccggctaaattattga

>gi568815592r:39806360_40027578|GENSCAN_predicted_peptide_3|410_aa
MLNSQLQRLEGLRTIGVTTNGINLARLLPQLQKAGLSAINISLDTLVPAKFEFIVRRKGF
HKVMEGIHKAIELGYNPVKVNCVVMRGLNEDELLDFAALTEGLPLDVRFIEYMPFDGNKW
NFKKMVSYKEMLDTVRQQWPELEKVPEEESSTAKAFKIPGFQGQISFITSMSEHFCGTCN
RLRITADGNLKVCLFGNSEVSLRDHLRAGASEQELLRIIGAAVGRKKRQHAGMFSISQMK
NRPMILIGPQLTSEQLTHVDSEGRAAMVDVGRKPDTERVAVASAVVLLGPVAFKLVQQNQ
LKKGDALVVAQLAGVQAAKVTSQLIPLCHHVALSHIQVQLELDSTRHAVKIQASCRARGP
TGVEMEALTSAAVAALTLYDMCKAVSRDIVLEEIKLISKTGGQRGDFHRA

>gi568815592r:39806360_40027578|GENSCAN_predicted_CDS_3|1233_bp
atgctgaactcccagctccagcggctggaagggctgagaaccataggtgttaccaccaat
ggcatcaacctggcccggctactgccccagcttcagaaggctggtctcagtgccatcaac
atcagcctggacaccctggtgcctgccaagtttgagttcattgtccgcaggaaaggcttc
cacaaggtcatggagggcatccacaaggccatcgagctgggctacaaccctgtgaaggtg
aactgtgtggtgatgcgaggccttaacgaggatgaactcctggactttgcggccttgact
gagggcctccccctggatgtgcgcttcatagagtatatgccctttgatggcaacaagtgg
aacttcaagaagatggtcagctataaggagatgctagacactgtccggcagcagtggcca
gagctggagaaggtgccagaggaggaatccagcacagccaaggcctttaaaatccctggc
ttccaaggccagatcagcttcatcacatccatgtctgagcatttctgtgggacctgcaac
cgcctgcgaatcacagctgatgggaacctcaaggtctgcctctttggaaactctgaggta
tccctgcgggatcacctgcgagctggggcctctgagcaggagctgctgagaatcattggg
gctgctgtgggcaggaagaagcggcagcatgcaggcatgttcagtatttcccagatgaag
aaccggcccatgatcctcatcggaccccagctaacctcagaacaactaactcatgtggac
tcggaaggacgggcagctatggtagatgtgggcaggaagccagacacagagcgggtggct
gtggcttcagccgtggtcctcctgggaccggtagccttcaagcttgtccagcagaaccag
ctcaagaaaggagatgccctagtggtggcccagctggctggagtccaggcagccaaggtg
accagccagctgatccctctgtgccaccacgtggccctgagccacatccaggtgcagctg
gagctggacagcacacgccatgccgtgaagatccaggcatcttgccgggctcggggcccc
accggggtggagatggaggccctgacctctgctgcagtggccgccctcaccctgtatgac
atgtgcaaggctgtcagcagggacatcgtgttggaggagatcaagctcattagcaagact
ggtggtcagcggggggacttccatcgggcttag

>gi568815592r:39806360_40027578|GENSCAN_predicted_peptide_4|309_aa
MATVKKGKPELRKKVHPAVVIRQRKSYRRKDGVFLYFEDNAGVTVNNKGEMKVSCQGSPL
DRPHQNQMGRVIPAERYPGPGKWAQGAYYSSHKATRDCVLAYWLRGSRPPPAPPRPCWDD
PGGECPASVLPKRAAVPLVSGFMAARPLSRMLRRLLRSSARSCSSGAPVTQPCPGESARA
ASEEVSRRRQFLREHAAPFSAFLTDSFGRQHSYLRISLTEKCNLRCQYCMPEEGVPLTPK
ANLLTTEEILTLARLFVKEGIDKIRLTGGEPLIRPDVVDIVGELDESHRHFSQRCLIRLD
TDADVNSGS

>gi568815592r:39806360_40027578|GENSCAN_predicted_CDS_4|930_bp
atggccacagtcaagaaaggcaaaccagagctcagaaaaaaggtacatccagcagtggtt
attcgacaacgaaagtcataccgcagaaaagatggcgtgtttctttattttgaagataat
gcaggggtcacagtgaacaataaaggcgagatgaaagtctcctgccagggctccccactg
gacaggccccatcagaaccagatggggagggtcatccctgcagagcgctaccccggccca
ggaaagtgggcacagggagcgtactacagctcccacaaggccacgcgggactgcgtcctc
gcctattggctgcgcgggtcccgccctccccctgcccctccccgcccttgctgggatgac
cccgggggcgagtgcccggccagtgtgctcccgaagcgggctgcggttccgctcgtatca
ggcttcatggcggcgcggccactgtcccggatgctgcggcggcttctgaggtccagcgcc
cggagctgcagctcaggggctccggtgacccagccctgccccggggagtccgcgcgagct
gcctcggaggaggtgtccaggcggaggcagttcctgcgggagcatgcggcccccttctcc
gccttcctcacagacagcttcggccggcagcacagctacctgcggatctccctcacagag
aagtgcaacctcagatgtcagtactgcatgcccgaggagggggtcccgctgacccccaaa
gccaacctgctgaccacagaggagatcctgaccctcgcccggctctttgtgaaggaaggc
atcgacaagatccggctcacaggtggagagccgcttatccggccggacgtggtggacatt
gtgggtgagttggacgaaagtcatcgccacttctcccagcgctgcctcatccggctggac
actgatgccgatgttaacagtggcagctga

>gi568815592r:39806360_40027578|GENSCAN_predicted_peptide_5|378_aa
MWKQLWNWATGRSWNSLEGSEEDRKIWESLELRDLLNGFDQNPDSDMDNDVQAEVVSDGD
EELVGNWSKGDSCYVLVKRLAAFCPCPTDLWNVELERDDLGYLVEEISKQQNIQKDPVSP
ESGAEQATCDGKGENCELRFIVLTQGYSQPGMRPAPSLGQVPPFKASHCGNQISVKFLEV
IHDEHGIDPTITYHGDSDLQLDCISMYYDEATGGKHVLHTILCSAFTQSDTKVEPYSATL
SVHQLVENTDETYSIDNEALYNICSHTLKRTTPTYRDMNPLVSATMNSVITCLCFPGQLN
ADLHKLAVNMHVFNAKNMMAACEPRHGLYFTLAAVFTGQISMKDVNDKMLNMQNKNSSYF
VEWISNNVKNSLTSHLMA

>gi568815592r:39806360_40027578|GENSCAN_predicted_CDS_5|1137_bp
atgtggaagcaactttggaactgggcaacaggcagaagttggaacagtttggagggctca
gaagaagacaggaaaatatgggaaagtttggaacttagagacttgttgaatggctttgac
caaaatcctgatagtgatatggacaatgatgtccaggctgaggtggtctcagatggagat
gaggaacttgttgggaactggagcaaaggtgactcttgttatgttttagtaaagagactg
gcggcattttgcccctgccctacagatttgtggaatgttgagcttgagagagatgattta
gggtatctggtagaagaaatttctaagcagcaaaacattcaaaaggacccagtgagcccg
gagtctggagcagagcaggccacctgtgatggcaaaggagagaactgtgagctcaggttt
attgtcctcactcaaggctatagtcaaccaggaatgagacctgctccttcacttgggcaa
gttcctcccttcaaggccagtcactgtggcaaccagatcagtgtcaaattcttggaagtg
atccatgatgaacatggcattgaccccaccatcacctaccatggtgacagtgacctgcag
ctggactgcatctccatgtactacgacgaagccacaggtggcaaacatgttcttcatacc
atcctgtgtagtgccttcacccaaagtgacaccaaggtcgagccctacagtgccaccctc
tctgtccatcagttggtagagaacactgatgagacgtattccattgacaatgaagccctg
tataacatctgctcccacactctgaagcggaccacaccaacctacagggatatgaacccc
ctcgtctcagcaaccatgaacagtgtcatcacctgcctctgtttccctggccagctcaat
gccgacctccacaagttggcagtcaacatgcatgtcttcaatgccaagaacatgatggct
gcctgtgaaccccgccatggcttatacttcaccttggctgctgtcttcactggtcagata
tccatgaaggacgtcaatgacaaaatgttaaatatgcaaaacaaaaacagcagctacttt
gtggaatggatctccaacaacgtcaagaacagtctgacatcccatctcatggcctga

>gi568815592r:39806360_40027578|GENSCAN_predicted_peptide_6|72_aa
XVPAKAGSERRPWAECEMCGSSAGVGMTAVARVKILLHLQLVSYPGYLQMQVKGQLSLSE
DAAQWRDLEGRE

>gi568815592r:39806360_40027578|GENSCAN_predicted_CDS_6|219_bp
nctgtgcctgcaaaagccggaagtgaaagaaggccctgggcagaatgtgagatgtgcggt
tccagtgccggagttggtatgacagctgtagctagagtaaagatcttgttacatctccag
ctggtgtcctacccaggctacctgcagatgcaggttaaggggcaattaagtctctcggaa
gatgctgcccagtggagggacttggaagggagagaatag