GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:30:37 Sequence gi568815586f:2917431_3140475 : 223045 bp : 49.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3333 3492 160 2 1 75 66 77 0.859 3.76 1.02 Intr + 4832 4972 141 0 0 94 77 180 0.961 17.82 1.03 Intr + 12818 12915 98 0 2 72 50 90 0.866 3.33 1.04 Intr + 13607 13810 204 1 0 95 86 184 0.998 18.30 1.05 Intr + 15988 16100 113 0 2 120 35 129 0.971 9.98 1.06 Intr + 17017 17131 115 1 1 71 73 95 0.934 6.75 1.07 Intr + 20201 20299 99 1 0 104 92 58 0.968 8.11 1.08 Intr + 20684 20855 172 1 1 58 77 140 0.902 9.42 1.09 Intr + 21881 21994 114 0 0 108 72 94 0.514 10.22 1.10 Intr + 41326 41472 147 2 0 34 40 137 0.242 3.71 1.11 Intr + 53203 53330 128 2 2 30 57 73 0.080 -1.20 1.12 Intr + 53664 53796 133 2 1 44 63 146 0.663 7.92 1.13 Intr + 57049 57116 68 2 2 108 34 18 0.208 -2.98 1.14 Intr + 57288 57370 83 2 2 122 39 58 0.312 2.74 1.15 Intr + 60892 60935 44 1 2 81 98 56 0.314 3.78 1.16 Intr + 77308 77562 255 2 0 120 91 367 0.862 37.62 1.17 Term + 82585 82601 17 2 2 145 43 6 0.693 0.20 1.18 PlyA + 84347 84352 6 -0.45 2.00 Prom + 88961 89000 40 -4.86 2.01 Init + 93083 93266 184 1 1 75 68 109 0.698 6.88 2.02 Intr + 93574 93701 128 2 2 90 86 121 0.506 12.50 2.03 Intr + 94740 94802 63 2 0 54 116 98 0.930 8.11 2.04 Intr + 99968 100096 129 1 0 107 98 102 0.995 13.99 2.05 Intr + 101115 101158 44 2 2 101 61 -8 0.607 -5.16 2.06 Intr + 101685 101740 56 0 2 145 94 3 0.850 5.22 2.07 Intr + 103204 103343 140 2 2 90 90 118 0.678 12.38 2.08 Intr + 104414 104587 174 1 0 85 94 277 0.826 28.14 2.09 Intr + 109043 109133 91 1 1 87 67 46 0.739 1.97 2.10 Intr + 111059 111127 69 0 0 60 103 35 0.602 1.35 2.11 Intr + 115970 116010 41 0 2 96 97 14 0.857 1.04 2.12 Intr + 120538 120678 141 0 0 124 75 314 0.971 34.25 2.13 Intr + 122677 122829 153 0 0 73 69 326 0.996 29.47 2.14 Term + 122935 123048 114 0 0 166 54 119 0.955 14.87 2.15 PlyA + 123214 123219 6 1.05 3.14 PlyA - 124043 124038 6 1.05 3.13 Term - 132162 132102 61 2 1 100 48 85 0.174 2.98 3.12 Intr - 153255 153183 73 1 1 87 100 89 0.514 8.46 3.11 Intr - 154731 154004 728 2 2 -15 22 910 0.777 65.18 3.10 Intr - 160949 160831 119 2 2 33 110 17 0.000 -2.34 3.09 Intr - 168399 168187 213 0 0 116 91 141 0.895 16.11 3.08 Intr - 175448 175265 184 1 1 64 82 53 0.806 2.09 3.07 Intr - 175720 175582 139 2 1 24 66 86 0.811 -0.48 3.06 Intr - 179078 178916 163 2 1 84 100 56 0.797 5.95 3.05 Intr - 198015 197892 124 2 1 91 27 70 0.140 1.79 3.04 Intr - 199519 199441 79 2 1 83 57 65 0.158 1.51 3.03 Intr - 205565 205397 169 2 1 95 44 70 0.263 2.82 3.02 Intr - 213651 213514 138 0 0 -24 74 243 0.937 12.36 3.01 Init - 214330 214025 306 1 0 33 100 157 0.918 8.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 155990 155986 5 2 2 87 92 0 0.970 -0.23 S.002 Term - 173683 173432 252 1 0 63 39 191 0.950 7.34 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:2917431_3140475|GENSCAN_predicted_peptide_1|696_aa RLLLEKRQRKKRLEPFMVQPNPEARLRRAKPRASDEQTPLVNCHTPHSNVILHGIDGPAA VLKPDEVHAPSVSSSVVEEDAENTVDTASKPGLQERLQKHDISESVNFDEETDGISQSAC LERPNSASSQNSTDTGTSGSATAAQPADNLLGDIDDLEDFVYSPAPQGVTVRCRIIRDKR GMDRGLFPTYYMYLEKEENQKIFLLAARKRKKSKTANYLISIDPVDLSREGESYVGKLRS NLMGTKFTVYDRGICPMKGRGLVGAAHTRQELAAISYETNVLGFKGPRKMSVIIPGMTLN HKQIPYQPQNNHDSLLSRWQNRTMENLVELHNKAPVWNSDTQSYVLNFRGRVTQASVKNF QIVHKNDPDYIVMQFGRVADDVFTLDYNYPLCAVQAFGIGLSSFDMAVLQMPKGQERKTE RAKAGEGKRGGTGSGSGNGAREFLLPGSPEQVSFVLHNEMDFRSEDPNLLFAVKSTCLLE VRNDSILMTRQVPIPRQVQREYGTEALVPVLMELAGKWTVKLAVTYCKTKERAQAGSVAV PEPWPSPEARVFILPPGGADAFPEGRACTEAPPQMPVLLGHLVWADPRVLTGTFSSSETE TLYPLNSPTSAPPSGALEGTAGTITSNEWSSPTSPEGSTASGGSQALDKPIDNDAEGVWS PDIEQSFQEALAIYPPCGRRKIILSDEGKMYGGPSY >gi568815586f:2917431_3140475|GENSCAN_predicted_CDS_1|2091_bp aggctactacttgagaagaggcaaaggaaaaagcgccttgagccatttatggtgcagccc aatccagaagccaggctacgtcgggcaaagccaagggccagtgatgagcagactcccttg gtgaactgtcatactccccacagcaatgtcatcttacatggtattgatggtccagctgct gtcctgaaaccagacgaagttcatgctccatcagtaagctcctctgttgtggaagaagat gctgaaaacaccgtggatactgcttccaagccaggacttcaggagcgtctccaaaagcat gatatctctgaaagtgtgaacttcgatgaggagactgatggaatatcccagtcagcatgt ttagaaagacccaattctgcatcaagccagaattcaaccgatacaggcacttccggttct gctactgccgcccaaccagctgataacctcctgggagacatagacgacctggaggacttt gtgtatagtcctgcccctcaaggtgtcacagtaagatgtcggataatccgggataaaagg ggaatggatcggggtctcttccccacctactatatgtacttggaaaaagaagaaaatcag aagatatttcttcttgcagctagaaagcggaaaaagagcaaaacagccaactaccttatc tccattgatccagttgatttatctcgtgaaggagaaagttatgtcggcaagcttagatcc aacctcatggggaccaagtttacagtttatgaccgtggcatctgccccatgaagggccgg ggtttggtaggagcggcccacacccggcaggagctggctgccatctcctatgaaacaaac gtacttggatttaaaggtcctaggaaaatgtctgtgatcattcctggaatgacactgaat cataagcagatcccctatcagccacaaaacaaccatgacagtttgctctcaaggtggcag aacagaactatggaaaatctggttgagctgcacaacaaggcccccgtctggaacagtgac actcagtcctatgtcctcaacttccgtggccgggtcactcaggcgtctgtgaagaacttc cagatagtccacaaaaatgaccctgattatatagtcatgcagtttggacgtgtggcagat gacgtgttcacactggattacaactacccactttgtgcagtacaggcctttggcatcggt ctttctagctttgacatggcggttttacagatgcccaaggggcaggagagaaaaacggaa agagctaaagccggggagggcaagcgagggggcactggcagcggatctgggaatggagcc cgcgagttcctgcttcctggttccccggagcaagtgtcctttgttctccataatgaaatg gacttcagatccgaagatcctaatctactctttgctgtgaagagcacctgccttctggaa gtgagaaatgacagcatcctgatgacaagacaggttcccatccctcggcaggtgcagcgg gagtatggaacggaagccctggttcctgtcctcatggaacttgcagggaaatggactgtg aagctagcagtaacatactgcaagacgaaagaacgtgcacaagcagggtctgtggcagtg cctgagccatggcccagtccagaggctcgagtcttcattcttccccctgggggtgcagat gcttttccagagggcagggcgtgcaccgaggccccaccccagatgcccgtgctgcttgga cacctggtttgggctgaccctcgggttctgacaggaactttttcatcttccgaaactgaa acgctgtacccgttaaacagtccaacgagcgctcctccaagcggagccttggagggcacg gccggcaccattacctccaacgagtggagctctcccacctcccctgaggggagcaccgcc tctgggggcagtcaggcactggacaagcccatcgacaatgacgcagagggcgtgtggagc ccggatattgagcagagtttccaggaggccctcgccatctacccgccctgtggcaggcgc aaaatcatcctgtcggacgagggcaagatgtatggtgggccatcctattag >gi568815586f:2917431_3140475|GENSCAN_predicted_peptide_2|508_aa MPAAACRQDKKSGRFYSFTEAEGTVAVTQGKAEGRAAPWSLPSVKKAQEKVWGEERRPPA LGRNELIARYIKLRTGKTRTRKQVGLKRRVGVPGVVPQHQSAPKVSSHIQVLARRKAREI QAKLKDQAAKDKALQSMAAMSSAQIISATAFHSSMALARGPGRPAVSGFWQGALPGQAGT SHDVKPFSQQTYAVQPPLPLPGFESPAGPAPSPSAPPAPPWQGRSVASSKLWMLEFSAFL EQQQDPDTYNKHLFVHIGQSSPSYSDPYLEAVDIRQIYDKFPEKKGGLKDLFERGPSNAF FLVKFWCMTSVLRVMTAGVPATVAIFQPSKEASLAPSILANVKWYLIVVFICISLMTDDE PRKSSRGHLSLGPADLNTNIEDEGSSFYGVSSQYESPENMIITCSTKVCSFGKQVVEKVE TEYARYENGHYSYRIHRSPLCEYMINFIHKLKHLPEKYMMNSVLENFTILQVVTNRDTQE TLLCIAYVFEVSASEHGAQHHIYRLVKE >gi568815586f:2917431_3140475|GENSCAN_predicted_CDS_2|1527_bp atgcccgctgctgcctgcagacaggacaagaagtctggacgcttctactcgtttacagag gcggagggcacggtggcagtcacacagggcaaggccgagggccgggcagctccctggagc ctcccaagtgtgaagaaggcccaggagaaagtgtggggagaggagaggaggccgccagcg ctgggtcggaacgagctgattgcccgctacatcaagctccggacagggaagacccgcacc aggaagcaggtgggcctcaagagacgggtaggggtcccgggggtggtaccccagcaccag tctgctcccaaggtctccagccacatccaggtgctggctcgtcgcaaagctcgcgagatc caggccaagctaaaggaccaggcagctaaggacaaggccctgcagagcatggctgccatg tcgtctgcacagatcatctccgccacggccttccacagtagcatggccctcgcccggggc cccggccgcccagcagtctcagggttttggcaaggagctttgccaggccaagccggaacg tcccatgatgtgaagcctttctctcagcaaacctatgctgtccagcctccgctgcctctg ccagggtttgagtctcctgcagggcccgccccatcgccctctgcgcccccggcaccccca tggcagggccgcagcgtggccagctccaagctctggatgttggagttctctgccttcctg gagcagcagcaggacccggacacgtacaacaagcacctgttcgtgcacattggccagtcc agcccaagctacagcgacccctacctcgaagccgtggacatccgccaaatctatgacaaa ttcccggagaaaaagggtggactcaaggatctcttcgaacggggaccctccaatgccttt tttcttgtgaagttctggtgtatgacttctgtactcagggtcatgactgctggagttcca gccactgtagccatcttccaaccatcaaaagaagccagtttagctccctccatcctagca aatgtgaagtggtatctcattgtggtttttatctgcatttccctgatgactgatgatgag ccccgcaagtccagcagaggccacctgtctctggggccggcagacctcaacaccaacatc gaggatgaaggcagctccttctatggggtctccagccagtatgagagccccgagaacatg atcatcacctgctccacgaaggtctgctctttcggcaagcaggtggtggagaaagttgag acagagtatgctcgctatgagaatggacactactcttaccgcatccaccggtccccgctc tgtgagtacatgatcaacttcatccacaagctcaagcacctccctgagaagtacatgatg aacagcgtgctggagaacttcaccatcctgcaggtggtcaccaacagagacacacaggag accttgctgtgcattgcctatgtctttgaggtgtcagccagtgagcacggggctcagcac cacatctacaggctggtgaaagaatga >gi568815586f:2917431_3140475|GENSCAN_predicted_peptide_3|831_aa MFLKNCEDFGKDTIPGNKHLRAYPTRVQGQDLSCKFPEYGTSRKPSSSITAHITASIPCT PATVRPEAFISIRGFSHFIPTTALGGRWDCPQFRVKKMRLEKVERVYPVKGGEEMQHSKE PLSKNKYHTGTGPNDLEGSALEIRCGYNGPLTNPEASTVQTDIPVSCLQVPGAKRKLSVW HFGSPPRGKEAGSPAFNLPVNRNQEPPRAEDESRKQQTTTKPECVQGRPQRYLPKRTENI WPHKSMAHELHSHIMSKPEKVVITHKSVNDEWVPGNEKTGKAPVNGKPRPLLFRRNKETM HKRYLGEMLEMSLPLRMPVRPQKSHAMCPTIRVSIPKASGPKSFPREDNVYDSGQRQDSE CFPCSRFNLNIRTQLTAASLGKPHTGSKDVATVLLKNNPQRPGSPLSILREGPSLSLRGH PMCQTTETHLDKQWVRGREGTSQRQKAGPGLEAKTQAYQLIPPQKPPPAEPERTLAVGHP TCRFVGEENGLKKPIDLPKFAQQLQSRKASWRRKYLSHPREKDSCRSEGHAHIMRDWVSE LRVTRRPGKTAEVQVLVLGGRGHLLGRLAAIVAKQVLLGQKVVVVRCEGINISGNFYRNM LHVKAPGLPPQADELQPFLRPLPRPGPQRIFLRAVRGMLPPRTKRGQAALDRLKVSDGIP PPYDEKNRMVVPAALKIVRLRPTRKFAYLGRLAHEVGWKYQAVTATLEEKRKEKAKIHYR KKKQLMRLRKQAEKNMEKKTDKYTGVLKTHRPWSEPNKDCSFLMLGLACPSSIITLECAG PGGSSSPGWLLVTDTYQAPRGPQTLELPQSSRSSRGWLLYRLREAFKDHPL >gi568815586f:2917431_3140475|GENSCAN_predicted_CDS_3|2496_bp atgtttttaaaaaattgcgaggactttgggaaggacacaatacctggtaacaaacacctg agggcctatccgacgagggtccaaggacaggacctgtcctgcaagttcccagagtatgga acctcccggaagccaagctccagcatcacagcacacatcacagccagcattccctgcaca cctgccacggtcaggccagaagctttcatcagcattagaggattttctcacttcataccc acaacagccctaggaggtagatgggactgcccccagtttcgggtgaagaaaatgaggctg gagaaggtagagcgtgtgtaccccgtcaaaggaggcgaggagatgcagcacagcaaagag ccgctcagcaaaaacaaataccacacgggcactggccccaacgaccttgaaggttctgcc cttgaaatccgatgcggctacaacggacccctgaccaaccctgaagcatccactgtccaa acagacatccctgtttcctgcctccaagtgcctggggcaaaacgcaagctctctgtttgg cattttggcagccctcctaggggcaaggaggcggggtcacctgctttcaacctgccagta aatagaaaccaagagccacctcgggctgaggatgagtccaggaaacaacaaacaaccacc aagccagaatgcgtccaagggcggccacagaggtatttgcccaagagaactgaaaacata tggccacacaaaagcatggcacatgaacttcacagccacattatgtctaagccagaaaag gtggtaataactcacaagtcagtcaatgacgaatgggtgccaggcaatgagaaaacaggg aaagcaccagtgaatggaaaaccaaggcctctgctcttcagaaggaataaagagacaatg cacaagagatacttaggagagatgttagaaatgtccctgcctctaagaatgccagtgagg ccgcagaagtcccatgccatgtgtcccacaattcgtgtcagcatcccaaaggccagcggc cccaaaagctttcccagagaagacaatgtgtatgattcaggacaacgccaggactctgaa tgctttccgtgttccaggttcaatctgaacatcagaacccagttgactgcggcctctctg ggaaagccacacactggttccaaagatgttgctactgtgctcctgaagaacaatcctcag cgcccagggagtccactgagcatcctcagggaaggaccatccttgtccttacggggacat ccaatgtgccaaacaactgagacccatttggataagcagtgggtcagaggccgggaaggc actagtcagaggcagaaggcaggcccaggactggaagccaaaacccaggcataccaactc atcccacctcagaaaccacctccagcagagccagaaagaaccttggcagtgggtcaccca acctgccgctttgtaggtgaggaaaatggcctgaagaagccaatcgacttgcccaagttc gcacagcagttacagtccaggaaagcttcctggaggaggaaatacctgagtcaccccagg gaaaaggacagctgcaggtcagaagggcacgcgcacataatgagagactgggtctctgaa ctaagagttaccaggcgaccggggaagacggcggaggtgcaggtcctggtgctcggtggt cgaggccatctcctgggccgcctggcggccatcgtggctaagcaggtactgctgggccag aaggtggtggtcgtacgctgcgagggcatcaacatttctggcaatttctacagaaacatg ttgcatgttaaagcacctggccttcctccacaagcggatgaactccaacccttcctgagg cccctgccacgtccgggcccccaacgcatcttcctgcgggcggtgcgaggcatgctgccc cccaggaccaagcgaggccaggccgccctggaccgcctcaaggtgtctgacggcatccca ccgccctacgacgagaaaaaccggatggtggttcctgctgccctcaagatcgtgcgtctg aggcctacaagaaagtttgcctatctggggcgcctggctcacgaggttggctggaagtac caggcagtgacagccaccctggaggagaagaggaaggagaaggccaagatccactaccgg aagaagaaacagctcatgaggctacggaaacaggccgaaaagaacatggagaagaaaact gacaaatacacaggggtcctcaagacccacagaccctggtctgagcccaataaagactgt tcattcctcatgcttggcctggcctgcccttcctccatcatcaccctggaatgtgcggga cccgggggcagcagcagtccggggtggttactcgtcacggacacctaccaggcaccaaga ggacctcaaacgctggagctcccacagtcgtcaaggtcttcacgtggctggctgctgtat cgcctcagagaggccttcaaggaccacccactctga